archivebox

There are 33 repositories under archivebox topic.

  • ArchiveBox

    ArchiveBox/ArchiveBox

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    Language:Python25.5k1751k1.4k
  • good-karma-kit

    ArchiveBox/good-karma-kit

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

  • ArchiveBox/archivebox-browser-extension

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    Language:JavaScript37483437
  • ArchiveBox/electron-archivebox

    Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

    Language:JavaScript1786615
  • ArchiveBox/abx-dl

    ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

    Language:JavaScript87514
  • pirate/internet-archiving-talk

    🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

    Language:JavaScript59405
  • ArchiveBox/docker-archivebox

    Home of the official docker image for ArchiveBox

  • ArchiveBox/readability-extractor

    Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

    Language:JavaScript403214
  • ArchiveBox/pocket-exporter

    [FREE] A service to help export your pocket bookmarks, tags, saved article text, and more...

    Language:TypeScript310110
  • ArchiveBox/archivebox-proxy

    Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

    Language:Python30001
  • ArchiveBox/homebrew-archivebox

    Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

    Language:Ruby28103
  • pragmar/mcp-server-webcrawl

    MCP server tailored to connecting web crawler data and archives

    Language:HTML26216
  • dbeley/reddit_export_userdata

    Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.

    Language:Python22301
  • ArchiveBox/abx-spec-behaviors

    🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

    Language:JavaScript19130
  • DigestBox

    ArchiveBox/DigestBox

    DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

    Language:HTML19210
  • YunoHost-Apps/archivebox_ynh

    Self-hosted internet archiving solution to collect, save, and view sites you want to preserve offline, for YunoHost.

    Language:Shell192146
  • muna

    uriel1998/muna

    Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from ArchiveBox.

    Language:Shell18200
  • ArchiveBox/debian-archivebox

    Home of the official apt/deb package for Ubuntu/Debian-based systems.

    Language:Python17225
  • ArchiveBox/docs

    Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

    Language:CSS17127
  • dbeley/archiveboxmatic

    ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.

    Language:Python14303
  • ArchiveBox/pip-archivebox

    Official Python package for ArchiveBox, the self-hosted internet archiving solution.

  • Gertje823/ArchiveboxTelegramBot

    A simple Telegram bot to archive urls in Archivebox

    Language:Python12103
  • gjedeer/archivebox-index-generator

    View your ArchiveBox index without the bloat, just a web browser

    Language:Python621
  • ArchiveBox/community

    A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

  • thomaspaulin/archive-box-bridge

    Call me via reverse proxy to bridge Archive Box and the outside world.

    Language:Go5100
  • sij-ai/citis

    A Django-based SaaS for creating permanent, citable archives of web content. Perfect for researchers, journalists, and anyone who needs reliable citations that won't break. Mirrored from https://sij.ai/sij/citis

    Language:Python30
  • ta-dow/archivebox

    Re-design mockup for ArchiveBox

    Language:HTML3200
  • TheLovinator1/FeedVault.se

    FeedVault is an open-source web application that allows users to archive and search their favorite web feeds.

    Language:Python2050
  • abdurraheemali/HTU_to_ArchiveBox

    Colab script to set up archivebox and initialize index from history trends url

    Language:Jupyter Notebook0100
  • brunocek/archivebox-proxy

    A proxy that saves navigated URLs on ArchiveBox implemented as a script to mitmproxy.

  • SexyWerewolf/schedule_archivebox_links

    Auto Snapshot Schedule For AcrchiveBox

    Language:Shell0100
  • dotWee/docker-archivebox

    Home of the official docker image for ArchiveBox

    Language:Dockerfile10
  • harmonify/shiori-watcher

    Web Archiver meets Bookmark Manager: Watch the magic unfold as ArchiveBox and Go-Shiori join forces! Save, organize, and cherish your bookmarks effortlessly with this dynamic duo. 📚✨

    Language:Python10