web-archive
There are 39 repositories under web-archive topic.
dosyago/dn
💾 dn - offline full-text search and archiving for your Chromium-based browser.
webrecorder/replayweb.page
Serverless replay of web archives directly in the browser
Ray-D-Song/web-archive
Free web archiving and sharing service based on Cloudflare. 基于 Cloudflare 的免费网页归档和分享工具。
webrecorder/browsertrix
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
devanshbatham/ArchiveFuzz
Hunt down the secrets from the WebArchives for Fun and Profit
internetarchive/cdx-summary
Summarize web archive capture index (CDX) files.
Own-Data-Privateer/hoardy-web
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
TarekJor/bookmark-archiver
🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...
webis-de/archive-query-log
📜 The Archive Query Log.
ShaunLWM/ark
🚢 A self-hosted, personal archival application
antiufo/Shaman.Dokan.Warc
Mounts WARC files on Windows
YGGverse/YGGo
YGGo! Distributed Web Search Engine
oduwsdl/MementoMap
A Tool to Summarize Web Archive Holdings
ghobs91/Chronicl
Decentralized web archiver that distributes archives across Nostr relays
swve/gitstorykit
Build rich git projects history discovery apps with ease, used by Gitstory
bottomless-archive-project/java-warc
Read Web ARChive (WARC) files in Java.
minch-dev/DownTheMoon
A continuation of legacy XUL version of DownThemAll! ✔️preserves web.archive.org timestamps, ✔️advanced filters for remote directory tree mirroring, ✔️UI is tweaked for better UX
ysdn-info/ysdn.info
An archive of the York/Sheridan Design Program
ArtificialOSS/WebCrawl
Crawls the web to generate a huge dataset for training
ibnesayeed/utils
Miscellaneous utility scripts
india-ultimate/the-huddle
A mirror of The Huddle magazine
laxika/java-warc
Read Web ARChive (WARC) files in Java.
grey-land/warc-browser
a cli toolkit for working with web archives
q-m/replayweb.page-docker
Docker image for ReplayWeb.page
thiagolopes/alexandria
Backup and save websites
wdhdev/web-archiver
Easily scrape, download and preview websites.
AndreMor8/wubbzy-sites
Wubbzy archived sites
paulmelnikow/wabac
A versioned cache backed by cloud storage
shadowctrl/Palaceradio
PalaceRadio | A Next.js app Built from web Archive | Freelance Project @upwork
wayback-if-down/wayback-if-down.github.io
Redirect to a live website or an archived version if it's down.
jskherman/web-clips
An archive site of some webpages on the Internet created with the help of the SingleFile extension.
meadowingc/waybacker
Periodically crawl a set of websites and ensure that all of their pages are archived on the Wayback Machine. Mirror of https://codeberg.org/meadowingc/waybacker
s5-dev/archiver
Tool to archive websites and other content available on the Internet on the content-addressed S5 Network
jskherman/SingleFile-Archives
Pages saved with the SingleFile browser extension.
KaineRecycler/YouTube-Content-Archive
YouTube Content Archive Database
shadowctrl/Farsky
Farsky | A Next.js app Built from web Archive | Freelance Project @upwork