Pinned Repositories
ArchiveBot
ArchiveBot, an IRC bot for archiving websites
grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
IA.BAK
We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.
NewsGrabber
Grabbing all news.
parler-grab
Archiving Parler.
seesaw-kit
Making a reusable toolkit for writing seesaw scripts
terroroftinytown
URLTeam's second generation of URL shortener archiving tools
Ubuntu-Warrior
Scripts to build and boot warrior virtual machine containing Docker
warrior-dockerfile
A Dockerfile for the ArchiveTeam Warrior
wpull
Wget-compatible web downloader and crawler.
Archive Team's Repositories
ArchiveTeam/grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
ArchiveTeam/wpull
Wget-compatible web downloader and crawler.
ArchiveTeam/ArchiveBot
ArchiveBot, an IRC bot for archiving websites
ArchiveTeam/warrior-dockerfile
A Dockerfile for the ArchiveTeam Warrior
ArchiveTeam/wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
ArchiveTeam/imgur-grab
Archiving imgur.
ArchiveTeam/reddit-grab
Grabbing everything from reddit.
ArchiveTeam/terroroftinytown-client-grab
The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project
ArchiveTeam/ludios_wpull
wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved
ArchiveTeam/youtube-grab
Archiving all metadata from YouTube (everything except videos themselves due to size)
ArchiveTeam/urls-grab
Archiving URLs (outlinks) from a variety of sources.
ArchiveTeam/telegram-grab
Archiving public telegram messages.
ArchiveTeam/warrior4-vm
Warrior virtual machine appliance (version 4)
ArchiveTeam/pastebin-grab
Archiving pastebin
ArchiveTeam/subscene-grab
Archiving Subscene.
ArchiveTeam/mediafire-grab
Archiving mediafire.com URLs.
ArchiveTeam/grab-base-df
Base Dockerfile for warrior project grab scripts
ArchiveTeam/urls-sources
Sources for urls-grab.
ArchiveTeam/roblox-marketplace-comments-grab
Archiving comments from the Roblox Marketplace
ArchiveTeam/subscene-items
Managing items for subscene-grab.
ArchiveTeam/urls-tor-grab
Archiving some .onion URLs.
ArchiveTeam/deviantart-grab
Archiving part of DeviantArt.
ArchiveTeam/deviantart-items
Managing items for deviantart-grab.
ArchiveTeam/postnews-grab
Archiving post.news.
ArchiveTeam/postnews-items
Managing items for postnews-grab.
ArchiveTeam/roblox-marketplace-comments-items
Managing items for roblox-marketplace-comments-grab.
ArchiveTeam/taringa-grab
Archiving taringa.net.
ArchiveTeam/taringa-items
Managing items for taringa-grab.
ArchiveTeam/vbox7-grab
Archiving vbox7.
ArchiveTeam/vbox7-items
Managing items for vbox7-grab.