Download/Scraping utilities

  • Rclone: A command line program to sync files and directories to and from various cloud storage providers
  • Youtube-DL: A command-line program to download videos from YouTube and a few hundred more sites
  • annie: Youtube-DL alternative writtent in Golang
  • wikiteam: set of tools for archiving wikis
  • FicSave: online fanfiction downloader
  • yt-mango: Youtube metadata archiver
  • Youtube-MA: Youtube metadata archiver
  • CrowLeer: Powerful C++ web crawler based on libcurl
  • floatplane_ripper: Script to rip all videos from https://floatplane.rip/
  • grab-site: The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
  • dzi-dl: Deep Zoom Image Downloader
  • iiif-dl: Command-line tile downloader/assembler for IIIF endpoints/manifests
  • ChanThreadWatch: Saves threads from *chan-style boards and checks for updates until the thread dies
  • Sonarr: PVR for Usenet and BitTorrent users
  • Radarr: A fork of Sonarr to work with movies à la Couchpotato
  • Sick-Beard: PVR for newsgroup users (with limited torrent support)
  • Lidarr: Music collection manager for Usenet and BitTorrent users
  • Mylar: An automated Comic Book downloader (cbr/cbz) for use with SABnzbd, NZBGet and torrents

Compression

  • KGB Archiver: compression tool with unbelievable high compression rate
  • peazip: File archiver utility

Network

  • NetLimiter: Internet traffic control and monitoring tool for Windows

File systems

File conversion

  • AAXtoMP3: convert AAX files to common MP3, M4A, M4B, flac and ogg formats through a basic bash script frontend to FFMPEG

Utility Scripts

Content sharing

  • opds: Easy to use, Open & Decentralized Content Distribution
  • ipfs: Protocol and network designed to create a content-addressable, peer-to-peer method of storing and sharing hypermedia in a distributed file system
  • h5ai: HTTP web server index for Apache httpd, lighttpd, nginx and Cherokee

Data curation

  • DeepSort: AI powered image tagger backed by DeepDetect
  • diskover: File system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
  • fucking-weeb: A library manager for animu (and TV shows, and whatever).
  • Everything: Locate files and folders by name instantly (Windows)
  • beets: music library manager and MusicBrainz tagger
  • Calibre: Ebook manager

APIs & Online tools

  • thetvdb: TV shows metadata (used by plex)
  • iqdb: Multi-service reverse image search