Pinned Repositories
emscripten
Emscripten: An LLVM-to-JavaScript Compiler
friendster-group-id-lists
These lists contain the group ids of Friendster groups with at least 100 members.
friendster-scrape
Friendster archiving
heroku-buildpack-phantomjs-wget
Heroku buildpack with phantomjs and a recent wget
megawarc
Nondestructive warc-in-tar to warc conversion
warc-proxy
Serving content from a WARC
warctozip
Convert a warc to a zip with Hanzo warc-tools and warctozip.py
warctozip-service
An HTTP-based warc-to-zip converter
wget-lua
Wget with Lua extension
wget-warc
This is an old version of the WARC patches. Wget v1.14 and higher has WARC support.
alard's Repositories
alard/warc-proxy
Serving content from a WARC
alard/megawarc
Nondestructive warc-in-tar to warc conversion
alard/wget-lua
Wget with Lua extension
alard/warctozip-service
An HTTP-based warc-to-zip converter
alard/warctozip
Convert a warc to a zip with Hanzo warc-tools and warctozip.py
alard/wget-warc
This is an old version of the WARC patches. Wget v1.14 and higher has WARC support.
alard/friendster-group-id-lists
These lists contain the group ids of Friendster groups with at least 100 members.
alard/friendster-scrape
Friendster archiving
alard/heroku-buildpack-phantomjs-wget
Heroku buildpack with phantomjs and a recent wget
alard/emscripten
Emscripten: An LLVM-to-JavaScript Compiler
alard/friendster-graph
Scraping the Friendster social graph
alard/friendster-scrape-queue-util
Some helper scripts to run a number of bff.sh downloaders from a single queue
alard/heritrix-utils
Useful classes for the Heritrix crawler
alard/CDX-Writer
Python script to create CDX index files of WARC data
alard/seesaw-kit
Making a reusable toolkit for writing seesaw scripts
alard/tinyback
A tiny web scraper
alard/yt-dlp
A youtube-dl fork with additional features and fixes