/webarchive-indexing

Tools for bulk indexing of WARC/ARC files on Hadoop, EMR, or a local file system.

Primary language: Python
License: MIT
