Issues
Apache httpclient 3.1 sonatype
#95 opened by DEBARYYA - 1
Consider syncing up from the Common Crawl fork
#94 opened by anjackson - 0
Update TravisCI config
#82 opened by ruebot - 3
Fixing bad dates in WARC file
#80 opened by cjer - 1
commons-httpclient-3.1 vulnerability
#78 opened by ldko - 0
upgrade to commons-collections.jar 3.2.2
#76 opened by ndushay - 1
support WET files
#66 opened by dportabella - 1
HTTPS via a Proxy
#64 opened by PsypherPunk - 0
Non-ascii mimetypes
#60 opened - 0
dns records in ARCs
#59 opened - 1
urls with spaces unescaped
#58 opened - 2
Reorganize into mother and child pom
#55 opened by johnerikhalse - 0
Require Java 8
#56 opened by johnerikhalse - 2
WAT extractor: WARC-Filename in the WAT warcinfo record should be the WAT filename itself
#42 opened by saraaubry - 0
WAT extractor: WARC-Date in all records should be the WAT record generation date
#43 opened by saraaubry - 1
WAT extractor: missing WARC format version
#45 opened by saraaubry - 0
WAT extractor: Entity-Trailing-Slop-Bytes should be called Entity-Trailing-Slop-Length
#48 opened by ldko - 8
Minor issues with the POM
#3 opened by anjackson - 4
Error-prone HTTP-header parsing in ARCRecord
#41 opened by tokee - 3
Ensure uncompressed ARCs are properly supported
#13 opened by anjackson - 2
Use of GPL v2 code
#20 opened by kris-sigur - 6
Require Java 7
#28 opened by johnerikhalse - 1
Tests fail on Windows
#2 opened by anjackson - 0
Remove the "jar-with-dependencies" artifact
#29 opened by johnerikhalse - 3
fastutil conflicts in dependencies
#23 opened by lintool - 1
Cope with slightly malformed GZip headers
#9 opened by anjackson