Issues
- 2
ClueWeb09 WARC files faile to parse
#26 opened by sebastian-nagel - 3
- 3
- 1
UncheckedIOException, unexpected end of gzip
#85 opened by gleporeNARA - 1
IllegalArgumentException on ARC Parsing
#83 opened by gleporeNARA - 7
- 1
Custom records awkward to register due to package private constructors in the default records
#81 opened by vlofgren - 3
- 1
Multithreading issue on GzipChannel write header
#79 opened by creyer - 2
- 1
- 0
CDX indexer: support revisit records
#71 opened by ato - 1
- 0
CDX indexer: CDXJ output support
#72 opened by ato - 3
disable serviceworker in replay proxy mode
#69 opened by sberequek - 1
- 1
- 1
Raw header access
#42 opened by ato - 2
- 1
ARC parser infinite loop reading body
#62 opened by sebastian-nagel - 3
Native OSX / Linux binaries do not work
#61 opened by ikreymer - 1
wget quirk: Content-Length off by one
#29 opened by ato - 1
RecordBuilder: Date/Timestamp truncated if .date(..) is called before .version(WARC_1_1)
#58 opened by lambdaupb - 1
- 4
- 0
GunzipChannel fails on payload with uncompressed size exceeding int_max
#54 opened by sebastian-nagel - 1
Gzip compression
#53 opened by alex73 - 3
- 1
Utility methods to read payload body
#48 opened by sebastian-nagel - 1
- 0
ByteBuffer inflate and deflate support
#45 opened by ato - 1
- 0
- 0
- 2
Add lenient HttpParser
#25 opened by sebastian-nagel - 0
- 0
Chunked body parser may read over end of chunk if destination buffer has higher capacity
#34 opened by sebastian-nagel - 0
GunzipChannel input position is off by 2 if gzip extra field is present
#31 opened by sebastian-nagel - 0
WARC 1.0 quirk: angle brackets around WARC-Target-URI
#30 opened by ato - 4
- 3
- 0
- 0
- 0
IoException reading gzip extra
#14 opened by sebastian-nagel - 4
Should we include a Dockerfile?
#10 opened by ibnesayeed - 0
- 1
Add build instructions
#13 opened by machawk1 - 3
Rudimentary Memento support on replay
#11 opened by machawk1 - 0
Typo in url_byte in http and warc grammars
#9 opened by ato - 0
Run an external command on each record
#8 opened by ato