wlxiong/poly-ir-toolkit

Document map needs to store document numbers for WARC files (used by ClueWeb).

Opened this issue · 0 comments

Or, properly skip decoding them.

Original issue reported on code.google.com by Roman.Kh...@gmail.com on 6 Feb 2011 at 4:24