allenai/mmda

Parsing fails on pages without chars


If a page has no chars (for example, a book cover), the script throws an exception during parsing and in the processing that follows. The cause is an array index out of bounds: doc.pages does not include pages with empty chars, so it has fewer entries than the document has pages.
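
To illustrate the failure mode described above, here is a minimal sketch in plain Python. It does not use the mmda API: the function names, the `(page_index, char)` tuples, and `num_pages` are all hypothetical stand-ins. It shows how grouping chars by page silently drops empty pages, and one possible guard, assuming the total page count is known up front.

```python
from collections import defaultdict


def group_chars_dropping_empty(chars):
    """Hypothetical stand-in for the reported behaviour:
    a page with no chars never gets an entry in the result."""
    by_page = defaultdict(list)
    for page_index, char in chars:
        by_page[page_index].append(char)
    # Only pages that contained at least one char survive.
    return [by_page[i] for i in sorted(by_page)]


def group_chars_keeping_empty(chars, num_pages):
    """Possible guard (an assumption, not the library's fix):
    pre-allocate one entry per page so empty pages are kept."""
    pages = [[] for _ in range(num_pages)]
    for page_index, char in chars:
        pages[page_index].append(char)
    return pages


# Page 0 is a cover with no chars; pages 1 and 2 have text.
chars = [(1, "H"), (1, "i"), (2, "!")]
num_pages = 3

dropped = group_chars_dropping_empty(chars)
kept = group_chars_keeping_empty(chars, num_pages)

print(len(dropped))  # fewer entries than pages, so dropped[2] is out of range
print(len(kept))     # one entry per page, including the empty cover
```

With the first function, any later step that indexes the result by page number (for example `dropped[num_pages - 1]`) raises `IndexError`, which matches the out-of-bounds error reported here. Keeping an empty entry for each char-less page keeps page indices aligned with the source document.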