documentcloud/docsplit

Horizontal / table formatted text


nofxx commented

I have some tables inside PDFs that I really need to parse (the alternative is about 100 hours of monkey work).
Parsing them is impossible without passing the -layout option to the PDF parser.
The patch in #114 introduces a 'pdf_opts' param and works as expected.
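
For anyone who wants to see the difference outside docsplit, here is a rough sketch of the underlying call. It assumes the PDF parser is the `pdftotext` command-line tool and that it is on the PATH; the function names `build_pdftotext_command` and `extract_text` are made up for illustration and are not part of docsplit or of the patch in #114.

```python
import subprocess


def build_pdftotext_command(pdf_path, txt_path, layout=True):
    """Build the argument list for a pdftotext call (hypothetical helper)."""
    command = ["pdftotext"]
    if layout:
        # -layout asks pdftotext to keep the physical layout of the page,
        # so table columns stay aligned instead of being flattened.
        command.append("-layout")
    command += [pdf_path, txt_path]
    return command


def extract_text(pdf_path, txt_path, layout=True):
    """Run pdftotext; assumes the binary is installed and on the PATH."""
    subprocess.run(build_pdftotext_command(pdf_path, txt_path, layout), check=True)
```

Calling `extract_text("report.pdf", "report.txt")` with and without `layout=True` on a PDF that contains a table should show why the option matters for horizontal text.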

Just found this one too: #132