bubibubi/ExtractTablesFromPdf

Implementation using PdfPig

BobLd opened this issue · 0 comments

BobLd commented

Hi,

Not an issue but an idea: PdfPig is a C# library (port of PdfBox, Apache-2.0) where several document analysis tools were added (see the wiki: word extraction, line and text block extraction, reading order, etc.), handle graphic state and fonts but is still missing a table extraction tool. Do you think you could implement it into the library? We can chat on the gitter if you're interested.

Example of current tools:
rxyc example

Let me know what you think, happy to help