Table descriptions above tables
Opened this issue · 2 comments
Evildoor commented
PDF Analyzer's table processing algorithm includes detection of table description and separation of table lines from all other lines. These procedures work on assumption that table description is positioned below the table:
However, some documents can position descriptions above tables or even mix both kinds of positioning. PDF Analyzer either fails to extract such tables or extracts them incorrectly.
Document examples: CDS_CERN-ATL-COM-PHYS-2016-135, page 13.
Evildoor commented
Note: it seems that term "caption" rather than "description" or "header" is often used.