huridocs/pdf-table-of-contents-extractor

This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of identifying and structuring the document's TOC.

PythonApache-2.0

Stargazers

ali6parmak
alphamgmt
BobLd
London
jallspaw
Adaptive Capacity Labs, LLC
rnjailamba
Mexico City