/pdftojson

using XPDF, pdftojson extracts text from PDF files as JSON, including word bounding boxes.

Primary LanguageC++GNU General Public License v2.0GPL-2.0

Stargazers