/PDFToExcel

This is the companion repository of the article https://tomassetti.me/how-to-convert-a-pdf-to-excel/

Primary LanguagePythonApache License 2.0Apache-2.0

How to Convert a PDF to Excel

This is the companion repository of the article How to Convert a PDF to Excel. The article explains the whole process of detecting tables from PDF files to extract them into Excel files. It discusses the differences between textual PDFs and PDFs containing images and the results with both of them.

This repository also contains some example PDFs used in the article.