/pdf-to-excel

This Python script converts the text in PDF files into an Excel spreadsheet. It uses the pdf2image library to render each PDF page as an image, then runs Optical Character Recognition (OCR) on those images with the pytesseract library to recover the text.
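The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names `text_to_rows` and `pdf_to_excel`, the use of openpyxl to write the workbook, the 300 DPI default, and the rule that cells are separated by tabs or runs of two or more spaces are all assumptions made for the example.

```python
import re


def text_to_rows(text):
    """Split OCR text into rows of cells.

    Assumption: each non-empty line is one row, and cells are separated
    by a tab or by a run of two or more whitespace characters.
    """
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            rows.append(re.split(r"\s{2,}|\t", line))
    return rows


def pdf_to_excel(pdf_path, xlsx_path, dpi=300):
    """Render each PDF page to an image, OCR it, and write rows to a workbook."""
    # Third-party imports are local so text_to_rows works without them.
    from pdf2image import convert_from_path  # needs Poppler installed
    import pytesseract  # needs the Tesseract binary installed
    from openpyxl import Workbook  # assumed Excel writer

    workbook = Workbook()
    sheet = workbook.active
    for page_image in convert_from_path(pdf_path, dpi=dpi):
        page_text = pytesseract.image_to_string(page_image)
        for row in text_to_rows(page_text):
            sheet.append(row)
    workbook.save(xlsx_path)
```

A call such as `pdf_to_excel("input.pdf", "output.xlsx")` (hypothetical file names) would write one spreadsheet row per recognized line. OCR output rarely preserves table columns reliably, so the cell-splitting rule will likely need tuning for real documents.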

Primary language: Python. License: MIT.
