mdSHash/pdf-to-excel
This Python script converts text from PDF files into an Excel spreadsheet format. It utilizes the pdf2image library to extract images from PDF pages and the pytesseract library for Optical Character Recognition (OCR) to recognize text from the images.
PythonMIT
Stargazers
No one’s star this repository yet.