This code extracts text from a PDF file using OCR, cleans it, and writes it to an Excel spreadsheet. It uses fitz, io, ocrmypdf, and pandas libraries to achieve this task.
Primary LanguagePython
No issues in this repository yet.