- Converts PDF to Excel
- Compatible with Python 3
- Very small in size (counting only the .py files)
It works in 3 steps:
Step 1 - Find the table in the PDF and convert it to a DataFrame
Step 2 - Clean the DataFrame according to the pipe and filter settings
Step 3 - Format the DataFrame and convert it to Excel
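The sketch below illustrates what Steps 2 and 3 could look like with pandas. It is not the code in pdftoexcel.py: the function names (`drop_empty`, `strip_text`, `clean`, `save`) and the sample table are hypothetical, and the hand-built DataFrame only stands in for whatever Step 1 extracts from a PDF.

```python
import pandas as pd


def drop_empty(df):
    """Filter: remove rows and columns that are entirely empty."""
    return df.dropna(how="all").dropna(axis=1, how="all")


def strip_text(df):
    """Filter: trim surrounding whitespace from every string cell."""
    return df.apply(
        lambda col: col.map(lambda v: v.strip() if isinstance(v, str) else v)
    )


def clean(df):
    """Step 2 (assumed shape): pass the table through each filter in turn."""
    return df.pipe(drop_empty).pipe(strip_text).reset_index(drop=True)


def save(df, path):
    """Step 3 (assumed shape): write the table to an Excel file.

    pandas needs an Excel writer such as openpyxl installed for this call.
    """
    df.to_excel(path, index=False)


# Stand-in for Step 1: a table as it might come out of a PDF.
raw = pd.DataFrame(
    {
        "Item": [" Pen ", None, "Ink"],
        "Qty": ["2", None, " 5"],
        "Note": [None, None, None],
    }
)
cleaned = clean(raw)
print(cleaned)
```

Chaining small functions with `DataFrame.pipe` keeps each cleaning rule separate, so a rule can be added or removed without touching the others.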
-
You will need Python 3 installed; the latest version will do.
-
Clone or download the files in this repo and extract them.
-
In cmd/terminal, run:
pip install -r requirments.txt
-
Done!
-
In cmd/terminal/PowerShell, enter:
python pdftoexcel.py -i filetoconvert.pdf
-
Done!
PS - run python pdftoexcel.py -h
to see all available command-line arguments.
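For readers curious how the `-i` and `-h` options might be wired up, here is a minimal sketch using Python's standard `argparse` module. It is an assumption about the approach, not the actual contents of pdftoexcel.py: the `build_parser` helper, the help strings, and marking `-i` as required are all illustrative.

```python
import argparse


def build_parser():
    """Build a parser with an -i option; argparse adds -h/--help itself."""
    parser = argparse.ArgumentParser(
        prog="pdftoexcel.py",
        description="Convert tables in a PDF to Excel.",
    )
    # Assumption: -i takes the path of the PDF and is mandatory.
    parser.add_argument(
        "-i",
        dest="input",
        required=True,
        help="path of the PDF file to convert",
    )
    return parser


# Parse the same arguments as the example command above.
args = build_parser().parse_args(["-i", "filetoconvert.pdf"])
print(args.input)
```

Because `argparse` generates the `-h` output from the declared options, the help text stays in sync with whatever arguments the script accepts.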