bmharper/jstpdfextract

Random stuff to extract data out of scanned PDFs

Python

Instructions

Extract the PDF documents into a sub-folder called "pdf"
Install a recent Anaconda release
Open an Anaconda terminal
pip install -r requirements.txt
python 1-pdf-to-text.py
python 2-text-to-csv.py