/jstpdfextract

Random stuff to extract data out of scanned PDFs

Primary LanguagePython

Instructions

  1. Extract the PDF documents into a sub-folder called "pdf"
  2. Install a recent Anaconda release
  3. Open an Anaconda terminal
  4. pip install -r requirements.txt
  5. python 1-pdf-to-text.py
  6. python 2-text-to-csv.py