Dependencies

  • pdf2image (with poppler)
  • pytesseract
  • cv2 (provided by the opencv-python package)
  • tkinter
  • xlsxwriter
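
To confirm the dependencies are installed before running anything, a small check like the one below may help. It is only a sketch: the import names are assumed from the list above, and missing_modules is a hypothetical helper, not part of this repo. Note that poppler is a separate system program and is not covered by this check.

```python
import importlib.util

# Import names assumed from the dependency list above. The pip package name
# can differ from the import name (cv2 comes from opencv-python).
REQUIRED_MODULES = ["pdf2image", "pytesseract", "cv2", "tkinter", "xlsxwriter"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are importable.")
```

An empty result means every listed module can be located; it does not verify that poppler or Tesseract itself is installed.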

Instructions

  • Clone the repo to your system
  • Install the dependencies listed above
  • In the constants.py file, change tesseract_path = "" to the path on your computer where tesseract.exe is located (.../Tesseract-OCR/tesseract.exe)
  • Place the PDF/JPEG files you want to process in the examples folder
  • In the same constants.py file, enter the name of the file you just placed in the examples folder
  • Run main.py
  • You'll find your results in the Details folder, in a sub-directory named after each input file
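
If main.py fails right away, the cause is often a wrong path in constants.py. The sketch below checks the two settings described in the steps above. It rests on several assumptions: check_setup and expected_output_dir are hypothetical helpers, the file_name parameter stands in for whatever variable constants.py actually uses, and the output sub-directory is assumed to be the input filename without its extension.

```python
from pathlib import Path


def check_setup(tesseract_path, file_name, repo_root="."):
    """Return a list of setup problems; an empty list means none were found."""
    problems = []
    # tesseract_path must point to the Tesseract executable on disk.
    if not tesseract_path or not Path(tesseract_path).is_file():
        problems.append("tesseract_path does not point to an existing file")
    # The input file is expected inside the examples folder of the repo.
    input_file = Path(repo_root) / "examples" / file_name
    if not input_file.is_file():
        problems.append(f"input file not found: {input_file}")
    return problems


def expected_output_dir(file_name, repo_root="."):
    """Guess where results land (assumes the sub-directory drops the extension)."""
    return Path(repo_root) / "Details" / Path(file_name).stem
```

Calling check_setup with the values from constants.py before running main.py should surface a mistyped path or a misplaced input file early.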