
This PDF reader requires a separate program, pdfminer.six, to convert the PDF file to text first: https://github.com/pdfminer/pdfminer.six
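
The conversion step might look something like the sketch below. It assumes pdfminer.six is installed (`pip install pdfminer.six`) and uses its `extract_text` helper. The function names `text_path_for` and `convert_pdf_to_text`, and the choice to save the text next to the PDF with a `.txt` extension, are made up for illustration and are not part of this repo.

```python
from pathlib import Path


def text_path_for(pdf_path):
    """Return the .txt path next to the given PDF (hypothetical naming convention)."""
    return Path(pdf_path).with_suffix(".txt")


def convert_pdf_to_text(pdf_path):
    """Extract the text of a PDF with pdfminer.six and save it as a .txt file."""
    # Imported here so the module still loads when pdfminer.six is not installed.
    from pdfminer.high_level import extract_text

    out_path = text_path_for(pdf_path)
    out_path.write_text(extract_text(str(pdf_path)), encoding="utf-8")
    return out_path
```

Calling `convert_pdf_to_text("book.pdf")` would then write `book.txt`, which can be handed to the reader.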

Then reader.py reads the converted text. You can keep track of where you are by noting the sentence number printed in the console.
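
As a rough illustration of the sentence numbering, here is a minimal sketch. It is not the actual code in reader.py: the names `split_sentences` and `read_from` are hypothetical, and the sentence splitting is a simple heuristic (break after `.`, `!` or `?` followed by whitespace) that will get abbreviations like "e.g." wrong.

```python
import re


def split_sentences(text):
    """Split text into sentences at ., ! or ? followed by whitespace (rough heuristic)."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [p.strip() for p in parts if p.strip()]


def read_from(text, start=1):
    """Print each sentence with its number, beginning at sentence `start`.

    Returns the total number of sentences found.
    """
    sentences = split_sentences(text)
    for number, sentence in enumerate(sentences, start=1):
        if number >= start:
            print(f"[{number}] {sentence}")
    return len(sentences)
```

With something like this, noting the last number printed and passing it back as `start` next time lets you resume where you left off.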


TODO: figure out how to keep track of the position by PDF page number.
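
One possible approach, untested against this repo: pdfminer.six's plain-text output appears to end each page with a form feed character, so splitting on it gives a page number for every sentence. The sketch below assumes that behaviour; the function name `sentences_with_pages` is hypothetical and the sentence splitting is the same rough heuristic of breaking after `.`, `!` or `?`.

```python
import re


def sentences_with_pages(text):
    """Return (sentence_number, page_number, sentence) tuples.

    Assumes pages in the text are separated by form feed characters.
    """
    results = []
    number = 0
    # Page numbers here count form feeds, starting at 1.
    for page_number, page in enumerate(text.split("\f"), start=1):
        for sentence in re.split(r"(?<=[.!?])\s+", page):
            sentence = sentence.strip()
            if sentence:
                number += 1
                results.append((number, page_number, sentence))
    return results
```

A known limitation of this sketch is that a sentence running across a page break is split into two pieces. The page numbers are also just a count of pages in the file, which may not match the numbers printed on the pages themselves.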