IndexPDF

IndexPDF is a command-line script that indexes the unique words in a PDF document and the pages on which they appear. I could not find a similar Python script that did not rely heavily on PyPDF2, and I wrote this one after discovering that Adobe charges a gargantuan sum for this simple functionality.

Primary language: Python. License: MIT.

IndexerRetro

IndexerRetro is a small script that indexes PDF files. I wrote it after being forced to do this indexing myself and realizing that Adobe charges $300 per year for the same service.

To run:

python indexerRetro.py --parent [Abs path to parent dir of pdf] --file [pdf name] --output [file name]

The script then produces a txt file that lists every unique word in the PDF document, together with the pages on which each word appears.
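As a rough illustration of the kind of index described above, here is a minimal sketch. It is not the code in indexerRetro.py: it assumes the text of each page has already been extracted into a list of strings, and the names `build_index` and `format_index`, the word pattern, and the `word: pages` output layout are all hypothetical.

```python
import re
from collections import defaultdict


def build_index(pages):
    """Map each unique lowercase word to the sorted 1-based pages it appears on.

    `pages` is assumed to be a list of strings, one per PDF page, with the
    text already extracted by some other tool (an assumption of this sketch).
    """
    index = defaultdict(set)
    for page_number, text in enumerate(pages, start=1):
        # Treat runs of letters and apostrophes as words (hypothetical rule).
        for word in re.findall(r"[a-z']+", text.lower()):
            index[word].add(page_number)
    # Sort words alphabetically and page numbers numerically.
    return {word: sorted(numbers) for word, numbers in sorted(index.items())}


def format_index(index):
    """Render the index as one 'word: page, page' line per word."""
    return "\n".join(
        f"{word}: {', '.join(str(n) for n in numbers)}"
        for word, numbers in index.items()
    )


if __name__ == "__main__":
    sample_pages = ["The cat sat.", "A cat and a dog."]
    print(format_index(build_index(sample_pages)))
```

Using sets while collecting means a word repeated on the same page is recorded only once for that page.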

  • Adapted to Windows
  • Adapted to everything else
2018 Ali Rassolie