Simple project written as a Proof of Concept for the following requirements:
- Accepts a PDF, large or small
- Generates a word index for the given PDF (like the one you find at the back of books)
- Generates a new PDF with that index, nicely formatted
- Returns the ouptut for the user to be able to download right away
For example, for the given input in docs/how_to_choose_your_people.pdf, it generates the output docs/output.pdf.
Requirements: pyenv
installed, and Python version in .python-version
installed with pyenv.
make setup
make install
. venv/bin/activate
HOST='127.0.0.1' PORT='8080' PDF_WORD_INDEX_PASSWORD='asdf' python app.py
- HOST
- PORT
- PDF_WORD_INDEX_PASSWORD