Some useful details:
- The input data is unstructured PDF.
- images.py extracts images from the PDF.
- notes.py extracts notes from the PDF.
- data.json is the final output, which holds the extracted data.
- section.py extracts sections, figures, and tables.
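As a rough illustration of how the results of the scripts above might be combined into data.json, here is a minimal sketch. The function names `build_record` and `write_output`, the key names, and the sample values are all assumptions made for illustration; the actual schema produced by this project may differ.

```python
import json
from pathlib import Path


def build_record(sections, figures, tables, notes, image_paths):
    """Combine per-script results into one JSON-serializable dict.

    The key names here are hypothetical, not the project's confirmed schema.
    """
    return {
        "sections": list(sections),
        "figures": list(figures),
        "tables": list(tables),
        "notes": list(notes),
        "images": [str(p) for p in image_paths],
    }


def write_output(record, path="data.json"):
    """Write the combined record to disk as indented UTF-8 JSON."""
    text = json.dumps(record, indent=2, ensure_ascii=False)
    Path(path).write_text(text, encoding="utf-8")


if __name__ == "__main__":
    # Placeholder values; real values would come from the extraction scripts.
    record = build_record(
        sections=["Introduction"],
        figures=["Figure 1"],
        tables=["Table 1"],
        notes=["Sample note"],
        image_paths=[Path("images") / "allimages" / "example.png"],
    )
    # Written to a separate file so the real data.json is not overwritten.
    write_output(record, "data_example.json")
```

Keeping everything in one dictionary means the output can be read back with a single `json.load` call.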
To run the app, download this repository, install the required libraries, and run the following from the command line:
python file.py
Personal Finance
|
|---- images
|     |---- allimages
|---- data.json
|---- images.py
|---- notes.py
|---- pyapi.py
|---- section.py
|---- markdown.py
|---- Procfile
|---- README.md
|---- pdfs
|---- setup.sh
- Python libraries: PyPDF2, Azure
- Version control: Git
- Cloud technology: Azure Form Recognizer
- IDE: VS Code
- Code repository: GitHub