PDFIntellect is a Streamlit app designed for smart PDF data retrieval. This app leverages Language Models (LLMs) to efficiently extract valuable information from PDF documents.
- Advanced PDF parsing.
- Integration with pre-trained Language Models.
- Customizable cascading LLMs.
- Intelligent short answer generation.
- Python 3.7 or higher.
- Streamlit, transformers, torch, and pdfplumber libraries.
- Access to pre-trained Language Models.
- PDF parsing libraries.
- PDF documents for extraction.
-
Clone the repository.
git clone https://github.com/jaywyawhare/PDFIntellect
-
Install the required libraries.
pip install -r requirements.txt
-
Run the Streamlit app.
streamlit run app.py
App will be available at http://localhost:8501.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
This project is licensed under the Licence license.