/docsum-chat

Primary LanguagePythonApache License 2.0Apache-2.0

PDF Intelligence System with Retrieval Augmented Generation (RAG)

Overview

The goal of this project is to create a user-centric and intelligent system that enhances information retrieval from PDF documents through natural language queries. The project focuses on streamlining the user experience by developing an intuitive interface, allowing users to interact with PDF content using language they are comfortable with. To achieve this, we leverage the Retrieval Augmented Generation (RAG) methodology introduced by Meta AI researchers.

trial.mp4

Retrieval Augmented Generation (RAG)

Introduction

RAG is a method designed to address knowledge-intensive tasks, particularly in information retrieval. It combines an information retrieval component with a text generator model to achieve adaptive and efficient knowledge processing. Unlike traditional methods that require retraining the entire model for knowledge updates, RAG allows for fine-tuning and modification of internal knowledge without extensive retraining.

Workflow

  1. Input: RAG takes multiple pdf as input.
  2. VectoreStore: The pdf's are then converted to vectorstore using FAISS and all-MiniLM-L6-v2 Embeddings model from Hugging Face.
  3. Memory: Conversation buffer memory is used to maintain a track of previous conversation which are fed to the llm model along with the user query.
  4. Text Generation with GPT-3.5 Turbo: The embedded input is fed to the GPT-3.5 Turbo model from the OpenAI API, which produces the final output.
  5. User Interface: Streamlit is used to create the interface for the application.

Benefits

  • Adaptability: RAG adapts to situations where facts may evolve over time, making it suitable for dynamic knowledge domains.
  • Efficiency: By combining retrieval and generation, RAG provides access to the latest information without the need for extensive model retraining.
  • Reliability: The methodology ensures reliable outputs by leveraging both retrieval-based and generative approaches.

Project Features

  1. User-friendly Interface: An intuitive interface designed to accommodate natural language queries, simplifying the interaction with PDF documents.

  2. Seamless Navigation: The system streamlines information retrieval, reducing complexity and enhancing the overall user experience.

Getting Started

To use the PDF Intelligence System:

  1. Clone the repository to your local machine.

    git clone https://github.com/ArmaanSeth/ChatPDF.git
  2. Install dependencies.

    pip install -r requirements.txt
  3. Run the application.

    streamlit run app.py
  4. Open your browser and navigate to http://localhost:8000 to access the user interface.

Contributing

We welcome contributions to enhance the PDF Intelligence System. If you're interested in contributing, please follow our Contribution Guidelines.

License

This project is licensed under the Apache License.

Acknowledgments

We would like to express our gratitude to the Hugging Face community for the all-MiniLM-L6-v2 Embeddings model, and OpenAI for providing the GPT-3.5 Turbo model through their API.


Feel free to explore and enhance the capabilities of the PDF Intelligence System. Happy querying!