This is a Langchain project that enables you to interact with any PDFs via the Gradio chat interface. It not only fetches relevant answers but also renders relevant pages of the PDFs.
- Langchain
- ChromaDB as vector store
- OpenAI embeddings
- OpenAI chat model (gpt-3.5-turbo)
- Gradio
- Build a chatbot interface using Gradio
- Extract texts from pdfs and create embeddings
- Store embeddings in the Chroma vector database
- Send query to the backend (Langchain chain)
- Perform a semantic search over texts to find relevant sources of data
- Send data to LLM (ChatGPT) and receive answers on the chatbot
https://www.analyticsvidhya.com/blog/2023/05/build-a-chatgpt-for-pdfs-with-langchain/