
Accepts PDF files as input and transforms them into a FAISS vector database

Primary language: Python

GenaiPDFReader

The PDF Reader project shows how to transform text from PDFs into embeddings and store them in a vector database. It accepts a query as user input and uses FAISS (Facebook AI Similarity Search) to retrieve the most relevant passages from the database.
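The retrieval flow described above (split text, embed each chunk, store the vectors, search by similarity) can be sketched with the standard library alone. This is an illustration under stated assumptions, not the project's code: `split_text`, `embed`, and `search` are hypothetical helpers, the hashed bag-of-words embedding stands in for a real embedding model, and the brute-force scan stands in for a FAISS index.

```python
import math
import zlib

DIM = 64  # toy embedding size (assumption); real embedding models use far more dimensions


def split_text(text, chunk_size=80, overlap=20):
    """Cut text into overlapping fixed-size character chunks.

    A simplification: it does not try to break on paragraph or sentence
    boundaries the way a recursive splitter would.
    """
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def embed(text):
    """Toy embedding: hashed bag of words, L2-normalised."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        word = word.strip(".,;:!?()")
        if word:
            # crc32 is deterministic across runs, unlike the built-in hash()
            vec[zlib.crc32(word.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def search(index, query, k=2):
    """Brute-force nearest-neighbour search by squared L2 distance."""
    q = embed(query)
    scored = [
        (sum((a - b) ** 2 for a, b in zip(q, vec)), chunk)
        for chunk, vec in index
    ]
    scored.sort(key=lambda pair: pair[0])  # smallest distance first
    return scored[:k]


if __name__ == "__main__":
    document = (
        "FAISS stores vectors and finds nearest neighbours. "
        "Streamlit renders the web page. "
        "Gemini writes the final answer from the retrieved context."
    )
    chunks = split_text(document, chunk_size=60, overlap=10)
    index = [(chunk, embed(chunk)) for chunk in chunks]  # the "vector database"
    for distance, chunk in search(index, "Which tool finds nearest neighbours?"):
        print(f"{distance:.3f}  {chunk!r}")
```

The chunk overlap keeps a sentence that straddles a chunk boundary retrievable from at least one chunk. A dedicated index replaces the linear scan once the collection grows.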

With the retrieved information as context, it prompts the Gemini Pro model to generate a response.
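A minimal sketch of how retrieved passages might be combined with the user's question before the model is called. The `build_prompt` helper and the prompt wording are assumptions for illustration, not the project's actual prompt; the call to the Gemini model itself is left out.

```python
def build_prompt(context_chunks, question):
    """Assemble a retrieval-augmented prompt from retrieved text and a question."""
    # Join the retrieved passages, separated by blank lines
    context = "\n\n".join(chunk.strip() for chunk in context_chunks)
    return (
        "Answer the question using only the context below. "
        "If the context does not contain the answer, say so.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )


if __name__ == "__main__":
    retrieved = [
        "FAISS stores vectors and finds nearest neighbours.",
        "Streamlit renders the web page.",
    ]
    print(build_prompt(retrieved, "What does FAISS do?"))
```

Telling the model to rely only on the supplied context is a common way to keep answers tied to the PDF content rather than to the model's general knowledge.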

Application with Gemini, Python and PDF inputs

References

To run the app without changing the PATH environment variable, use: python -m streamlit run app.py

How to Work With a PDF in Python

Working with PDF files in Python

LangChain's RecursiveCharacterTextSplitter

LangChain Docs : Text Splitters

AttributeError: module 'langchain_community.faiss'

Visualise pickle files in Visual Studio Code

How to run Python streamlit applications

De-serialization relies on loading a pickle file