RAG - Adding Private Data to LLMs

LLMs have stunned the world with their capacity to create realistic images, code, and dialogue. Undoubtedly, ChatGPT has taken the world by storm. Millions are using it. But while it's great for general-purpose knowledge, it only knows information it was trained on, which is pre-2021 generally available internet data. It lacks awareness of your private data and remains uninformed about recent data sources. Thus, to improve them in that regard, we can provide them with information that we retrieved from a search step. This makes them more factual and gives a better ability to provide the model with up-to-date information, without the need to retrain these massive models. This is precisely what a retrieval-augmented LLM or Retrieval-Augmented Generation (RAG) system is. Indeed, this repository will precisely outline the creation of an RAG system and elucidate the optimization steps involved.

RAG
Tech Stack
Installation
Useful Links
Contact

RAG

Tech Stack

LangChain
LlamaIndex
Azure OpenAI
Gradio

Installation

Clone the Github repository

git clone https://github.com/zekaouinoureddine/Adding-Private-Data-to-LLMs.git

Requirements Cd to the project directory and ensure that you have Python 3 installed, along with the necessary dependencies.
```
cd Adding-Private-Data-to-LLMs
pip install -r requirements.txt
```
Run the Gradio app
```
python rag.py
```

Visit http://127.0.0.1:7860 on your machine to test the app. You should see something like the following:

Useful Links

Blog	Plateform	Language
Ask Your Own Data	Hiberus Blog	ES
Ask Your Own Data	Medium	EN
Ask Your Web Pages	Hiberus Blog	ES
Ask Your Web Pages	Medium	EN

Contact

If you like it, give it a ⭐, then follow me on:

LinkedIn: Nour Eddine ZEKAOUI
Twitter: @NZekaoui

Back To The Top

zekaouinoureddine/Adding-Private-Data-to-LLMs

RAG - Adding Private Data to LLMs

Table of Contents

RAG

Tech Stack

Installation

Useful Links

Contact