Doing RAG for Finance using Llama 2. Highly recommend you run this in a GPU-accelerated environment. I used an A100-80GB GPU on Runpod for the video!
- Clone this repo
git clone https://github.com/nicknochnack/Llama2RAG
- Go into the directory
cd Llama2RAG
- Start up Jupyter by running
jupyter lab
in a terminal or command prompt
- Update the auth_token variable in the notebook.
- Hit Ctrl + Enter to run through the notebook!
- Go back to my YouTube channel and like and subscribe 😉...no seriously...please! lol
- If you want to start up the Streamlit app, run
streamlit run app.py
(make sure you update your auth token in there as well!)
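If you'd rather not paste your token straight into the notebook and app.py, here is a minimal sketch of reading it from an environment variable instead. This is not how the repo does it: the variable name HF_AUTH_TOKEN and the helper load_auth_token are made up for illustration, so rename them to whatever suits your setup.

```python
import os


def load_auth_token(env_var: str = "HF_AUTH_TOKEN") -> str:
    """Return the token stored in `env_var`, or raise if it is missing.

    Both the helper name and the default variable name are hypothetical;
    they are not part of this repo or of any library.
    """
    token = os.environ.get(env_var, "").strip()
    if not token:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your auth token first."
        )
    return token


# In the notebook or app.py you could then replace the hard-coded value with:
# auth_token = load_auth_token()
```

Set the variable in the same terminal before launching jupyter lab or streamlit run app.py so the process can see it. The upside is that the token never ends up committed alongside the code.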
- Llama 2 70b Chat Model Card: Hugging Face model card for the model used in the video.
- Llama Index Doco: sick library used for RAG.
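If RAG is new to you, the rough idea is: find the chunks of your documents most relevant to a question, then hand them to the model as context. Below is a toy, standard-library-only sketch of that retrieve-then-prompt step. It is an assumption-heavy illustration and not what the notebook or Llama Index actually does: it swaps real embeddings for simple word-count cosine similarity, and the function names and sample sentences are invented.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], top_k: int = 2) -> list[str]:
    """Return the top_k chunks most similar to the query."""
    query_vec = Counter(tokenize(query))
    ranked = sorted(
        chunks,
        key=lambda chunk: cosine(query_vec, Counter(tokenize(chunk))),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(query: str, chunks: list[str], top_k: int = 2) -> str:
    """Stuff the retrieved chunks into a prompt for the language model."""
    context = "\n".join(f"- {chunk}" for chunk in retrieve(query, chunks, top_k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


# Made-up example sentences standing in for chunks of a financial report.
sample_chunks = [
    "Revenue grew 12% year over year in fiscal 2022.",
    "The company opened three new offices.",
    "Net income fell due to higher interest expense.",
]
prompt = build_prompt("How much did revenue grow?", sample_chunks, top_k=1)
```

A real setup would use an embedding model and a vector index for the retrieval part, which is the heavy lifting a library like Llama Index takes care of.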
👨🏾‍💻 Author: Nick Renotte
📅 Version: 1.x
📜 License: This project is licensed under the MIT license. Feel free to use it, just don't do bad things with it.