/Llama2RAG

A working example of RAG using LLama 2 70b and Llama Index

Primary LanguageJupyter Notebook

Building LLama Banker

Doing RAG for Finance using LLama2. Highly recommend you run this in a GPU accelerated environment. I used a A100-80GB GPU on Runpod for the video!

See it live and in action 📺

Tutorial

Startup 🚀

  1. Clone this repo git clone https://github.com/nicknochnack/Llama2RAG
  2. Go into the directory cd Llama2RAG
  3. Startup jupyter by running jupyter lab in a terminal or command prompt
  4. Update the auth_token variable in the notebook.
  5. Hit Ctrl + Enter to run through the notebook!
  6. Go back to my YouTube channel and like and subscribe 😉...no seriously...please! lol
  7. If you want to start up the streamlit app run streamlit run app.py (make sure you update your auth token in there as well!)

Other References 🔗

-Llama 2 70b Chat Model Card:hugging face model card on the model used for the video.

-Llama Index Doco:sick library used for RAG.

Who, When, Why?

👨🏾‍💻 Author: Nick Renotte
📅 Version: 1.x
📜 License: This project is licensed under the MIT license. Feel free to use it, just don't do bad things with it.