A set of instructional materials, code samples and Python scripts featuring LLMs (GPT etc) through interfaces like llamaindex, langchain, Chroma (Chromadb), Pinecone etc. Mainly used to store reference code for my LangChain tutorials on YouTube.
Learn LangChain from my YouTube channel:
- Part 1: LangChain + OpenAI tutorial: Building a Q&A system w/ own text data
- Part 2: LangChain + OpenAI to chat w/ (query) own Database / CSV
- Part 3: LangChain + HuggingFace's Inference API (no OpenAI credits required!)
- Part 4: Understanding Embeddings in LLMs
- Part 5: Query any website with LLamaIndex + GPT3 (ft. Chromadb, Trafilatura)
- Part 6: Locally-hosted, offline LLM w/LlamaIndex + OPT (open source, instruction-tuning LLM)
- Clone this repo
- Install requirements:
pip install -r requirements.txt
- Some sample data are provided to you in the
news
foldeer, but you can use your own data by replacing the content (or adding to it) with your own text files. - Create a
.env
file which contains your OpenAI API key. You can get one from here.HUGGINGFACEHUB_API_TOKEN
andPINECONE_API_KEY
are optional, but they are used in some of the lessons.
The .env
file should look like this:
OPENAI_API_KEY=your_api_key_here
# optionals (not required for most of the series)
HUGGINGFACEHUB_API_TOKEN=your_api_token_here
PINECONE_API_KEY=your_api_key_here
HuggingFace and Pinecone are optional but is recommended if you want to use the Inference API and explore those models outside of the OpenAI ecosystem. This is demonstrated in Part 3 of the tutorial series.
5. Run the examples in any order you want. For example, python 6_team.py
will run the website Q&A example, which uses GPT-3 to answer questions about a company and the team of people working at Supertype.ai. Watch the corresponding video to follow along each of the examples.