/yet_another_llm_repo

Different resources on Large Language Model (LLM) which I am using or planning to use.

Apache License 2.0Apache-2.0

Yet Another LLM Repository

I am a Machine Learning Engineer/Software Developer exploring LLM for last few months. Here are the different resources on Large Language Model (LLM) which I am using or planning to refer to in future. This list doesn't include academic papers or theoretical courses. It mostly focuses on resources (code, tools, blogs, short videos) which will help an engineer to build applications around LLM. Please note, I am NOT doing AI; I am just building a layer around AI.

Not sure if you should explore LLM? Start with this highly motivational post :The Rise of the AI Engineer

What is LLM/GPT?

Getting Started

If you are already familiar with ChatGPT and want to get started with LLM (to build an application around it or to understand its capabilities), you might want to try OpenAI Playground and APIs. On signing up at OpenAI, you will be allocated with certain amount of free quota (to use the service) which is more than sufficient to build your first prototype. Save your API Key.

If you are located in US/UK, Claude-2 by Anthropic is another option. As of Aug'2023, I have not used Claude, since it's not available in India.

Third option is to use Azure OpenAI. This is only available for Azure Enterprise Customers and Most Valuable Bloggers at this moment. You can request for it here.

Online Courses

Prompt Engineering Resources

My suggestion would be not to spend lot of time going through different prompt engineering courses and guides. "GPT best practices by OpenAI" is more than enough. Browse through other resources on the need basis.

Tools

There are a bunch of Open Source libraries, frameworks that have been developed around LLM. Here are few which I have used:

What are different LLMs available and how ?

GPT's by OpenAI are not the only LLMs available. There are other commercial and Open Source models which you might be interested in. Following are the leaderboards to compare the performance of various LLMs:

Use Cases for LLM

langchain documentation captures most of the popular use cases: link

Use Case : Retrieval-Augmented Generation / Document QA

Probably the most popular application of LLM: Build a ChatBot on your private data. But remember, semantic based search doesn't always work

Blogs:

Courses

Code Examples

Tools/Frameworks

Case Study

Use Case : Agents

Blog

Courses

Code Examples

Automatic Speech Recognition

Evaluation of LLMs/LLM based systems

The way we evaluate a LLM and a system built using LLM are different. Here I am going to focus mostly on evaluation methodologies used for LLM based systems

Blogs/Courses

Tools

  • Azure Promptflow: Service to evaluate and track LLM Experiments
  • openai-evals: A framework for evaluating LLMs (large language models) or systems built using LLMs as components
  • langchain-evals: Evaluation using LangChain as a framework. Make sure to check the examples. Also read the source code for the example prompts
  • lang-kit - Library
  • langsmith

LLM In Production

Azure OpenAI

Interesting Links

Transformer Explained