Quick Start Guide to Large Language Models

Get your copy today and please leave a rating/review to tell me what you thought! ⭐⭐⭐⭐⭐

Quick Start Guide to Large Language Models

Welcome to the GitHub repository for the "Quick Start Guide to Large Language Models" book. This repository contains the code snippets and notebooks used in the book, demonstrating various applications of Transformer models.

Repository Structure

Directories

  • notebooks: This directory contains Jupyter notebooks for each chapter in the book.
  • data: Contains the datasets used in the notebooks.
  • images: Contains images and graphs used in the notebooks.

Notebooks

Here are some of the notebooks included in the notebooks directory:

Part I - Introduction to Large Language Models

  • 2_semantic_search.ipynb: An introduction to semantic search using OpenAI and open source models.
    • I have an updated version here with the updated OpenAI client usage plus the use of the latest V3 OpenAI Embedding. Spoiler alert, the open-source embedder + a fine-tuned cross encoder beat even the largest OpenAI embedder :)
  • 3_prompt_engineering.ipynb: A guide to effective prompt engineering for instruction aligned LLMs.

Part II - Getting the Most Out of LLMs

Part III - Advanced LLM Usage

We will continue to add more notebooks exploring topics like fine-tuning, advanced prompt engineering, combining transformers, and various use-cases. Stay tuned!

How to Use

To use this repository, clone it to your local machine, navigate to the notebooks directory, and open the Jupyter notebook of your choice. Note that some notebooks may require specific datasets, which can be found in the data directory.

Please ensure that you have the necessary libraries installed and that they are up to date. This can usually be done by running pip install -r requirements.txt in the terminal.

Contributing

Contributions are welcome! Feel free to submit a pull request if you have any additions, corrections, or enhancements to submit.

Disclaimer

This repository is for educational purposes and is meant to accompany the "Quick Start Guide to Large Language Models" book. Please refer to the book for in-depth explanations and discussions of the topics covered in the notebooks.

More From Sinan

  1. Check out Sinan's Newsletter AI Office Hours for more AI/LLM content!
  2. Sinan has a podcast called Practically Intelligent where he chats about the latest and greatest in AI!
  3. Follow the Getting Started with Data, LLMs and ChatGPT Playlist on O'Reilly for a curated list of Sinan's work!