/baby-llama

A simple Llama-architecture LLM in PyTorch

Primary Language: Jupyter Notebook

Baby Llama: Mini-LLM using Llama architecture

Paper: Llama 2: Open Foundation and Fine-Tuned Chat Models

This is a small-scale implementation of the Llama 2 language model architecture developed by Meta. It builds directly on the architecture described in the Llama 2 paper referenced above.
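As an illustration of one building block from that architecture, the Llama 2 paper describes pre-normalization with RMSNorm. The sketch below is not taken from this repository. It is a minimal, dependency-free version in plain Python, and the function name `rms_norm`, the `eps` default, and the sample values are assumptions chosen for illustration. A PyTorch implementation would express the same arithmetic with tensor operations.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """Divide a vector by its root mean square, then apply a per-element gain.

    x      : list of floats (one hidden-state vector)
    weight : list of floats, same length as x (learned gain in a real model)
    eps    : small constant for numerical stability (assumed default)
    """
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [w * v * inv_rms for v, w in zip(x, weight)]


# Hypothetical hidden-state vector and a gain of all ones.
hidden = [1.0, -2.0, 3.0, -4.0]
gain = [1.0] * len(hidden)
normed = rms_norm(hidden, gain)

# With unit gain, the output has a mean square very close to 1.
normed_mean_sq = sum(v * v for v in normed) / len(normed)
```

Unlike LayerNorm, this normalization does not subtract the mean or add a bias. It only rescales the vector, which is why the signs of the inputs are preserved.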