bzantium's Stars
pallets/flask
The Python micro framework for building web applications.
openai/openai-cookbook
Examples and guides for using the OpenAI API
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
karpathy/llm.c
LLM training in simple, raw C/CUDA
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
triton-lang/triton
Development repository for the Triton language and compiler
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
mosaicml/composer
Supercharge Your Model Training
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
mosaicml/llm-foundry
LLM training code for Databricks foundation models
openai/transformer-debugger
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
meta-llama/llama-agentic-system
Agentic components of the Llama Stack APIs
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
apple/axlearn
An Extensible Deep Learning Library
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
google/cld3
stanford-crfm/levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
huggingface/cosmopedia
p-lambda/dsir
DSIR large-scale data selection framework for language model training
lucastononro/llm-food-delivery
Making the food-delivery experience easy for busy folks :)
NetEase-FuXi/EETQ
Easy and Efficient Quantization for Transformers
google/maxdiffusion