bzantium

Machine Learning Engineer

KakaobrainPangyo

bzantium's Stars

pallets/flask
The Python micro framework for building web applications.
Language:Python68.4k 2.1k 2.7k16.3k
openai/openai-cookbook
Examples and guides for using the OpenAI API
Language:MDX60.9k 895 4889.7k
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook40.4k 418 694.3k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
30.7k 2.6k 01.7k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.8k 252 1412.8k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language:Python19.7k 134 1.2k1.4k
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
Language:Python15.7k 142 2.2k2.5k
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++13.8k 197 1.6k1.7k
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python8.1k 50 1.1k591
mosaicml/composer
Supercharge Your Model Training
Language:Python5.2k 49 552428
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Language:Python4.8k 34 134266
mosaicml/llm-foundry
LLM training code for Databricks foundation models
Language:Python4.1k 48 385534
openai/transformer-debugger
Language:Python4.1k 25 14241
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.4k 28 369316
meta-llama/llama-agentic-system
Agentic components of the Llama Stack APIs
Language:Python3.2k 38 35308
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Language:Python2.4k 43 88260
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Language:Python2.1k 47 137158
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language:Jupyter Notebook2k 15 557289
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language:Python2k 33 366336
apple/axlearn
An Extensible Deep Learning Library
Language:Python1.9k 63 23277
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Language:Python1.4k 23 130106
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Language:Python1.4k 43 93135
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Language:Python902 30 172109
google/cld3
Language:C++804 35 63112
stanford-crfm/levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Language:Python526 14 25987
huggingface/cosmopedia
Language:Python477 12 1245
p-lambda/dsir
DSIR large-scale data selection framework for language model training
Language:Python239 21 819
lucastononro/llm-food-delivery
Making the food-delivery experience easy for busy folks :)
Language:Python207 0 651
NetEase-FuXi/EETQ
Easy and Efficient Quantization for Transformers
Language:C++187 6 2614
google/maxdiffusion
Language:Python145 12 614

bzantium

bzantium's Stars

pallets/flask

openai/openai-cookbook

mlabonne/llm-course

karpathy/LLM101n

karpathy/llm.c

unslothai/unsloth

UKPLab/sentence-transformers

triton-lang/triton

FlagOpen/FlagEmbedding

mosaicml/composer

microsoft/LLMLingua

mosaicml/llm-foundry

openai/transformer-debugger

OpenRLHF/OpenRLHF

meta-llama/llama-agentic-system

young-geng/EasyLM

huggingface/datatrove

embeddings-benchmark/mteb

NVIDIA/TransformerEngine

apple/axlearn

McGill-NLP/llm2vec

huggingface/nanotron

huggingface/lighteval

google/cld3

stanford-crfm/levanter

huggingface/cosmopedia

p-lambda/dsir

lucastononro/llm-food-delivery

NetEase-FuXi/EETQ

google/maxdiffusion