vwxyzjn's Stars
xai-org/grok-1
Grok open release
karpathy/llm.c
LLM training in simple, raw C/CUDA
wandb/openui
OpenUI lets you describe UI using your imagination, then see it rendered live.
ml-explore/mlx
MLX: An array framework for Apple silicon
astral-sh/uv
An extremely fast Python package installer and resolver, written in Rust.
state-spaces/mamba
Mamba SSM architecture
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
BartoszJarocki/cv
Print-friendly, minimalist CV page
PWhiddy/PokemonRedExperiments
Playing Pokemon Red with Reinforcement Learning
hrvach/deskhop
Fast Desktop Switching Device
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
cnpryer/huak
My experimental Python package manager.
lhao499/RingAttention
Transformers with Arbitrarily Large Context
huggingface/text-clustering
Easily embed, cluster and semantically label text datasets
abacaj/code-eval
Run evaluation on LLMs using human-eval benchmark
instadeepai/flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
liuzuxin/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
MatX-inc/seqax
seqax = sequence modeling + JAX
RL4VLM/RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
foundation-model-stack/fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
SpellcraftAI/oaib
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
DramaCow/jaxued
instadeepai/sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
emilianbold/PDFwriter
An OSX print to pdf-file printer driver
cogment/cogment-lab
A toolkit for practical Human-AI cooperation research
google/putting-dune