DonRL10's Stars
jla524/fromthetensor
From the Tensor to Stable Diffusion, a rough outline for a 9 week course.
hubertsiuzdak/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
arpitingle/gpu-alpha
clu0/unet.cu
UNet diffusion model in pure CUDA
karpathy/deep-vector-quantization
VQVAEs, GumbelSoftmaxes and friends
karpathy/LLM101n
LLM101n: Let's build a Storyteller
lucidrains/titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
roboflow/supervision
We write your reusable computer vision tools. 💜
microsoft/Samba
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
kvfrans/jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
jwasham/coding-interview-university
A complete computer science study plan to become a software engineer.
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
mcinglis/c-style
My favorite C programming practices.
OpenMOSS/AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
alessiodm/drl-zh
Deep Reinforcement Learning: Zero to Hero!
Jokeren/Awesome-GPU
Awesome resources for GPUs
myshell-ai/OpenVoice
Instant voice cloning by MyShell.
ggerganov/llama.cpp
LLM inference in C/C++
karpathy/llm.c
LLM training in simple, raw C/CUDA
srush/Triton-Puzzles
Puzzles for learning Triton
joey00072/ohara
Collection of autoregressive model implementation
OpenInterpreter/01
The open-source language model computer
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
tysam-code/hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).
ezelikman/quiet-star
Code for Quiet-STaR
mshumer/gpt-prompt-engineer