db7894

ML Compiler Eng, erstwhile CS-Math Major @ HMC

Harvey Mudd College

db7894's Stars

deepseek-ai/DeepSeek-V3
Language:Python93549
likenneth/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Language:Python48738
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python137k27.4k
jerber/lang-jepa
Language:Python854
chavinlo/musicgen_trainer
simple trainer for musicgen/audiocraft
Language:Python164
aeromamba-super-resolution/aeromamba
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models", presented in LAMIR 2024 Workshop
Language:Python303
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
Language:Python71737
a-ghorbani/pocketpal-ai
An app that brings language models directly to your phone.
Language:TypeScript1.4k109
jbloomAus/DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
Language:Jupyter Notebook7618
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python6.7k611
cmsflash/efficient-attention
An implementation of the efficient attention module.
Language:Python29526
NiekM/scrybe
Type-and-example directed program synthesis using example propagation, as described in Program Synthesis Using Example Propagation.
Language:HTML12
kevinniechen/scalinglaws
Language:Jupyter Notebook1
pytorch-labs/attention-gym
Helpful tools and examples for working with flex-attention
Language:Python55027
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Language:Python1.2k84
dust-tt/llama-ssp
Experiments on speculative sampling with Llama models
Language:Python1207
Lesterpaintstheworld/terminal-velocity
A novel created autonomously by a team of 10 AI agents
Language:Python93161
kuleshov-group/llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
Language:Python71176
kuleshov-group/awesome-discrete-diffusion-models
A curated list for awesome discrete diffusion models resources.
1807
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Language:C++1.4k514
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python32.6k5k
stanford-cs149/asst4-trainium
Language:Python1422
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
Language:Python3.2k317
llvm-hs/llvm-hs
Haskell bindings for LLVM
Language:LLVM514121
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
Language:Python22716
emscripten-core/emscripten
Emscripten: An LLVM-to-WebAssembly Compiler
Language:C++26k3.3k
hughbzhang/o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
Language:Python623
google-deepmind/optax
Optax is a gradient processing and optimization library for JAX.
Language:Python1.7k197
kyegomez/Mixture-of-Depths
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Language:Python795
Infatoshi/cuda-course
Language:Cuda688105

db7894

db7894's Stars

deepseek-ai/DeepSeek-V3

likenneth/honest_llama

huggingface/transformers

jerber/lang-jepa

chavinlo/musicgen_trainer

aeromamba-super-resolution/aeromamba

AnswerDotAI/ModernBERT

a-ghorbani/pocketpal-ai

jbloomAus/DecisionTransformerInterpretability

sgl-project/sglang

cmsflash/efficient-attention

NiekM/scrybe

kevinniechen/scalinglaws

pytorch-labs/attention-gym

Lightning-AI/lightning-thunder

dust-tt/llama-ssp

Lesterpaintstheworld/terminal-velocity

kuleshov-group/llmtools

kuleshov-group/awesome-discrete-diffusion-models

llvm/torch-mlir

vllm-project/vllm

stanford-cs149/asst4-trainium

xjdr-alt/entropix

llvm-hs/llvm-hs

histmeisah/Large-Language-Models-play-StarCraftII

emscripten-core/emscripten

hughbzhang/o1_inference_scaling_laws

google-deepmind/optax

kyegomez/Mixture-of-Depths

Infatoshi/cuda-course