db7894's Stars
deepseek-ai/DeepSeek-V3
likenneth/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
jerber/lang-jepa
chavinlo/musicgen_trainer
simple trainer for musicgen/audiocraft
aeromamba-super-resolution/aeromamba
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models", presented in LAMIR 2024 Workshop
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
a-ghorbani/pocketpal-ai
An app that brings language models directly to your phone.
jbloomAus/DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
cmsflash/efficient-attention
An implementation of the efficient attention module.
NiekM/scrybe
Type-and-example directed program synthesis using example propagation, as described in Program Synthesis Using Example Propagation.
kevinniechen/scalinglaws
pytorch-labs/attention-gym
Helpful tools and examples for working with flex-attention
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
dust-tt/llama-ssp
Experiments on speculative sampling with Llama models
Lesterpaintstheworld/terminal-velocity
A novel created autonomously by a team of 10 AI agents
kuleshov-group/llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
kuleshov-group/awesome-discrete-diffusion-models
A curated list for awesome discrete diffusion models resources.
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
stanford-cs149/asst4-trainium
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
llvm-hs/llvm-hs
Haskell bindings for LLVM
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
emscripten-core/emscripten
Emscripten: An LLVM-to-WebAssembly Compiler
hughbzhang/o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
google-deepmind/optax
Optax is a gradient processing and optimization library for JAX.
kyegomez/Mixture-of-Depths
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Infatoshi/cuda-course