tranbahien's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
triton-lang/triton
Development repository for the Triton language and compiler
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
facebookresearch/ConvNeXt
Code release for ConvNeXt model
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
kimiyoung/transformer-xl
POSTECH-CVLab/PyTorch-StudioGAN
StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.
jbhuang0604/awesome-tips
thu-ml/unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
Vahe1994/AQLM
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf
bayesiains/nflows
Normalizing flows in PyTorch
VincentStimper/normalizing-flows
PyTorch implementation of normalizing flow models
acids-ircam/diffusion_models
A series of tutorial notebooks on denoising diffusion probabilistic models in PyTorch
wavefrontshaping/complexPyTorch
A high-level toolbox for using complex valued neural networks in PyTorch
ikostrikov/pytorch-flows
PyTorch implementations of algorithms for density estimation
LiyuanLucasLiu/Transformer-Clinic
Understanding the Difficulty of Training Transformers
facebookresearch/amortized-optimization-tutorial
Tutorial on amortized optimization for learning to optimize over continuous domains
google/aqt
amitness/colab-connect
Connect to Google Colab VM from your local VSCode
junhsss/consistency-models
A Toolkit for OpenAI's Consistency Models.
HazyResearch/butterfly
Butterfly matrix multiplication in PyTorch
hahnyuan/PB-LLM
PB-LLM: Partially Binarized Large Language Models
RobertCsordas/moe_attention
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
betanalpha/mcmc_diagnostics
Markov chain Monte Carlo general, and Hamiltonian Monte Carlo specific, diagnostics for Stan
microsoft/Stochastic-Mixture-of-Experts
This package implements THOR: Transformer with Stochastic Experts.
neale/HyperGAN
Generative Model for Neural Networks
cagatayyildiz/neural-ode-tutorial
Neural ODE tutorial