ZichenMiao's Stars
twistedcubic/attention-rank-collapse
[ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.
kohya-ss/sd-scripts
black-forest-labs/flux
Official inference repo for FLUX.1 models
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
karpathy/LLM101n
LLM101n: Let's build a Storyteller
zhoubolei/bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
uclaml/MoE
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
qiuyu96/CoDeF
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
jannerm/ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
pesser/pytorch_diffusion
PyTorch reimplementation of Diffusion Models
yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
deep-floyd/IF
Delay-Xili/SDNet
An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition"
nv-tlabs/GENIE
GENIE: Higher-Order Denoising Diffusion Solvers
wl-zhao/UniPC
[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
rishabhk108/AdvancedOptML
CS 7301: Spring 2021 Course on Advanced Topics in Optimization in Machine Learning
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
AaltoPML/node-BNN-covariate-shift
Repository for the paper Tackling covariate shift with node-based Bayesian neural networks (ICML 2022)
ISosnovik/sesn
Code for "Scale-Equivariant Steerable Networks"
google-research/rigl
End-to-end training of sparse deep neural networks with little-to-no performance loss.
verbose-void/rigl-torch
Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.
mseitzer/pytorch-fid
Compute FID scores with PyTorch.
jychoi118/P2-weighting
CVPR 2022
inria-thoth/T3SC
Official implementation of T3SC (Neurips 2021)