dongzhuoyao
Humble Ommer-group Postdoc; wechat: greekdance
WHU -> PKU -> University of Amsterdam -> LMU Munich
dongzhuoyao's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
baaivision/Emu3
Next-Token Prediction is All You Need
etched-ai/open-oasis
Inference script for Oasis 500M
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
lucidrains/transfusion-pytorch
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
sihyun-yu/REPA
Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
bytedance/1d-tokenizer
This repo contains the code for 1D tokenizer and generator
pytorch-labs/attention-gym
Helpful tools and examples for working with flex-attention
NVIDIA/cudnn-frontend
cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it
mit-han-lab/hart
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
wpeebles/G.pt
Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"
facebookresearch/MovieGenBench
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in PyTorch
sayakpaul/diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
FoundationVision/OmniTokenizer
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
sony/genwarp
CompVis/imagebart
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
lucidrains/gamengen-pytorch
Implementation of a framework for GameNGen in PyTorch
HKUNLP/DiffuLLaMA
DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
gle-bellier/discrete-fm
Educational implementation of the Discrete Flow Matching paper
lucasjinreal/LLaVA-Magvit2
LeapLabTHU/AdaNAT
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
ML-GSAI/SMDM
rwightman/imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
haoyuhsu/hyper-gaussian-splatting
GSH-Net: A Hypernetwork for 3D Gaussian Splatting