gracezhao1997
Postdoctoral researcher at Tsinghua SAIL Group @thu-ml, focusing on AIGC.
Beijing, China
gracezhao1997's Stars
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
thu-ml/RoboticsDiffusionTransformer
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
thu-ml/cond-image-leakage
Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
bytedance/1d-tokenizer
This repo contains the code for the 1D tokenizer and generator.
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
MyNiuuu/MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
pixeli99/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
openai/weak-to-strong
guoyww/AnimateDiff
Official implementation of AnimateDiff.
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
sherwinbahmani/4dfy
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
heheyas/V3D
V3D: Video Diffusion Models are Effective 3D Generators
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
mmathew23/improved_edm
Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"