gracezhao1997

Postdoctoral researcher at Tsinghua SAIL Group @thu-ml, focusing on AIGC.

Beijing, China

gracezhao1997's Stars

mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language:Python2.4k189
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
Language:Python78418
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Language:Python1.4k118
thu-ml/RoboticsDiffusionTransformer
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Language:Python46440
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Language:Python2.3k225
thu-ml/cond-image-leakage
Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)
28928
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language:Python2.1k88
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Language:Python70428
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python4.3k314
bytedance/1d-tokenizer
This repo contains the code for 1D tokenizer and generator
Language:Jupyter Notebook54623
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.3k56
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.5k973
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.8k90
MyNiuuu/MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Language:Python62736
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.6k207
pixeli99/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
Language:Python60761
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
Language:Jupyter Notebook44426
openai/weak-to-strong
Language:Python2.5k307
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.6k872
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python4.7k598
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.3k2.2k
sherwinbahmani/4dfy
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
Language:Python3158
heheyas/V3D
V3D: Video Diffusion Models are Effective 3D Generators
Language:Python45718
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language:Python2.8k181
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.4k569
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.6k1k
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook9.2k824
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
Language:Python78763
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language:Python7k721
mmathew23/improved_edm
Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"
Language:Python884

gracezhao1997

gracezhao1997's Stars

mit-han-lab/efficientvit

NVIDIA/Cosmos-Tokenizer

jquesnelle/yarn

thu-ml/RoboticsDiffusionTransformer

jy0205/Pyramid-Flow

thu-ml/cond-image-leakage

Alpha-VLLM/Lumina-T2X

TencentARC/Open-MAGVIT2

FoundationVision/VAR

bytedance/1d-tokenizer

FoundationVision/LlamaGen

HumanAIGC/AnimateAnyone

ChenHsing/Awesome-Video-Diffusion-Models

MyNiuuu/MOFA-Video

Doubiiu/DynamiCrafter

pixeli99/SVD_Xtend

magic-research/piecewise-rectified-flow

openai/weak-to-strong

guoyww/AnimateDiff

fudan-generative-vision/champ

hpcaitech/Open-Sora

sherwinbahmani/4dfy

heheyas/V3D

PixArt-alpha/PixArt-alpha

facebookresearch/DiT

PKU-YuanGroup/Open-Sora-Plan

facebookresearch/dinov2

alibaba/animate-anything

modelscope/modelscope

mmathew23/improved_edm