onevfall's Stars
yuankunzhang/working-guides
A guide for programming in style.
TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024 D&B Track] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
wilson1yan/VideoGPT
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
lllyasviel/ControlNet
Let us control diffusion models!
hkproj/pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
stevenlsw/physgen
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)
jjihwan/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
lucidrains/lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in Pytorch
lucidrains/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
KaiyueSun98/T2V-CompBench
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
zhenzhiwang/HumanVid
Official implementation of HumanVid, NeurIPS D&B Track 2024
aim-uofa/MovieDreamer
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
open-mmlab/Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
guoyww/AnimateDiff
Official implementation of AnimateDiff.
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models