18445864529's Stars
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
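A minimal usage sketch for Diffusers: loading a pretrained text-to-image pipeline and sampling one image. The model id, prompt, and output path are placeholders for illustration, not taken from this list.

```python
import torch
from diffusers import DiffusionPipeline

# Load a pretrained text-to-image pipeline (model id is an example placeholder).
pipe = DiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
)
pipe = pipe.to("cuda")

# Generate a single image from a text prompt and save it.
image = pipe("an astronaut riding a horse on the moon").images[0]
image.save("astronaut.png")
```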
Stability-AI/generative-models
Generative Models by Stability AI
guoyww/AnimateDiff
Official implementation of AnimateDiff.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
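A short image-captioning sketch using LAVIS's model-loading helper, assuming the BLIP caption checkpoint names from the library's documentation; the image path is a placeholder.

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load a BLIP captioning model together with its image preprocessors.
model, vis_processors, _ = load_model_and_preprocess(
    name="blip_caption", model_type="base_coco", is_eval=True, device=device
)

# "example.jpg" is a placeholder path.
raw_image = Image.open("example.jpg").convert("RGB")
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)

# Generate a caption for the image.
print(model.generate({"image": image}))
```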
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
princeton-vl/RAFT
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Yujun-Shi/DragDiffusion
[CVPR 2024, Highlight] Official code for DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
VainF/pytorch-msssim
Fast and differentiable MS-SSIM and SSIM for pytorch.
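A sketch of pytorch-msssim's SSIM/MS-SSIM functions used as a differentiable training loss; the tensor shapes and the 0–1 value range are assumptions made for the example.

```python
import torch
from pytorch_msssim import ssim, ms_ssim

# Two batches of images in (N, C, H, W) layout with values in [0, 1].
x = torch.rand(4, 3, 256, 256)
y = torch.rand(4, 3, 256, 256, requires_grad=True)

ssim_val = ssim(x, y, data_range=1.0, size_average=True)        # scalar SSIM
ms_ssim_val = ms_ssim(x, y, data_range=1.0, size_average=True)  # scalar MS-SSIM

# Both metrics are differentiable, so 1 - MS-SSIM can serve as a loss.
loss = 1 - ms_ssim(x, y, data_range=1.0)
loss.backward()
```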
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
Vchitect/LaVie
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
mayuelala/FollowYourClick
[AAAI 2025] Official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
ExponentialML/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
TonyLianLong/LLM-groundedDiffusion
LLM-grounded Diffusion (LMD): Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (TMLR 2024)
RQ-Wu/LAMP
[CVPR 2024] LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation
TIGER-AI-Lab/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
sutdcv/Animal-Kingdom
[CVPR 2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
microsoft/ReCo
ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023
JianhongBai/UniEdit
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
SooLab/Free-Bloom
[NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
kylehkhsu/latent_quantization