WuTao-CS

Zhejiang UniversityZhejiang

WuTao-CS's Stars

Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.7k 449 3155.1k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.6k 115 3951.4k
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.6k 673 94981
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.9k 154 3671k
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook7.1k 59 138486
pengsida/learning_research
本人的科研经验
6.2k 71 31369
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5.5k 61 400350
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
3.5k 41 4309
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Language:Python3k 32 138267
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.7k 32 142218
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language:Python2.7k 44 406163
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.9k 53 1594
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language:Python1.7k 22 8885
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Language:Python1.4k 50 3675
ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
Language:Python914 26 4484
Vchitect/LaVie
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
Language:Python906 28 2659
ExponentialML/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
Language:Python675 18 68107
m-bain/webvid
Large-scale text-video dataset. 10 million captioned short videos.
Language:Python614 9 2139
Delppine1024/TGreen
Some files work well on T v1.1 (The latest support v1.8.10/1.8.3-dev), Powered by TC
591 19 1322
ali-vilab/Cones-V2
[NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects
Language:Python520 35 1019
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML404 15 523
ID-Animator/ID-Animator
Language:Python363 24 1826
atfortes/Awesome-Controllable-Generation
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, T2I-Adapter, IP-Adapter.
337 8 118
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Language:Python288 8 1012
Akaneqwq/360DVD
[CVPR2024] 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Language:Python129 3 86
NPU-YanChi/diff-gaussian-rasterization-for-gsslam
Language:Cuda79 6 33
Zehong-Ma/OVMR
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
Language:Python23 3 31
Whalesong-zrs/Towards-Fine-grained-HBOE
The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).
Language:Python16 1 00
neurowelt/AnimateDiff
Official implementation of AnimateDiff.
Language:Python5 0 00
ryanpo/custom-diffusion-lora
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Language:Python2 0 01

WuTao-CS

WuTao-CS's Stars

Stability-AI/stablediffusion

IDEA-Research/Grounded-Segment-Anything

HumanAIGC/AnimateAnyone

PKU-YuanGroup/Open-Sora-Plan

cloneofsimo/lora

pengsida/learning_research

tencent-ailab/IP-Adapter

ahmetbersoz/chatgpt-prompts-for-academic-writing

ali-vilab/VGen

Doubiiu/DynamiCrafter

InternLM/InternLM-XComposer

ChenHsing/Awesome-Video-Diffusion-Models

baaivision/Emu

TencentARC/MotionCtrl

ali-vilab/videocomposer

Vchitect/LaVie

ExponentialML/Text-To-Video-Finetuning

m-bain/webvid

Delppine1024/TGreen

ali-vilab/Cones-V2

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

ID-Animator/ID-Animator

atfortes/Awesome-Controllable-Generation

OPPO-Mente-Lab/Subject-Diffusion

Akaneqwq/360DVD

NPU-YanChi/diff-gaussian-rasterization-for-gsslam

Zehong-Ma/OVMR

Whalesong-zrs/Towards-Fine-grained-HBOE

neurowelt/AnimateDiff

ryanpo/custom-diffusion-lora