kznmft

kznmft's Stars

dockur/windows
Windows inside a Docker container.
Language: Shell 27.8k 145 5281.9k
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language: TypeScript 18.5k 98 3881.4k
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Language: Python 11.5k 154 3481k
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language: Python 7k 76 609718
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language: Jupyter Notebook 5.9k 86 145596
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language: Python 4.7k 313 124598
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Language: Python 3k 32 135264
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language: Python 2.8k 51 198344
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language: Python 2.6k 32 132206
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python 2.5k 34 116262
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language: Python 1.7k 23 106176
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Language: Python 1.6k 29 218224
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language: Python 1.4k 43 58148
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python 1.3k 21 29125
mayuelala/FollowYourPose
[AAAI 2024] Follow-Your-Pose: Official implementation of "Follow-Your-Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Language: Python 1.3k 25 5290
showlab/Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
Language: Python 1.1k 39 1962
hotshotco/Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
Language: Python 1.1k 13 4484
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
Language: Python 949 68 2342
yerfor/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
Language: Python 945 23 78108
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with DreamBooth, achieving stunning videos.
Language: Python 913 22 4175
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Language: Python 913 25 3064
Vchitect/LaVie
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
Language: Python 877 28 2659
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
Language: Python 851 17 107159
jy0205/LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Language: Jupyter Notebook 528 15 3529
Zhen-Dong/Magic-Me
Code for ID-Specific Video Customized Diffusion
Language: Python 460 14 1338
showlab/DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Language: Python 428 16 2414
TIGER-AI-Lab/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
Language: Python 216 16 2515
YBYBZhang/VideoElevator
[arXiv 2024] Official PyTorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models"
Language: Python 147 12 45
hohonu-vicml/TrailBlazer
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
Language: Python 91 7 1010
jylins/videoxum
[TMM 2023] VideoXum: Cross-modal Visual and Textual Summarization of Videos
Language: Python 34 2 23