text-to-video

There are 167 repositories under the text-to-video topic.

THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python 11.2k 135 5941.1k
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Language: Python 8.4k 115 301793
Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python 7.9k 41 109695
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language: Python 5k 71 85385
promptslab/Awesome-Prompt-Engineering
This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
Language: Python 4.9k 78 1488
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
3.2k 53 9269
SamurAIGPT/AI-Youtube-Shorts-Generator
A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Language: Python 2.6k 33 22403
FurkanGozukara/Stable-Diffusion
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney, RunPod
Language: JavaScript 2.5k 100 45346
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
2.2k 55 16111
lucidrains/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Language: Python 2k 68 16190
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Language: Python 1.7k 73 45139
camenduru/text-to-video-synthesis-colab
Text To Video Synthesis Colab
Language: Jupyter Notebook 1.5k 23 24184
Phantom-video/Phantom
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Language: Python 1.4k 60 387
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Language: Python 1.3k 28 35139
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python 1.3k 18 30124
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Language: Python 1.2k 13 11771
hotshotco/Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
Language: Python 1.1k 14 4692
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"
Language: Python 1.1k 23 4342
video-db/Director
AI video agents framework for next-gen video interactions and workflows.
Language: Python 1.1k 10 34173
showlab/MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Language: Python 1k 32 4357
brainrotjs/brainrot.js
Text to video generator in the brainrot form. Learn about any topic from your favorite personalities 😼.
Language: Python 792 13 28103
eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video
Language: Python 790 23 37104
lucidrains/phenaki-pytorch
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
Language: Python 780 37 3281
PKU-YuanGroup/ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Language: Python 761 12 4639
jianzhnie/awesome-text-to-video
A Survey on Text-to-Video Generation/Synthesis.
704 16 389
PaddlePaddle/PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Language: Python 697 24 198218
ExponentialML/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
Language: Python 687 18 68108
SamurAIGPT/Text-To-Video-AI
Generate video from text using AI
Language: Jupyter Notebook 619 11 18237
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language: Python 555 21 2929
lucidrains/nuwa-pytorch
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Language: Python 549 22 956
jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python 519 13 2064
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python 513 5 2021
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language: HTML 509 15 530
Zhen-Dong/Magic-Me
Codes for ID-Specific Video Customized Diffusion
Language: Python 457 14 1638
VideoVerses/VideoTuna
Let's finetune video generation models!
Language: Python 437 11 1121
sibozhang/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Language: Python 436 11 2194
