gtonkov's Stars
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
logtd/ComfyUI-LTXTricks
A set of ComfyUI nodes providing additional control for the LTX Video model
ali-vilab/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
fallenshock/FlowEdit
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at robotics and AV labs. Cosmos is purpose-built for physical AI. The Cosmos repository enables end users to run the Cosmos models, run inference scripts, and generate videos.
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
genmoai/mochi
The best OSS video generation models
idealo/image-super-resolution
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team
mangiucugna/json_repair
A Python module to repair invalid JSON, commonly used to parse the output of LLMs
IamCreateAI/Ruyi-Models
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
Mrkomiljon/Live_Portrait_Monitor
Bring portraits to life via Monitor!
metercai/SimpleSDXL
Enhanced version of Fooocus for SDXL, more suitable for Chinese and Cloud
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
huggingface/smol-course
A course on aligning smol models.
MCG-NJU/EMA-VFI
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
jdh-algo/JoyVASA
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Lightricks/LTX-Video
Official repository for LTX-Video
a-r-r-o-w/finetrainers
Memory-optimized training scripts for video models based on Diffusers
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.