protector131090's Stars
hkchengrex/MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
iamxym/Deep-Fourier-based-Arbitrary-scale-Super-resolution-for-Real-time-Rendering
SIGGRAPH 2024 Conference Paper: Deep Fourier-based Arbitrary-scale Super-resolution for Real-time Rendering
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
kijai/ComfyUI-HunyuanVideoWrapper
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Lightricks/LTX-Video
Official repository for LTX-Video
genmoai/mochi
The best OSS video generation models
pinokiofactory/cogstudio
aigc-apps/CogVideoX-Fun
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
woct0rdho/triton-windows
Fork of the Triton language and compiler for Windows support
imgly/background-removal-js
Remove backgrounds from images directly in the browser environment with ease and no additional costs or privacy concerns. Explore an interactive demo.
RexanWONG/text-behind-image
https://textbehindimage.rexanwong.xyz - create text behind image designs easily
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
williamyang1991/StyleGANEX
[ICCV 2023] StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
s0md3v/roop
one-click face swap
NVlabs/stylegan2
StyleGAN2 - Official TensorFlow Implementation
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
MachineEditor/MachineVideoEditor
This repository does not contain code, its purpose it for issue tracking and wiki
OutofAi/OutofFocus
An AI focused photo manipulation tool based on Gradio
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
XLabs-AI/x-flux
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait