Pinned Repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
LiveTalking
Real time interactive streaming digital human
MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
midjourney-proxy
MidJourney-api FORK
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
qazwsx123-design's Repositories
qazwsx123-design doesn’t have any repository yet.