qazwsx123-design

Pinned Repositories

GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python00
echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python00
echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language:Python00
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python00
LiveTalking
Real time interactive streaming digital human
Language:Python00
MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language:Python00
midjourney-proxy
MidJourney-api FORK
Language:Java00
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Language:Python00
MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
Language:Python00
TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Language:Python00

qazwsx123-design doesn’t have any repository yet.