zhang-haojie

South China University of TechnologyGuangzhou

Pinned Repositories

echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python3.4k 49 196393
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python8.1k 450 1581.3k
ControlNet
Let us control diffusion models!
Language:Python31.2k 219 5602.8k
CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python10.3k 131 520960
Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python1.8k 23 106183
cogvideo-factory
Memory-optimized training scripts for video models based on Diffusers
Language:Python00
LetsTalk
Latent Diffusion Transformer for Talking Video Synthesis
48 7 21
lightning-Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python0 0 00
wesam
[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation"
Language:Python150 7 359
zhang-haojie.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Language:JavaScript0 0 00

zhang-haojie/wesam
[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation"
Language:Python150 7 359
zhang-haojie/LetsTalk
Latent Diffusion Transformer for Talking Video Synthesis
48 7 21
zhang-haojie/lightning-Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python0 0 00
zhang-haojie/zhang-haojie.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Language:JavaScript0 0 00