Pinned Repositories
echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
ControlNet
Let us control diffusion models!
CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Latte
Latte: Latent Diffusion Transformer for Video Generation.
cogvideo-factory
Memory-optimized training scripts for video models based on Diffusers
LetsTalk
Latent Diffusion Transformer for Talking Video Synthesis
lightning-Latte
Latte: Latent Diffusion Transformer for Video Generation.
wesam
[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation"
zhang-haojie.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
zhang-haojie's Repositories
zhang-haojie/wesam
[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation"
zhang-haojie/LetsTalk
Latent Diffusion Transformer for Talking Video Synthesis
zhang-haojie/lightning-Latte
Latte: Latent Diffusion Transformer for Video Generation.
zhang-haojie/zhang-haojie.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage