Pinned Repositories
echomimic_v2
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
MagicDance
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
TransGPT
LongVA
Long Context Transfer from Language to Vision
LLaVA-NeXT
syntax_aware_local_attention
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
LongVLM