Pinned Repositories
3D_deformable-for-VAD
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Video-XL
🔥🔥First-ever hour scale video understanding models
GPT4V-level open-source multi-modal model based on Llama3-8B
🔥🔥First-ever hour scale video understanding models