Pinned Repositories
my_mwcnn
swin
VTG-LLM
[Preprint] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
GPT4RoI
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
mamba
Mamba SSM architecture