Pinned Repositories
VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
LFAV
Towards Long Form Audio-visual Video Understanding
Hou9612.github.io
个人主页
lumo-implement-old
This repository is unmaintained, please see lumo for details.
PSTP-Net-visualization-results
MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
nncore
📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.
Hou9612's Repositories
Hou9612/Hou9612.github.io
个人主页
Hou9612/lumo-implement-old
This repository is unmaintained, please see lumo for details.
Hou9612/PSTP-Net-visualization-results
Hou9612/ShiArthur03