/MA-LMM

(CVPR 2024) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Primary language: Python. License: MIT.
