Pinned Repositories
dhg-wei's Repositories
dhg-wei/DeCap
(ICLR 2023) DeCap: Decoding CLIP Latents for Zero-shot Captioning
dhg-wei/TOPA
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
dhg-wei/MCL
(ICML 2024) MCL: Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning