/SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Primary LanguagePythonBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause

Issues