/Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers