/groundingLMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Primary LanguagePython

Watchers

No one’s watching this repository yet.