Pinned Repositories
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
shikra
hzdzkjdxyjs's Repositories
hzdzkjdxyjs doesn’t have any repository yet.