mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Python
Watchers
- AaddX-ai
- ae86208 (Alibaba)
- AlexTurner90
- austingg (Hanzhou, China)
- chongruo (CA)
- CircleRadon (Zhejiang University)
- daspinaki
- drchrisscheier
- eemailme
- FDInSky (Baidu)
- Glupayy
- hanoonaR (MBZUAI)
- hmxiong (Dalian University of Technology)
- jetyingjia (china)
- jijun-cheng (East China Normal University)
- joez17 (Beijing)
- justicelee
- LiWentomng (Zhejiang University)
- mmaaz60 (@mbzuai)
- nahidalam
- niconielsen32 (Denmark)
- skadambi20
- sneccc
- tfgbestneal
- tianbaochou
- xing0047 (Zhejiang University, Nanyang Technological University)
- xxl007
- yangboyd (BeiJing)
- yunglechao
- yzj2019
- zc-zhao (Huazhong University of Science and Technology)