SiyuanHuang95/ManipVQA
[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Python
Stargazers
- aopolin-lv (Harbin Institution Technology, Shenzhen)
- Artanic30 (ShanghaiTech University)
- csuhan (CUHK)
- DongzhuoranZhou
- goxq
- hwfan (@hyperplane-lab @MVIG-SJTU @BUPT)
- jiangzhengkai (Tencent)
- lx704612715
- Neal2020GitHub
- quanfeifan
- Silence1471
- SiyuanHuang95 (Shanghai AI Lab)
- suha4227 (Hunan University)
- superboySB (Beijing Institute of Technology)
- wanshiruyishoubinanshan
- wlongdong
- xiaogangjia (Karlsruhe Institute of Technology)
- YaroslavPonomarenko (Peking University)
- ZJU-PLP (Zhejiang University)
- zwbx (Adelaide, Australia)