SiyuanHuang95/ManipVQA
[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Python
Issues
LLM dir seems incomplete
#5 opened by zaixing-wang - 5
Using the pretrained model to infer images
#2 opened by PredyDaddy - 2
image_test
#1 opened by GentlesJan