SiyuanHuang95/ManipVQA
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Python
Issues
- 2
Infer on Demo data
#7 opened by Greatsjk - 5
- 3
- 4
- 2
LLM dir seems incomplete
#5 opened by zaixing-wang - 2
using the pretain model infer images
#2 opened by PredyDaddy - 2
image_test
#1 opened by GentlesJan