SiyuanHuang95/ManipVQA

[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Python

Issues

Infer on Demo data
#7 opened 4 months ago by Greatsjk
2
Datasets
#4 opened 3 months ago by RussRobin
5
Fine-tuning and inference of ManipVQA on less GPU resources
#3 opened 7 months ago by hyang1974
3
Publish trained model of this project with smaller size
#6 opened 6 months ago by nacui-intel
4
LLM dir seems incomplete
#5 opened 6 months ago by zaixing-wang
2
Using the pretrained model to infer on images
#2 opened 7 months ago by PredyDaddy
2
image_test
#1 opened 8 months ago by GentlesJan
2