MMStar-Benchmark/MMStar
This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"
Python
Issues
Dataset Metadata
#8 opened - 1
Qwen-VL-Chat doesn't follow prompt
#7 opened - 1
Great work
#3 opened - 1
Error when inference
#2 opened - 1