MMStar-Benchmark/MMStar
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"
Python
Issues
Is there any code not using VLMEvalKit?
#11 opened by URRealHero - 0
How to contribute to the mmstar leaderboard?
#10 opened by lucasjinreal - 0
Dataset Metadata
#8 opened by LengSicong - 1
Qwen-VL-Chat doesn't follow prompt
#7 opened by RifleZhang - 1
How are the values of MG and ML calculated?
#4 opened by mary-0830 - 1
Error when inference
#2 opened by Cuiunbo - 1
Great work
#3 opened by gordonhu608 - 1