MMStar-Benchmark/MMStar

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python

Issues

Is there any code not using VLMEvalKit?
#11 opened 5 months ago by URRealHero
0
How to contribute to the mmstar leaderboard？
#10 opened 5 months ago by lucasjinreal
0
Eval Error: ZeroDivisionError: float division by zero
#9 opened 7 months ago by GasolSun36
0
Dataset Metadata
#8 opened 8 months ago by LengSicong
0
Qwen-VL-Chat doesn't follow prompt
#7 opened 8 months ago by RifleZhang
1
The calculation of ML metrics is not quite appropriate.
#6 opened 9 months ago by echo840
1
How are the values of MG and ML calculated?
#4 opened 9 months ago by mary-0830
3
Error when inference
#2 opened 9 months ago by Cuiunbo
1
Great work
#3 opened 9 months ago by gordonhu608
1
any plan to release the dataset in the near future?
#1 opened 9 months ago by demoninpiano
1