TRI-ML/vlm-evaluation

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python, NOASSERTION

Issues

conflict error: pip install -e .
#11 opened 4 months ago by huangwenjunlovedy
2
The issue of abnormal indicators.
#10 opened 5 months ago by tayton42
2
Question about the Dataset Type
#8 opened 5 months ago by Hannibal046
2
Error when evaluating on POPE-full
#2 opened 5 months ago by djghosh13
4
Evaluation hangs with accelerate over multiple gpus.
#4 opened 6 months ago by tyleryzhu
3
About the number of POPE dataset
#9 opened 5 months ago by Hannibal046
1
Evaluation for more datasets
#7 opened 5 months ago by Lauch1ng
1
Slow model inference when evaluation
#3 opened 6 months ago by zeyuanyin
1