TRI-ML/vlm-evaluation
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
PythonNOASSERTION
Issues
- 2
conflict error: pip install - e .
#11 opened by huangwenjunlovedy - 2
The issue of abnormal indicators.
#10 opened by tayton42 - 2
Question about the Dataset Type
#8 opened by Hannibal046 - 4
Error when evaluating on POPE-full
#2 opened by djghosh13 - 3
- 1
About the number of POPE dataset
#9 opened by Hannibal046 - 1
Evaluation for more datasets
#7 opened by Lauch1ng - 1
Slow model inference when evaluation
#3 opened by zeyuanyin