facebookresearch/unibench

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter NotebookNOASSERTION

Issues

Error evaluating the sun397 dataset
#10 opened a month ago by mseitzer
0
No local evaluation is happening
#9 opened 2 months ago by chaitanyakrishna1248
1
Dataset is not found.
#7 opened 3 months ago by PrettyMagnolia
3
Error in unibench list_benchmarks: ValueError when converting 'zero-shot' to float
#6 opened 3 months ago by jun297
3
unibench show_results - ValueError: too many inputs
#4 opened 3 months ago by LudovicArnould1
1
Adversarial attacks
#3 opened 4 months ago by HashmatShadab
1