Python Library to evaluate VLM models' robustness across diverse benchmarks
Primary LanguageJupyter NotebookOtherNOASSERTION