/YESBUT-v2

We introduce the YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.

Primary LanguageJavaScriptMIT LicenseMIT

YESBUT-v2 Benchmark

This is the Project page for the Paper When ‘YES’ Meets ‘BUT’: Can AI Comprehend Contradictory Humor Through Comparative Reasoning? https://vulab-AI.github.io/YESBUT-v2