/MultipanelVQA

Code for the MultipanelVQA benchmark from "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA".

Primary Language: Jupyter Notebook
License: MIT
