/MultipanelVQA

Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"

Primary language: Jupyter Notebook. License: MIT.
