This repository contains a spatial understanding test suite for vision-language models.
Primary language: Python