A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
Primary LanguagePython
No issues in this repository yet.