cambridgeltl/visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
PythonApache-2.0
Stargazers
- hetong007Shanghai
- DTennant
- BellXP
- wghrayeb
- kingh0730Berkeley, California
- nschlemmHamburg
- gpantaz
- joocjunSeoul
- para-lost
- YuxiXieSingapore
- baajarmehAjjur, Palestine
- zhaohengyuan1Singapore
- xdhhh
- JamesSand
- lorenmtLondon, UK
- ngthanhtin
- cceydaSeoul, Korea
- BUAAPY
- haoenzhengBeijing
- raonigabrielCuritiba - Paraná, Brazil
- BaiqiLi123Shanghai, China
- adda1221
- Pefect96
- chioin
- ytaek-ohDaejeon, South Korea
- jun297
- HIT-peijinbeijin
- ererdewubudesi
- zhimin-zCanada
- yjhdhr
- ProNeverFakeMünchen
- bpiyushOxford
- ShreJais
- jihaonewHong Kong
- hirunimaUSA
- darkpromise98Hefei, China