cambridgeltl/visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
PythonApache-2.0
Stargazers
- 545487677Shenzhen,China
- bbhasnat
- billqxg
- CKchaosAalto University
- fly51flyPRIS
- fuzihaofzhUniversity of Cambridge
- geyuyingTencent ARC Lab
- ggsonic
- GrassBroCityU
- hardyqrGoogle DeepMind
- hzchua
- isaaccorleyUTSA
- jantpkBeijing
- JeffCarpenterCanada
- jianghaojunTsinghua University
- juletxHitz Zentroa UPV/EHU
- kaustubholpadkarStony Brook University
- KT27-ACity University of Hong Kong
- liujiahengBeihang University (BUAA)
- LuvataHanoi - Vietnam
- m-bainVGG, University of Oxford
- mengzaiqiaoUniversity of Glasgow
- peteralexandercharles
- realprocrastinatorXiaomi
- SaghebK
- SohojoeMicrosoft
- songbohuUniversity of Cambridge
- suyanzhou626UESTC
- syu-idThe Hong Kong Polytechnic University
- TengdaHanOxford, UK
- tgyy1995
- tomchen-ctj
- weiyunfei
- whtitefallUniversity of Ottawa
- yxuansuCohere, University of Cambridge
- zhjohnchanThe Chinese University of Hong Kong, Shenzhen