mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
PythonMIT
Stargazers
- 232525Personal
- bellos1203Seoul National University
- BigHyf
- bryant1410@Netflix
- chenkq7
- csarronApple AIML
- dogu02
- ellenzhuwangUIC
- fallcat
- fly51flyPRIS
- g8a9Lisbon
- haowang-cquChongqing University
- HarmanDotpyGoogle
- hiker-lw
- HritikbansalUCLA
- joemzhaoici
- Kiwisher
- linzhiqiu
- lwaekfjlkCKC@ZJU -> LTI@CMU -> CS@UIUC
- Lycus99SIAT
- LZH-053Western University
- mu-caiUniversity of Wisconsin - Madison
- munanning
- noparkeeNAVER Corp.
- patrickjohncyh
- PolarShake
- sachit-menon
- Scarecrow0ShanghaiTech University @SHTUPLUS
- sethzhao506
- silviattiObjective, Inc
- tlin-taolin@epfml
- vinidXyla
- Weixin-LiangStanford University
- willxxyCarnegie Mellon University
- xing0047Zhejiang University, Nanyang Technological University
- YuzheWangPKUPeking University