Yangyi-Chen/SOLO
Public code repo for the paper "A Single Transformer for Scalable Vision-Language Modeling"
Jupyter Notebook · Apache-2.0
Stargazers
- amadeuzou
- Bohao-Lee (Shenzhen)
- chuanmingliu (Westeros)
- csuhan (CUHK)
- foreveryh (NeuX)
- forrestbing (Alibaba Inc)
- hehao13 (CUHK)
- HubHop (DeepSeek)
- iamshnoo (Virginia, USA)
- ilovecv
- JamesHujy (Tsinghua University)
- jiangzhengkai (HKUST)
- jojojoe
- kidmam
- liewjunhao
- limbo0000 (Chinese University of Hong Kong)
- linjc16 (University of Illinois Urbana-Champaign)
- lizhengwei1992 (Hang Zhou)
- lulafun (Asia/Shanghai)
- Minami-su
- sathishkumar67 (Hosur, Tamil Nadu, India)
- scalabled (Scalable Dynamics)
- shizhediao (NVIDIA)
- ShujinWu-0814 (Los Angeles, CA)
- Space-Xun (NLPR)
- tangxinvc
- tomatobobot (United States)
- tumurzakov
- utopic-dev
- vishaal27 (University of Tübingen | University of Cambridge)
- yangluo23
- Yangyi-Chen (University of Illinois Urbana-Champaign)
- Yifehuang97 (Stony Brook)
- ytaek-oh (KAIST)
- ZihanWang314
- zouhaoa (Zhejiang University)