Yangyi-Chen/SOLO

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter NotebookApache-2.0