/LAVISH

Vision Transformers are Parameter-Efficient Audio-Visual Learners

Primary LanguagePython

No issues in this repository yet.