/LAVISH

Vision Transformers are Parameter-Efficient Audio-Visual Learners

Primary LanguagePython

Watchers

No one’s watching this repository yet.