/SpeechCLIP

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022

Primary LanguagePythonBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause

Issues