atosystem/SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
PythonBSD-3-Clause
Issues
- 3
- 1
about speech-text implement
#4 opened by xiaoyaoxiaoxian - 1
about training codes
#5 opened by Benjizhang - 3
ImportError: cannot import name 'LightningLoggerBase' from 'pytorch_lightning.loggers'
#3 opened by seongq - 4
Simple Embeddings
#2 opened by corranmac - 2
Dataset source?
#1 opened by FlyToYourMooN