/cross-modal-speech-segment-retrieval

Learning a common representation space from speech and text for cross-modal retrieval given textual queries and speech files.

Primary LanguagePython

No issues in this repository yet.