This is an implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021).
Primary language: Python. License: Other (NOASSERTION).