/speech_pipeline

The speech_pipeline container handles multilingual ASR, speaker diarization, translation of the ASR to English, word-level alignment for some languages, and can produce a VTT file for use of subtitles in videos.

Primary LanguagePythonMIT LicenseMIT

Stargazers