/STT-Data-Collection

This project involves development of a ETL data pipeline that allows streaming millions of Amharic and Swahili speech audio files and speakers providing transcription texts for data collection in a web platforms.

Primary LanguageJupyter NotebookMIT LicenseMIT

No issues in this repository yet.