TECH-IN-EYE/Speech-to-text-data_collection
Speech-to-text data collection with Kafka, Airflow, and Spark. The repository builds a deployable pipeline that posts text and audio files to a data lake and receives them from it, applies transformations in a distributed manner, and loads the results into a warehouse in a format suitable for training a speech-to-text model.
Python
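
The description above names four stages: ingest, land in a data lake, transform, and load into a warehouse. The following is a minimal, standard-library-only sketch of that flow, not the repository's actual code. In-memory containers stand in for the Kafka topic, the data lake, and the warehouse, and the names `Recording`, `transform`, and `run_pipeline`, as well as the 16-bit mono PCM audio format, are assumptions made for illustration.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Recording:
    """One text prompt paired with the audio recorded for it (hypothetical type)."""
    prompt: str
    audio: bytes              # assumed: raw 16-bit mono PCM samples
    sample_rate: int = 16_000


def transform(rec: Recording) -> dict:
    """Normalise the transcript and derive the clip duration in seconds."""
    n_samples = len(rec.audio) // 2   # 2 bytes per 16-bit sample
    return {
        "transcript": " ".join(rec.prompt.lower().split()),
        "duration_s": n_samples / rec.sample_rate,
        "sample_rate": rec.sample_rate,
    }


def run_pipeline(recordings):
    topic = deque()    # stand-in for a Kafka topic
    data_lake = []     # stand-in for raw storage in the data lake
    warehouse = []     # stand-in for the warehouse table

    # 1. Ingest: producers post each recording to the topic.
    for rec in recordings:
        topic.append(rec)

    # 2. Land: a consumer drains the topic into the data lake unchanged.
    while topic:
        data_lake.append(topic.popleft())

    # 3. Transform and load: in a real deployment this loop would be a
    #    distributed job (e.g. Spark) scheduled by an orchestrator (e.g. Airflow).
    for rec in data_lake:
        row = transform(rec)
        if row["duration_s"] > 0 and row["transcript"]:
            warehouse.append(row)   # keep only usable training examples

    return warehouse


rows = run_pipeline([
    Recording("  Hello   World ", b"\x00\x00" * 16_000),  # one second of silence
    Recording("empty clip", b""),                          # dropped: no audio
])
print(rows)
```

Keeping `transform` a pure function of a single record is a deliberate choice in this sketch: a function of that shape can be applied per record in parallel, which is what the distributed transformation step needs.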