DiegoSouzaPW/polyglot-api

API For Real-Time Voice Translation System using the Seamless M4T V2 Model

Jupyter Notebook

Polyglot API

This is a FastAPI application that uses the SeamlessM4Tv2ForSpeechToSpeech model from Hugging Face to process audio files.

Features

Process audio files and return the translated audio file.
Handle simultaneous parallel requests.
Logging with Gunicorn and colorlog.
Dockerized application.
API documentation with Swagger UI.