/polyglot-api

API For Real-Time Voice Translation System using the Seamless M4T V2 Model

Primary LanguageJupyter Notebook

Polyglot API

This is a FastAPI application that uses the SeamlessM4Tv2ForSpeechToSpeech model from Hugging Face to process audio files.

Features

  • Process audio files and return the translated audio file.
  • Handle simultaneous parallel requests.
  • Logging with Gunicorn and colorlog.
  • Dockerized application.
  • API documentation with Swagger UI.