phonemization

There are 5 repositories under phonemization topic.

  • spring-media/DeepPhonemizer

    Grapheme to phoneme conversion with deep learning.

    Language:Python381183445
  • Lyrcaxis/KokoroSharp

    Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and multilingual. Integrate on your .NET projects using a plug-and-play NuGet package, complete with all voices.

    Language:C#1005134
  • de-mh/persian_phonemizer

    A tool for translating Persian text to IPA (International Phonetic Alphabet).

    Language:Python64333
  • Mobile-Artificial-Intelligence/babylon.cpp

    Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

    Language:Python16202
  • roglz/mxesco-docker

    A modular FastAPI-based application for audio processing, featuring transcription, phonemization, metadata generation, and MongoDB storage. Powered by Whisper, Wav2Vec2, and Docker

    Language:Python