phonemization
There are 5 repositories under the phonemization topic.
spring-media/DeepPhonemizer
Grapheme-to-phoneme conversion with deep learning.
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX Runtime. Multi-speaker, multi-platform, and multilingual. Integrate it into your .NET projects using a plug-and-play NuGet package, complete with all voices.
de-mh/persian_phonemizer
A tool for translating Persian text to IPA (International Phonetic Alphabet).
Mobile-Artificial-Intelligence/babylon.cpp
Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization, an ONNX Runtime port of the DeepPhonemizer model is used. For speech synthesis, VITS models are used. Piper models are compatible after a conversion script is run.
roglz/mxesco-docker
A modular FastAPI-based application for audio processing, featuring transcription, phonemization, metadata generation, and MongoDB storage. Powered by Whisper, Wav2Vec2, and Docker.