speech-synthesis
There are 1388 repositories under speech-synthesis topic.
kalliope
Kalliope is a framework that will help you to create your own personal assistant.
RHVoice
a free and open source speech synthesizer for Russian and other languages
IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
SAM
Software Automatic Mouth - Tiny Speech Synthesizer
ComfyUI_Custom_Nodes_AlekPet
Custom nodes that extend the capabilities of Comfyui
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
merlin
This is now the official location of the Merlin project.
pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
World
A high-quality speech analysis, manipulation and synthesis system
voicefixer
General Speech Restoration
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
dsnote
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Irene-Voice-Assistant
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Cognitive-Speech-TTS
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
AI-Waifu-Vtuber
AI Vtuber for Streaming on Youtube/Twitch
athena
an open-source implementation of sequence-to-sequence based speech processing engine
flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
FastSpeech
The Implementation of FastSpeech based on pytorch.
diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
voice-builder
An opensource text-to-speech (TTS) voice building tool
sam
Software Automatic Mouth - Tiny Speech Synthesizer