speech-synthesis

There are 1,388 repositories under the speech-synthesis topic.

kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Language: Python · 1.8k
RHVoice
A free and open-source speech synthesizer for Russian and other languages
Language: C++ · 1.7k
IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
Language: Python · 1.6k
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Language: Jupyter Notebook · 1.6k
OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Language: Python · 1.6k
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language: Python · 1.4k
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
1.4k
SAM
Software Automatic Mouth - Tiny Speech Synthesizer
Language: C · 1.4k
ComfyUI_Custom_Nodes_AlekPet
Custom nodes that extend the capabilities of ComfyUI
Language: JavaScript · 1.3k
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
Language: Python · 1.3k
merlin
This is now the official location of the Merlin project.
Language: Python · 1.3k
pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Language: Python · 1.3k
artyom.js
A voice control, voice command, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
Language: JavaScript · 1.3k
World
A high-quality speech analysis, manipulation and synthesis system
Language: C++ · 1.3k
voicefixer
General Speech Restoration
Language: Python · 1.2k
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language: Python · 1.1k
dsnote
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Language: C++ · 1.1k
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language: Python · 1.1k
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Language: Python · 1.1k
Irene-Voice-Assistant
Irina is a Russian voice assistant that works offline. It supports skills via plugins.
Language: Python · 1.1k
YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Language: Jupyter Notebook · 1k
melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Language: Python · 1k
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language: Python · 997
Cognitive-Speech-TTS
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Language: C# · 984
AI-Waifu-Vtuber
AI Vtuber for Streaming on Youtube/Twitch
Language: Python · 978
athena
An open-source implementation of a sequence-to-sequence based speech processing engine
Language: C++ · 959
flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Language: Jupyter Notebook · 898
FastSpeech
An implementation of FastSpeech based on PyTorch.
Language: Python · 876
diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language: Python · 861
NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Language: Python · 851
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language: Python · 838
FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language: Python · 795
glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Language: Python · 698
INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
682
voice-builder
An open-source text-to-speech (TTS) voice building tool
Language: JavaScript · 682
sam
Software Automatic Mouth - Tiny Speech Synthesizer
Language: JavaScript · 675