vocoder

There are 147 repositories under the vocoder topic.

coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 38.9k 309 1.2k4.9k
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python 11.7k 187 2k1.9k
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language: Jupyter Notebook 9.8k 186 5661.3k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python 8.9k 81 256692
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language: Python 8.3k 50 01.2k
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
Language: Python 3.9k 79 687813
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python 2.1k 31 165525
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Language: Jupyter Notebook 1.6k 47 256343
mmorise/World
A high-quality speech analysis, manipulation and synthesis system
Language: C++ 1.2k 70 94256
haoheliu/voicefixer
General Speech Restoration
Language: Python 1.1k 17 59134
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language: Python 904 31 57108
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language: Python 815 21 48115
Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
Language: Python 409 19 2761
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language: Jupyter Notebook 402 16 2654
rishikksh20/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Language: Python 319 10 1861
szechyjs/mbelib
P25 Phase 1 and ProVoice vocoder
Language: C++ 285 50 17118
lmnt-com/wavegrad
A fast, high-quality neural vocoder.
Language: Python 279 14 1548
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Language: Python 271 11 946
rishikksh20/iSTFTNet-pytorch
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Language: Python 242 7 1949
sh123/codec2_talkie
Turn your Android phone into an Amateur Radio Codec2/OPUS APRS-enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)
Language: Java 240 20 5140
NTT123/vietTTS
Vietnamese Text to Speech library
Language: Python 223 16 3499
maum-ai/phaseaug
ICASSP 2023 Accepted
Language: Python 189 5 1214
descriptinc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Language: Python 187 21 1430
HidekiKawahara/legacy_STRAIGHT
A vocoder framework that has been widely used in the research community since 1999.
Language: MATLAB 180 20 644
k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Language: Python 171 9 2831
hhguo/MSMC-TTS
Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS
Language: Python 162 15 917
yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Language: Python 156 9 1213
xcmyz/FastVocoder
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.
Language: Python 154 3 1119
ncsoft/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Language: Python 150 4 519
Rongjiehuang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Language: Python 138 8 1220
geneing/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
Language: Python 132 16 1637
jurihock/stftPitchShift
STFT-based real-time pitch and timbre shifting in C++ and Python
Language: C 131 6 5117
X-LANCE/UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Language: Python 128 10 916
rishikksh20/Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Language: Python 118 13 415
iamycy/golf
A DDSP-based neural voice synthesiser.
Language: Jupyter Notebook 114 6 139
magnetophon/VoiceOfFaust
Turn your voice into a synthesizer!
Language: Faust 112 9 72
