vocoder

There are 147 repositories under vocoder topic.

  • coqui-ai/TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    Language:Python38.9k3091.2k4.9k
  • PaddlePaddle/PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Language:Python11.7k1872k1.9k
  • mozilla/TTS

    :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

    Language:Jupyter Notebook9.8k1865661.3k
  • Amphion

    open-mmlab/Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

    Language:Python8.9k81256692
  • fishaudio/Bert-VITS2

    vits2 backbone with multilingual-bert

    Language:Python8.3k5001.2k
  • TensorSpeech/TensorFlowTTS

    :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

    Language:Python3.9k79687813
  • jik876/hifi-gan

    HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

    Language:Python2.1k31165525
  • kan-bayashi/ParallelWaveGAN

    Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

    Language:Jupyter Notebook1.6k47256343
  • mmorise/World

    A high-quality speech analysis, manipulation and synthesis system

    Language:C++1.2k7094256
  • haoheliu/voicefixer

    General Speech Restoration

    Language:Python1.1k1759134
  • gemelo-ai/vocos

    Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

    Language:Python9043157108
  • lmnt-com/diffwave

    DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

    Language:Python8152148115
  • Rongjiehuang/FastDiff

    PyTorch Implementation of FastDiff (IJCAI'22)

    Language:Python409192761
  • ivanvovk/WaveGrad

    Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.

    Language:Jupyter Notebook402162654
  • rishikksh20/VocGAN

    VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

    Language:Python319101861
  • szechyjs/mbelib

    P25 Phase 1 and ProVoice vocoder

    Language:C++2855017118
  • lmnt-com/wavegrad

    A fast, high-quality neural vocoder.

    Language:Python279141548
  • maum-ai/univnet

    Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

    Language:Python27111946
  • rishikksh20/iSTFTNet-pytorch

    iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

    Language:Python24271949
  • sh123/codec2_talkie

    Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)

    Language:Java240205140
  • NTT123/vietTTS

    Vietnamese Text to Speech library

    Language:Python223163499
  • maum-ai/phaseaug

    ICASSP 2023 Accepted

    Language:Python18951214
  • descriptinc/cargan

    Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

    Language:Python187211430
  • HidekiKawahara/legacy_STRAIGHT

    A vocoder framework which had been widely used in research community since 1999.

    Language:MATLAB18020644
  • k2kobayashi/crank

    A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

    Language:Python17192831
  • hhguo/MSMC-TTS

    Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

    Language:Python16215917
  • yl4579/HiFTNet

    HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

    Language:Python15691213
  • xcmyz/FastVocoder

    Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

    Language:Python15431119
  • ncsoft/avocodo

    Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

    Language:Python1504519
  • Rongjiehuang/Multi-Singer

    PyTorch Implementation of Multi-Singer (ACM-MM'21)

    Language:Python13881220
  • geneing/WaveRNN-Pytorch

    Fatcord's Alternative WaveRNN (Faster training)

    Language:Python132161637
  • jurihock/stftPitchShift

    STFT based real-time pitch and timbre shifting in C++ and Python

    Language:C13165117
  • X-LANCE/UniCATS-CTX-vec2wav

    [AAAI 2024] Code for CTX-vec2wav in UniCATS

    Language:Python12810916
  • rishikksh20/Avocodo-pytorch

    Avocodo: Generative Adversarial Network for Artifact-free Vocoder

    Language:Python11813415
  • iamycy/golf

    A DDSP-based neural voice synthesiser.

    Language:Jupyter Notebook1146139
  • magnetophon/VoiceOfFaust

    Turn your voice into a synthesizer!

    Language:Faust112972