speech-synthesis

There are 1388 repositories under speech-synthesis topic.

  • kalliope

    Kalliope is a framework that will help you to create your own personal assistant.

    Language:Python1.8k
  • RHVoice

    a free and open source speech synthesizer for Russian and other languages

    Language:C++1.7k
  • IMS-Toucan

    IMS-Toucan

    Controllable and fast Text-to-Speech for over 7000 languages!

    Language:Python1.6k
  • ParallelWaveGAN

    Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

    Language:Jupyter Notebook1.6k
  • OpenSeq2Seq

    Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

    Language:Python1.6k
  • SpeechT5

    Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

    Language:Python1.4k
  • open-speech-corpora

    💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

  • SAM

    Software Automatic Mouth - Tiny Speech Synthesizer

    Language:C1.4k
  • ComfyUI_Custom_Nodes_AlekPet

    Custom nodes that extend the capabilities of Comfyui

    Language:JavaScript1.3k
  • naturalspeech2-pytorch

    Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

    Language:Python1.3k
  • merlin

    This is now the official location of the Merlin project.

    Language:Python1.3k
  • pororo

    PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

    Language:Python1.3k
  • artyom.js

    A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

    Language:JavaScript1.3k
  • World

    A high-quality speech analysis, manipulation and synthesis system

    Language:C++1.3k
  • voicefixer

    General Speech Restoration

    Language:Python1.2k
  • StreamSpeech

    StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

    Language:Python1.1k
  • dsnote

    Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

    Language:C++1.1k
  • BigVGAN

    Official PyTorch implementation of BigVGAN (ICLR 2023)

    Language:Python1.1k
  • autovc

    AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

    Language:Python1.1k
  • Irene-Voice-Assistant

    Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.

    Language:Python1.1k
  • YourTTS

    YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

    Language:Jupyter Notebook1k
  • melgan-neurips

    GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

    Language:Python1k
  • NATSpeech

    NATSpeech

    A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

    Language:Python997
  • Cognitive-Speech-TTS

    Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

    Language:C#984
  • AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

    Language:Python978
  • athena

    an open-source implementation of sequence-to-sequence based speech processing engine

    Language:C++959
  • flowtron

    Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

    Language:Jupyter Notebook898
  • FastSpeech

    The Implementation of FastSpeech based on pytorch.

    Language:Python876
  • diffwave

    DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

    Language:Python861
  • NISQA

    NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

    Language:Python851
  • Multilingual_Text_to_Speech

    An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

    Language:Python838
  • FireRedTTS

    An Open-Sourced LLM-empowered Foundation TTS System

    Language:Python795
  • glow-tts

    A Generative Flow for Text-to-Speech via Monotonic Alignment Search

    Language:Python698
  • INTERSPEECH-2023-24-Papers

    INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

  • voice-builder

    An opensource text-to-speech (TTS) voice building tool

    Language:JavaScript682
  • sam

    Software Automatic Mouth - Tiny Speech Synthesizer

    Language:JavaScript675