tts

There are 2279 repositories under tts topic.

  • Real-Time-Voice-Cloning

    CorentinJ/Real-Time-Voice-Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Language:Python51.3k9361.1k8.6k
  • MockingBird

    babysor/MockingBird

    🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Language:Python34.3k3058755.1k
  • lobe-chat

    lobehub/lobe-chat

    🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.

    Language:TypeScript32.6k1611.5k7.8k
  • coqui-ai/TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    Language:Python31.4k2691k3.7k
  • RVC-Boss/GPT-SoVITS

    1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

    Language:Python27.5k1838733.2k
  • myshell-ai/OpenVoice

    Instant voice cloning by MyShell.

    Language:Python26.8k2081982.6k
  • LocalAI

    mudler/LocalAI

    :robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

    Language:C++21.2k1597191.6k
  • NVIDIA/NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Language:Python10.5k1952.1k2.2k
  • PaddlePaddle/PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Language:Python10.4k1851.9k1.8k
  • mozilla/TTS

    :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

    Language:Jupyter Notebook9k1865581.2k
  • pot-desktop

    pot-app/pot-desktop

    🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

    Language:JavaScript8.9k35614412
  • Plachtaa/VALL-E-X

    An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

    Language:Python7.4k83148737
  • fishaudio/Bert-VITS2

    vits2 backbone with multilingual-bert

    Language:Python7.3k4801k
  • jianchang512/clone-voice

    A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

    Language:Python6.6k35116638
  • netease-youdao/EmotiVoice

    EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

    Language:Python6.6k57142552
  • jaywalnut310/vits

    VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

    Language:Python6.4k531991.2k
  • wukong-robot

    wzpan/wukong-robot

    🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。

    Language:Python6k1712881.3k
  • LokerL/tts-vue

    🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。

    Language:TypeScript5.6k41148810
  • rhasspy/piper

    A fast, local neural text to speech system

    Language:C++4.8k68394334
  • snakers4/silero-models

    Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

    Language:Jupyter Notebook4.7k83123295
  • jianchang512/ChatTTS-ui

    一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

    Language:Python4.5k30120488
  • shidahuilang/shuyuan

    香色闺阁+阅读3.0书源+源阅读+爱阅书香+花火阅读+读不舍手+IPTV源+IPA巨魔应用=自动更新

    Language:Python4.5k4418265
  • yl4579/StyleTTS2

    StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

    Language:Python4.3k79167336
  • MoonInTheRiver/DiffSinger

    DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

    Language:Python4.2k4399703
  • rany2/edge-tts

    Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

    Language:Python4.1k41177435
  • myshell-ai/MeloTTS

    High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

    Language:Python3.9k39121457
  • TensorSpeech/TensorFlowTTS

    :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

    Language:Python3.8k78683801
  • collabora/WhisperSpeech

    An Open Source text-to-speech system built by inverting Whisper.

    Language:Jupyter Notebook3.5k7093180
  • metavoiceio/metavoice-src

    Foundational model for human-like, expressive TTS

    Language:Python3.4k76110584
  • keithito/tacotron

    A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

    Language:Python2.9k146323966
  • tts-server-android

    jing332/tts-server-android

    这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读 ,还有自动重试,备用配置,文本替换等更多功能。

    Language:Kotlin2.9k24164236
  • zzw922cn/awesome-speech-recognition-speech-synthesis-papers

    Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

  • enhuiz/vall-e

    An unofficial PyTorch implementation of the audio LM VALL-E

    Language:Python2.9k9297415
  • tensorflow/lingvo

    Lingvo

    Language:Python2.8k120253436
  • liou666/polyglot

    🤖️ Cross-platform AI language practice app (跨平台AI语言练习应用)

    Language:TypeScript2.5k2462270
  • readbeyond/aeneas

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

    Language:Python2.4k75209215