tts

There are 3,642 repositories under the tts topic.

CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python 58.8k 943 1.1k 9.4k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 52.1k 258 2k 5.7k
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Language: Python 48.1k 272 2.7k 3.9k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 43.4k 331 1.2k 5.7k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python 38.1k 211 6604.1k
mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference
Language: Go 37.9k 250 1.2k 3k
babysor/MockingBird
🚀 AI voice mimicry: clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python 36.7k 301 8905.3k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Language: Python 35.4k 245 3373.9k
fishaudio/fish-speech
SOTA Open Source TTS
Language: Python 24k 136 6302k
mastra-ai/mastra
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
Language: TypeScript 18.1k 78 2.2k 1.3k
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python 17.2k 115 1.4k 1.9k
NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python 16.1k 228 2.9k 3.2k
pot-app/pot-desktop
🌈 A cross-platform app for translating selected text and for OCR.
Language: JavaScript 15.7k 55 902741
DrewThomasson/ebook2audiobook
Generate audiobooks from e-books, voice cloning & 1107+ languages!
Language: Python 15k 74 3221.2k
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Language: Python 15k 94 3541.7k
readest/readest
Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
Language: TypeScript 14.6k 50 1.2k 771
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python 12.3k 185 2k 1.9k
rhasspy/piper
A fast, local neural text to speech system
Language: C++ 10.2k 96 601852
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language: Jupyter Notebook 10k 184 5691.3k
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language: Python 9.3k 65 293889
krillinai/KrillinAI
Video translation and dubbing tool powered by LLMs. The video translator offers translation across 100 languages and one-click full-process deployment. Output is optimized for platforms such as YouTube, TikTok, Douyin, Xiaohongshu, Bilibili, and WeChat Channels.
Language: Go 8.9k 41 120725
jianchang512/clone-voice
A voice cloning tool with a web interface that uses your own voice or any other voice to record audio.
Language: Python 8.8k 46 150962
shidahuilang/shuyuan
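Book sources for reading apps: 香色闺阁 + 用心读书 + 源阅 + 阅读 3.0 book sources + 源阅读 + 爱阅书香 + 千阅 + 花火阅读 + 读不舍手 + 番茄 + 喜马拉雅 + comics + audiobooks + book sources + IPTV sources + IPA TrollStore apps = automatic updates
Language: Python 8.8k 78 26505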
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language: Python 8.6k 49 01.2k
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python 8.4k 71 165734
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language: Python 8k 85 158788
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python 7.7k 52 2141.4k
jianchang512/ChatTTS-ui
A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.
Language: Python 7.4k 39 263906
wzpan/wukong-robot
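🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may be the first open-source smart speaker project to support brain-computer interaction.
Language: Python 7k 174 3071.4k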
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language: Python 6.9k 45 264972
LokerL/tts-vue
🎤 A Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite.
Language: TypeScript 6.1k 42 155867
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python 6k 78 243631
canopyai/Orpheus-TTS
Towards Human-Sounding Speech
Language: Python 5.7k 75 219483
santinic/audiblez
Generate audiobooks from e-books
Language: Python 5.6k 34 92381
snakers4/silero-models
Silero Models: pre-trained text-to-speech models made embarrassingly simple
Language: Jupyter Notebook 5.5k 87 136349
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language: Python 5k 34 45451
