A one-stop shop to track all open-access and open-source TTS models as they come out. Feel free to open a PR for any that aren't listed here.
This is intended as a resource to raise awareness of these models and to make it easier for researchers, developers, and enthusiasts to stay informed about the latest advancements in the field.
Note
This repo only tracks TTS models with an open-source or open-access codebase. More motivation for everyone to open-source! 🤗
Name | GitHub | Weights | License | Fine-tune | Languages | Paper | Demo | Issues |
---|---|---|---|---|---|---|---|---|
XTTS | Repo | 🤗 Hub | CPML | Yes | Multilingual | Technical notes | 🤗 Space | Non Commercial |
TorToiSe TTS | Repo | 🤗 Hub | Apache 2.0 | Yes | English | Technical report | 🤗 Space | |
VITS/MMS-TTS | Repo | 🤗 Hub / MMS | Apache 2.0 | Yes | English | Paper | 🤗 Space | |
Pheme | Repo | 🤗 Hub | CC-BY | Yes | English | Paper | 🤗 Space | |
OpenVoice | Repo | 🤗 Hub | CC-BY-NC 4.0 | No | ZH + EN | Paper | 🤗 Space | Non Commercial |
IMS-Toucan | Repo | GH release | Apache 2.0 | Yes | Multilingual | Paper | 🤗 Space | |
Matcha-TTS | Repo | GDrive | MIT | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
pflowTTS | Unofficial Repo | GDrive | MIT | Yes | English | Paper | Not Available | GPL-licensed phonemizer |
StyleTTS 2 | Repo | 🤗 Hub | MIT | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
VALL-E | Unofficial Repo | Not Available | MIT | Yes | NA | Paper | Not Available | |
HierSpeech++ | Repo | GDrive | MIT | No | KR + EN | Paper | 🤗 Space | |
Bark | Repo | 🤗 Hub | MIT | No | Multilingual | Paper | 🤗 Space | |
EmotiVoice | Repo | GDrive | Apache 2.0 | Yes | ZH + EN | Not Available | Not Available | Separate GUI agreement |
Amphion | Repo | 🤗 Hub | MIT | No | Multilingual | Paper | 🤗 Space | |
xVASynth | Repo | GH commit | GPL-3.0 | Yes | Multilingual | Paper | Not Available | Copyrighted materials used for training |
OverFlow TTS | Repo | GitHub | MIT | Yes | English | Paper | GH Pages | |
Neural-HMM TTS | Repo | GitHub | MIT | Yes | English | Paper | GH Pages | |
Tacotron 2 | Unofficial Repo | GDrive | BSD-3 | Yes | English | Paper | Webpage | |
Glow-TTS | Repo | GDrive | MIT | Yes | English | Paper | GH Pages | |
Silero | Repo | GH links | CC BY-NC-SA | No | EN + DE + ES + EA | Not Available | Not Available | Non Commercial |
MahaTTS | Repo | 🤗 Hub | Apache 2.0 | No | English, Hindi, Indian English, Bengali, Tamil, Telugu, Punjabi, Marathi, Gujarati, Assamese | Not Available | Recordings, Colab | |
TTTS | Repo | 🤗 Hub | MPL 2.0 | No | ZH | Not Available | Colab | |
Help make this list more complete. Create demos on the Hugging Face Hub and link them here :) Got any questions? Drop me a DM on Twitter @reach_vb.