Pinned Repositories
dataset_viewer
Streamlit app to visualize and edit TTS datasets
FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
NeMo
NeMo: a toolkit for conversational AI
openduck
Building an open-source interactive AI plush toy.
radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
uberduck-discord-example
uberduck-ml-dev
ML models for Uberduck
uberduct
CMU US English Dictionary
utterances
A repository of utterances that are fun to generate with a text-to-speech voice.
uberduck-ai's Repositories
uberduck-ai/utterances
A repository of utterances that are fun to generate with a text-to-speech voice.
uberduck-ai/uberduck-dvc-tutorial
uberduck-ai/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
uberduck-ai/torchMoji
😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc
uberduck-ai/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.