Pinned Repositories
dataset_viewer
Streamlit app to visualize and edit TTS datasets
FastSpeech2
PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
NeMo
NeMo: a toolkit for conversational AI
openduck
Building an open-source interactive AI plush toy.
radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low Dimensional (F0 and Energy) Speech Attributes.
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
uberduck-discord-example
uberduck-ml-dev
ML models for Uberduck
uberduct
CMU US English Dictionary
utterances
A repository of utterances that are fun to generate with a text-to-speech voice.
uberduck-ai's Repositories
uberduck-ai/uberduck-ml-dev
ML models for Uberduck
uberduck-ai/dataset_viewer
Streamlit app to visualize and edit TTS datasets
uberduck-ai/openduck
Building an open-source interactive AI plush toy.
uberduck-ai/uberduck-discord-example
uberduck-ai/NeMo
NeMo: a toolkit for conversational AI
uberduck-ai/uberduct
CMU US English Dictionary
uberduck-ai/FastSpeech2
PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
uberduck-ai/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
uberduck-ai/radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low Dimensional (F0 and Energy) Speech Attributes.
uberduck-ai/3d-text
uberduck-ai/audiogram
Turn audio into a shareable video.
uberduck-ai/g2p
g2p: English Grapheme To Phoneme Conversion
uberduck-ai/HierSpeechpp
The official implementation of HierSpeech++
uberduck-ai/morph-text
Morph Text in Remotion
uberduck-ai/remotion-wrapped
🎶 Spotify Wrapped recreated in Remotion 🎥
uberduck-ai/riffusion
Stable diffusion for real-time music generation
uberduck-ai/rvc
uberduck-ai/ai-podcast-content-assistant
uberduck-ai/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
uberduck-ai/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
uberduck-ai/image-to-rap
uberduck-ai/material_stable_diffusion
Tileable Stable Diffusion - Cog model
uberduck-ai/monotonic_align
Monotonic Alignment Search
uberduck-ai/phonemizer
Simple text to phones converter for multiple languages
uberduck-ai/RWKV-infctx-trainer
RWKV-infctx for audio generation
uberduck-ai/sample-generator
Tools to train a generative model on arbitrary audio samples
uberduck-ai/three-particles
Remotion adaptation of https://github.com/winkerVSbecks/3d-particle-effects-demo as requested in Discord
uberduck-ai/torch-stft
An STFT/iSTFT for PyTorch.
uberduck-ai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
uberduck-ai/yomikata
Disambiguate Japanese heteronyms