Pinned Repositories
adaptive_voice_conversion
charsiu
Charsiu: A neural phonetic aligner.
ICE-Talk
Interface for Controllable Expressive Talking Machine
MUST_P-SRL
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning
SCRIBE
Data processing and analysis of SCRIBE (Spoken Corpus Recordings In British English)
EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
ofxMotionMachine
MotionMachine is a C++ software toolkit for fast prototyping of interaction based on motion feature extraction. It brings mocap-friendly data structures and intuitive visualisation together.
speak_with_style_demo
Demo of a synthesized utterance with different styles with controllable intensities
noetits's Repositories
noetits/ICE-Talk
Interface for Controllable Expressive Talking Machine
noetits/MUST_P-SRL
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning
noetits/SCRIBE
Data processing and analysis of SCRIBE (Spoken Corpus Recordings In British English)
noetits/charsiu
Charsiu: A neural phonetic aligner.
noetits/cmudict_0.7b_json
cmudict as a json file
noetits/ddsp
DDSP: Differentiable Digital Signal Processing
noetits/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
noetits/fast-style-transfer
TensorFlow CNN for fast style transfer ⚡🖥🎨🖼
noetits/htmlpreview.github.com
HTML Preview for GitHub Repositories
noetits/latent_space_exp
noetits/markdown-here
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
noetits/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
noetits/micromamba-docker-sample
noetits/NeMo
NeMo: a toolkit for conversational AI
noetits/openslr
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
noetits/opensmile-python
Python package for openSMILE
noetits/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
noetits/phonemizer
Simple text to phones converter for multiple languages
noetits/pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian)
noetits/res
noetits/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
noetits/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
noetits/syllabify
Automatically convert plain text into phonemes (US English pronunciation) and syllabify
noetits/syllabify-1
noetits/textGenerationGTP3
noetits/Transformers-Recipe
🧠 A quick recipe to learn all about Transformers
noetits/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
noetits/vae_tacotron2
VAE Tacotron 2, an alternative of GST Tacotron
noetits/xlsr-wav2vec2-phoneme-recognition
noetits/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion