Pinned Repositories
adaptive_voice_conversion
charsiu
Charsiu: A neural phonetic aligner.
ICE-Talk
Interface for Controllable Expressive Talking Machine
MUST_P-SRL
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning
SCRIBE
Data processing and analysis of SCRIBE (Spoken Corpus Recordings In British English)
EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
ofxMotionMachine
MotionMachine is a C++ software toolkit for fast prototyping of interaction based on motion feature extraction. It brings mocap-friendly data structures and intuitive visualisation together.
speak_with_style_demo
Demo of a synthesized utterance with different styles with controllable intensities
noetits's Repositories
noetits/adaptive_voice_conversion
noetits/deep-learning-academy.github.io
noetits/gentle
gentle forced aligner
noetits/htk
HTK Toolkit with Linux 64 bit and Docker support
noetits/MotionMachine
MotionMachine is a C++ software toolkit for fast prototyping of interaction based on motion feature extraction. It brings mocap-friendly data structures and intuitive visualisation together.
noetits/opensmile
Opensmile 2.3.0 with ROS Sink to publish messages to ROS topics
noetits/pretty-print-confusion-matrix
plot a pretty confusion matrix with seaborn and matplotlib in python
noetits/unitselection
A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default
noetits/wakeword-benchmark
wake word engine benchmark framework
noetits/will_style_intensities
Samples of synthesized speech with different style intensities