stefantaubert
I am currently working on my PhD on speech synthesis at Chemnitz University of Technology.
Chemnitz University of Technology
Chemnitz, Germany
Pinned Repositories
mel_cepstral_distance
Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.
en-tts
Command-line interface and Python library for synthesizing English texts into speech.
mean-opinion-score
Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).
pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.
quora-competition
Code for Quora Competition on Kaggle
tacotron
Command-line interface to train Tacotron 2 using .wav <=> .TextGrid pairs.
tacotron2
Original Tacotron 2 modified to support IPA training/synthesis and multiple speakers.
textgrid-ipa
Command-line interface which provides methods to modify TextGrids (.TextGrid) and their corresponding audio files (.wav).
tts-mos-test-mturk
Command-line interface (CLI) and Python library to evaluate text-to-speech (TTS) mean opinion score (MOS) studies done on Amazon Mechanical Turk (MTurk). The calculation of the confidence intervals is done in the same manner as described in (Ribeiro et al., 2011).
zh-tts
Web app, command-line interface and Python library for synthesizing Chinese texts into speech.
stefantaubert's Repositories
stefantaubert/pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.
stefantaubert/mean-opinion-score
Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).
stefantaubert/textgrid-ipa
Command-line interface which provides methods to modify TextGrids (.TextGrid) and their corresponding audio files (.wav).
stefantaubert/tacotron
Command-line interface to train Tacotron 2 using .wav <=> .TextGrid pairs.
stefantaubert/tts-mos-test-mturk
Command-line interface (CLI) and Python library to evaluate text-to-speech (TTS) mean opinion score (MOS) studies done on Amazon Mechanical Turk (MTurk). The calculation of the confidence intervals is done in the same manner as described in (Ribeiro et al., 2011).
stefantaubert/zh-tts
Web app, command-line interface and Python library for synthesizing Chinese texts into speech.
stefantaubert/en-tts
Command-line interface and Python library for synthesizing English texts into speech.
stefantaubert/lifeclef-geo-2018
Source code for the TUCMI submissions to the GeoLifeCLEF 2018 species recognition task
stefantaubert/pronunciation-dictionary
Library and CLI to load/save/modify pronunciation dictionaries.
stefantaubert/cmudict-parser
Python parser for CMUDict files. It returns ARPAbet and IPA transcriptions of dictionary words.
stefantaubert/imageclef-lifelog-2019
Source code for the TUCMI submissions to the ImageCLEF 2019 Lifelog Task
stefantaubert/speech-dataset-parser
Parser for several speech datasets.
stefantaubert/waveglow
Command-line interface (CLI) to train WaveGlow using .wav files.
stefantaubert/dict-from-annotation
Creates a pronunciation dictionary based on annotations.
stefantaubert/dict-from-dragonmapper
CLI to convert a Chinese vocabulary to IPA using dragonmapper.
stefantaubert/dict-from-g2p
Creates a pronunciation dictionary using g2p.
stefantaubert/dict-from-pypinyin
stefantaubert/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
stefantaubert/Hybrid-Societies-2023
Supplementary material for Hybrid Societies Conference 2023
stefantaubert/ICSPCC-2022
Supplementary material for ICSPCC 2022 paper "A Comparison of Text Selection Algorithms for Sequence-to-Sequence Neural TTS".
stefantaubert/pronunciation-dict-creation
stefantaubert/pronunciation-dictionary-utils
Utils to modify pronunciation dictionaries.
stefantaubert/qmk_firmware
Open-source keyboard firmware for Atmel AVR and Arm USB families
stefantaubert/sentence2pronunciation
stefantaubert/stefantaubert
stefantaubert
stefantaubert/stefantaubert.github.io
stefantaubert/text-selection
stefantaubert/text-utils
stefantaubert/tts
stefantaubert/txt-utils
CLI to batch process lines of a single text file.