THUHCSI
Human-Computer Speech Interaction Lab at Tsinghua University
FIT Building, Tsinghua University, Beijing
Pinned Repositories
Crystal
Crystal - C++ implementation of a unified framework for a multilingual TTS synthesis engine, with the SSML specification as its interface.
Crystal.TTVS
The Crystal TTVS engine is a real-time audio-visual multilingual speech synthesizer with a 3D expressive avatar.
FlatTN
Chinese Text Normalization and Dataset
icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
MagicMan
Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
NeuCoSVC
NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
SECap
SpanPSP
VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE-based non-autoregressive TTS model.
THUHCSI's Repositories
thuhcsi/NeuCoSVC
thuhcsi/MagicMan
Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
thuhcsi/SECap
thuhcsi/LightGrad
thuhcsi/S2G-MDDiffusion
thuhcsi/SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
thuhcsi/DiffVar
thuhcsi/SpeechCraft
The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.
thuhcsi/VoxInstruct
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
thuhcsi/Contextual-Biasing-Dataset
Open-source Mandarin biased word dataset
thuhcsi/mm2022-conversational-tts
thuhcsi/ExpressiveBailando
thuhcsi/SCNet
thuhcsi/mst-fastspeech2
thuhcsi/dpss-exp2-HMM-2023
ex2 for dpss 2023
thuhcsi/icassp2023-coherent-tts
Please visit https://thuhcsi.github.io/icassp2023-coherent-tts
thuhcsi/melody-unsupervised-pretraining-svs
thuhcsi/Semi-Supervised-MDD
thuhcsi/StyleDub
Please visit https://thuhcsi.github.io/StyleDub/
thuhcsi/secap_demo
thuhcsi/secap_slate
thuhcsi/icassp2024-msvalle
thuhcsi/interspeech2023-DiffVar
thuhcsi/interspeech2023-NS-Extractor
Demo page for lin9x's NS-Extractor
thuhcsi/interspeech2023-spontaneousTTS
thuhcsi/interspeech2024-CSG
Please visit https://thuhcsi.github.io/interspeech2024-CSG
thuhcsi/interspeech2024-SponLMTTS
thuhcsi/ls_slate
thuhcsi/Self-Supervised-MDD
thuhcsi/TASLP-MSStyleTTS
Please visit https://thuhcsi.github.io/TASLP-MSStyleTTS