Pinned Repositories
DSS-SE
Distributed Sensor Selection for Speech Enhancement with Acoustic Sensor Networks
GraphSpeech
[ICASSP'2021] GraphSpeech: Syntax-aware Graph Attention Network For Neural Speech Synthesis
i-ETTS
Interactive Emotional Text-to-Speech (ETTS) Synthesis System
ICASSP2020
M2S-ADD
[InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"
MnTTS
MonTTS
myanmar-tokenizer
A Rule-based Syllable Segmentation of Myanmar Text
python-MCD
StrengthNet
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
ttslr's Repositories
ttslr/StrengthNet
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
ttslr/MonTTS
ttslr/M2S-ADD
[InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"
ttslr/DSS-SE
Distributed Sensor Selection for Speech Enhancement with Acoustic Sensor Networks
ttslr/MnTTS
ttslr/Ai-TTS
ttslr/ttslr.github.io
ttslr.github.io
ttslr/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
ttslr/ECSS
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI2024)
ttslr/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
ttslr/FastTalker
ttslr/MSCR-ADD
ttslr/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
ttslr/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
ttslr/Chinese-Minority-PLM
CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)
ttslr/CTA-TTS
ttslr/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean
ttslr/IOT
ttslr/MAM-BERT
ttslr/MoeTTS
Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan and VITS
ttslr/mongolian-nlp
Useful resources for Mongolian NLP
ttslr/MT-KD
ttslr/paper-reading
深度学习论文精读
ttslr/survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
ttslr/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
ttslr/TalkLip
ttslr/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
ttslr/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
ttslr/ttslr
ttslr/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech