mysxs

Ph.D. student at Institute of Information Engineering, Chinese Academy of Sciences

University of Chinese Academy of Sciences, Beijing

Pinned Repositories

emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language: Python 709 17 5454
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language: Python 2.7k 30 52185
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Language: Jupyter Notebook 0 0 00
emotion2vec
0 1 00
Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
Language: Cython 0 0 00
SlowFast-master
1 1 00
SpeechTokenizer
This is the code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
Language: Python 0 0 00
sxs.github.io
Language: HTML 00
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python 8k 76 228611
Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)
Language: Python 1.1k 16 52220

mysxs/SlowFast-master
1 1 00
mysxs/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Language: Jupyter Notebook 0 0 00
mysxs/emotion2vec
0 1 00
mysxs/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
Language: Cython 0 0 00
mysxs/SpeechTokenizer
This is the code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
Language: Python 0 0 00