Pinned Repositories
Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
hie
I-Feel-Stressed-Out
I Feel Stressed Out: A Mandarin Speech Stress Dataset With New Paradigm (APSIPA ASC 2022)
IEMOCAP_GraphNetwork
S2ST
stress_work
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
WeTextProcessing
Text Normalization & Inverse Text Normalization
scutcsq's Repositories
scutcsq/Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
scutcsq/DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
scutcsq/S2ST
scutcsq/I-Feel-Stressed-Out
I Feel Stressed Out: A Mandarin Speech Stress Dataset With New Paradigm (APSIPA ASC 2022)
scutcsq/IEMOCAP_GraphNetwork
scutcsq/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
scutcsq/stress_work
scutcsq/hie