Pinned Repositories
FTD-distillation
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
DatasetCondensation
Dataset Condensation
FlatTrajectoryDistillation_FTD
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
TransformerDistillation-SLU
TS-TalkNet
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
UniCodec
UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Jiang-Yidi's Repositories
Jiang-Yidi/TS-TalkNet
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
Jiang-Yidi/FlatTrajectoryDistillation_FTD
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
Jiang-Yidi/TransformerDistillation-SLU
Jiang-Yidi/DatasetCondensation
Dataset Condensation
Jiang-Yidi/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling