wsntxxn
Ph.D. candidate working on audio, speech and music processing.
Shanghai Jiao Tong UniversityShanghai
Pinned Repositories
AudioCaption
Audio captioning recipe
AudioDataViewer
BLAT
CS318
Projects of CS318: operating system
DCASE2020T6
Code and models for DCASE2020 Task 6 SJTU submission
DCASE2022T6_CLAP
Audio-text retrieval for DCASE2022 challenge task6
RichDetailAudioTextSimulation
TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
wsntxxn's Repositories
wsntxxn/AudioCaption
Audio captioning recipe
wsntxxn/TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
wsntxxn/RichDetailAudioTextSimulation
wsntxxn/DCASE2020T6
Code and models for DCASE2020 Task 6 SJTU submission
wsntxxn/DCASE2022T6_CLAP
Audio-text retrieval for DCASE2022 challenge task6
wsntxxn/BLAT
wsntxxn/AudioDataViewer
wsntxxn/CS318
Projects of CS318: operating system
wsntxxn/wsntxxn.github.io
wsntxxn/AndroidDictionary
A simple dictionary app
wsntxxn/CS306
计算机网络项目
wsntxxn/Gemm
Cannon Algorithm Implementation for matrix multiplication using MPI
wsntxxn/HEAR2021_EfficientLatent
Submission to the HEAR2021 Challenge
wsntxxn/insight-API
insight API with height limiting
wsntxxn/Jwb
jiaowoban page
wsntxxn/Parallel_Projects
Parallel computing and programming projects
wsntxxn/pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools