WangHelin1997
PhD student at Johns Hopkins University, got my bachelor's degree and master's degree at Tsinghua University and Peking University.
THU & PKU & JHUBaltimore, US
Pinned Repositories
AT-GCN
Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition
DCASE-2020-Task1A-Code
A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.
DuTa-VC
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Fast-GeCo
Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
MaskSpec
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
nnAudio2
Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.
SpecAugment-plus
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
WangHelin1997's Repositories
WangHelin1997/SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
WangHelin1997/MaskSpec
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
WangHelin1997/DuTa-VC
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
WangHelin1997/Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition
WangHelin1997/Fast-GeCo
Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
WangHelin1997/LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
WangHelin1997/Aty-TTS
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
WangHelin1997/DCASE2021_Task6_PKU
This is the code of PKU team for DCASE 2021 Task 6.
WangHelin1997/LibriLightMix-WHAM
Python scripts to create noisy mixture audio with Libri-Light and WHAM
WangHelin1997/Speech-paper-crawl
My Python scripts for crawling paper related on speech processing.
WangHelin1997/Du-N2DVC-Demo
WangHelin1997/project2021
PKU team for 2021 project 'Guangchangwu detection'.
WangHelin1997/CommonVoice
WangHelin1997/Aty-TTS-Demo
WangHelin1997/helinwang
WangHelin1997/Your-Stable-Audio
Stable Audio UnOffical Implementation: Latent Diffusion for Audio Generation
WangHelin1997/clip-multilingual
Multilingual CLIP - Semantic Image Search in 100 languages
WangHelin1997/DuTa-VC-Demo
WangHelin1997/dynamic-superb
The official repository of Dynamic-SUPERB.
WangHelin1997/Dysarthric-Speech-Reconstruction-Demo
Demo for dysarthric speech reconstruction
WangHelin1997/fairness
WangHelin1997/hifigan-yingram-vc
vc
WangHelin1997/hyperion
Python toolkit for speech processing
WangHelin1997/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
WangHelin1997/My-PhD-Interview
WangHelin1997/RP
WangHelin1997/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
WangHelin1997/tmp
WangHelin1997/torch-nansy
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
WangHelin1997/WangHelin1997.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage