yongxuUSTC
Looking for 2024 summer interns in the US on speech & audio projects!
Tencent AI labBellevue, Seattle, USA
Pinned Repositories
cnn_rnn_spatial_audio_tagging
convolutional-autoencoder-for-raw-waveform-reconstruction
convolutional autoencoder for raw waveform reconstruction to replace the classic STFT, i called it as short-time AE transform (STAET)
dcase2017_task4_cvssp
DNN-for-speech-enhancement
DNN-for-speech-enhancement
DNN-Speech-enhancement-demo-tool
Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model
DNN-SpeechEnhancement
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
grnnbf
Generalized RNN beamformer for speech separation
mtmvdr
Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020
sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
speech-emotion-recognition
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
yongxuUSTC's Repositories
yongxuUSTC/kaldi-ivector
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
yongxuUSTC/nmflib
Code from http://www.ee.columbia.edu/~grindlay/code.html