yongxuUSTC
Looking for 2024 summer interns in the US on speech & audio projects!
Tencent AI labBellevue, Seattle, USA
Pinned Repositories
cnn_rnn_spatial_audio_tagging
convolutional-autoencoder-for-raw-waveform-reconstruction
convolutional autoencoder for raw waveform reconstruction to replace the classic STFT, i called it as short-time AE transform (STAET)
dcase2017_task4_cvssp
DNN-for-speech-enhancement
DNN-for-speech-enhancement
DNN-Speech-enhancement-demo-tool
Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model
DNN-SpeechEnhancement
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
grnnbf
Generalized RNN beamformer for speech separation
mtmvdr
Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020
sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
speech-emotion-recognition
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
yongxuUSTC's Repositories
yongxuUSTC/DNN-SpeechEnhancement
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
yongxuUSTC/speech-emotion-recognition
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
yongxuUSTC/python-pesq
A python package for calculating the PESQ.
yongxuUSTC/speech_separation
Include some core functions and model to handle speech separation
yongxuUSTC/tensorflow-without-a-phd
A crash course in six episodes for software developers who want to become machine learning practitioners.
yongxuUSTC/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
yongxuUSTC/models
Models built with TensorFlow
yongxuUSTC/pyensemble
An implementation of Caruana et al's Ensemble Selection algorithm in Python, based on scikit-learn
yongxuUSTC/pykaldi
A Python wrapper for Kaldi
yongxuUSTC/cgmm-mask-estimator
Implement offline version of "Robust mvdr beamforming using time-frequency masks for online/offline asr in noise"
yongxuUSTC/code_for_IS15_DNNmultiObjLearn_speech_enhancement
yongxuUSTC/dnn_wpe
yongxuUSTC/face_classification
Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.
yongxuUSTC/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
yongxuUSTC/LeetCode-1
👌 Python Solutions of LeetCode Questions 👌
yongxuUSTC/OMG_UMONS_submission
yongxuUSTC/sednn_modify
Python 3.5 and Windows version of Speech Enhancement using DNN by Yong Xu and Qiuqiang Kong
yongxuUSTC/aDAE_DNN_audio_tagging
yongxuUSTC/av-sync
yongxuUSTC/CapsNet-Keras
A Keras implementation of CapsNet in NIPS2017 paper "Dynamic Routing Between Capsules". Now test error = 0.34%.
yongxuUSTC/deep_complex_networks
Implementation related to the Deep Complex Networks
yongxuUSTC/espnet
End-to-End Speech Processing Toolkit
yongxuUSTC/hubpress.io
A web application to build your blog on GitHub
yongxuUSTC/OMGEmotionChallenge
Repository for th OMG Emotion Challenge
yongxuUSTC/rnnoise
Recurrent neural network for audio noise reduction
yongxuUSTC/torch-multi-head-attention
Multi-head attention in PyTorch
yongxuUSTC/yong.github.io
yongxuUSTC/yongxu.github.io
yongxuUSTC/yongxuUSTC.github.io
yongxuUSTC/youtube-dl
Command-line program to download videos from YouTube.com and other video sites