yongxuUSTC
Looking for 2024 summer interns in the US on speech & audio projects!
Tencent AI labBellevue, Seattle, USA
Pinned Repositories
cnn_rnn_spatial_audio_tagging
convolutional-autoencoder-for-raw-waveform-reconstruction
convolutional autoencoder for raw waveform reconstruction to replace the classic STFT, i called it as short-time AE transform (STAET)
dcase2017_task4_cvssp
DNN-for-speech-enhancement
DNN-for-speech-enhancement
DNN-Speech-enhancement-demo-tool
Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model
DNN-SpeechEnhancement
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
grnnbf
Generalized RNN beamformer for speech separation
mtmvdr
Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020
sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
speech-emotion-recognition
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
yongxuUSTC's Repositories
yongxuUSTC/sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
yongxuUSTC/DNN-for-speech-enhancement
DNN-for-speech-enhancement
yongxuUSTC/DNN-Speech-enhancement-demo-tool
Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model
yongxuUSTC/grnnbf
Generalized RNN beamformer for speech separation
yongxuUSTC/mtmvdr
Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020
yongxuUSTC/dcase2017_task4_cvssp
yongxuUSTC/challenges
A personal log of learning and solving problems
yongxuUSTC/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
yongxuUSTC/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
yongxuUSTC/SpatialCodec
yongxuUSTC/torchiva
Blind source separation with independent vector analysis family of algorithm in torch
yongxuUSTC/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
yongxuUSTC/CDiffuSE
Conditional Diffusion Probabilistic Model for Speech Enhancement
yongxuUSTC/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
yongxuUSTC/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
yongxuUSTC/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
yongxuUSTC/mmvdr.github.io
yongxuUSTC/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
yongxuUSTC/pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
yongxuUSTC/pytorch_complex
A temporal module for PyTorch-ComplexTensor
yongxuUSTC/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
yongxuUSTC/Speech-Emotion-Recognition-1
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
yongxuUSTC/steernet
yongxuUSTC/UniAudio
The Open Source Code of UniAudio
yongxuUSTC/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
yongxuUSTC/yongxu
about me and self introduction
yongxuUSTC/yongxu-CV
yongxuUSTC/yongxu-cv.github.io
yongxuUSTC/yongxuUSTC
yongxuUSTC/zf