Hongjiang-Yu's Stars
mozilla/DeepSpeech
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an end-to-end ASR model with a Transformer network, for Mandarin Chinese.
ifnspaml/Perceptual-Weighting-Filter-Loss
A perceptual weighting filter loss for DNN training in speech enhancement
eesungkim/Speech_Enhancement_MMSE-STSA
Statistical model-based speech enhancement using MMSE-STSA
haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimal unofficial implementation of "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
haoxiangsnr/IRM-based-Speech-Enhancement-using-LSTM
Ideal Ratio Mask (IRM) estimation-based speech enhancement using LSTM
f90/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
covarep/covarep
A Cooperative Voice Analysis Repository for Speech Technologies
fgnt/nn-gev
Neural network supported GEV beamformer
clinicalml/structuredinference
Structured Inference Networks for Nonlinear State Space Models