deeplearningzhy's Stars
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
soulmachine/leetcode
LeetCode题解,151道题完整版。
roboticcam/machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接
ibab/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
chrisdonahue/wavegan
WaveGAN: Learn to synthesize raw audio with generative adversarial networks
philipperemy/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
santi-pdp/segan
Speech Enhancement Generative Adversarial Network in TensorFlow
drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising
voice-engine/make-a-smart-speaker
A collection of resources to make a smart speaker
yongxuUSTC/sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
philipperemy/timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
colinsongf/keyword_spotting
Chinese keyword spotting model using LSTM RNN
wxs/keras-mnist-tutorial
For a mini tutorial at U of T, a tutorial on MNIST classification in Keras.
DCASE-REPO/dcase2018_baseline
DCASE 2018 Baseline systems
DistantSpeechRecognition/mcse
Multi-channel speech enhancement system (MVDR beamformer + several postfilters)
ToniCreswell/attribute-cVAEGAN
Conditional Autoencoders with Adversarial Information Factorization
wangkenpu/rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
boozyguo/ClearWave
Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)
karolpiczak/paper-2017-DCASE
The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data
ssarfjoo/improvedsegan
This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training more robust and stable.
alibugra/audio-data-augmentation
Audio data augmentation examples
jerrygood0703/speech-enhancement-WGAN
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
sid0710/audio_data_augmentation
lordet01/segan
Speech Enhancement Generative Adversarial Network
mingukkang/MNIST-Tensorflow-Code
It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning rate decay, He initialization, Tensorboard, Save, Restore
BarclayII/audiogan
GAN for (raw) audio generation
WXB506/SpeechEnhancement-1
about Speech enhancement
bobateadev/doc_test
linjucs/segan_baseline