Pinned Repositories
AndroidAudioConverter
Convert audio files inside your Android app easily. Supported formats: AAC, MP3, M4A, WMA, WAV and FLAC.
DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
deepspeech.pytorch
Speech Recognition using DeepSpeech2.
espnet
End-to-End Speech Processing Toolkit
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
faad2
Freeware Advanced Audio (AAC) Decoder faad2 mirror
ipa-dict
Monolingual wordlists with pronunciation information in IPA
kaldi
This is now the official location of the Kaldi project.
LTPD-VAD
Voice activity detection based on long-term pitch divergence
bookong22's Repositories
bookong22/AndroidAudioConverter
Convert audio files inside your Android app easily. Supported formats: AAC, MP3, M4A, WMA, WAV and FLAC.
bookong22/DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
bookong22/DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
bookong22/deepspeech.pytorch
Speech Recognition using DeepSpeech2.
bookong22/espnet
End-to-End Speech Processing Toolkit
bookong22/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
bookong22/faad2
Freeware Advanced Audio (AAC) Decoder faad2 mirror
bookong22/ipa-dict
Monolingual wordlists with pronunciation information in IPA
bookong22/kaldi
This is now the official location of the Kaldi project.
bookong22/LTPD-VAD
Voice activity detection based on long-term pitch divergence
bookong22/phonemizer
Simple text to phones converter for multiple languages
bookong22/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
bookong22/speaker-verification
Implementation of state of the art d-vector approach for speaker verification
bookong22/speechbrain
A PyTorch-based Speech Toolkit
bookong22/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
bookong22/tensorflow
Computation using data flow graphs for scalable machine learning
bookong22/TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners with Latest APIs
bookong22/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
bookong22/tflearn
Deep learning library featuring a higher-level API for TensorFlow.
bookong22/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
bookong22/WaveNet
Yet another WaveNet implementation in PyTorch.