Pinned Repositories
asr-python-htk
Python tools for building an ASR system with the HTK toolkit
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
baidu-allreduce
beamforming
BeamformIt
BeamformIt acoustic beamforming software
caffe
Caffe: a fast open framework for deep learning.
CAT
A CRF-based ASR Toolkit
cmake-demo
Source code for 《CMake入门实战》 ("CMake: A Hands-On Introduction")
tensorflowkaldi
voiceprint
text-independent speaker identification
iwaterxt's Repositories
iwaterxt/voiceprint
text-independent speaker identification
iwaterxt/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
iwaterxt/baidu-allreduce
iwaterxt/CAT
A CRF-based ASR Toolkit
iwaterxt/cmake-demo
Source code for 《CMake入门实战》 ("CMake: A Hands-On Introduction")
iwaterxt/compound-loss-pytorch
Compound loss for PyTorch
iwaterxt/DeepSpeech
A PaddlePaddle implementation of the DeepSpeech2 architecture for ASR.
iwaterxt/E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
iwaterxt/espnet
End-to-End Speech Processing Toolkit
iwaterxt/gdrive.sh
Download a file or a folder easily. curl gdrive.sh | bash -s $fileid
iwaterxt/iwaterxt.github.io
Template for a blog hosted on GitHub Pages
iwaterxt/kaldi
This is now the official location of the Kaldi project.
iwaterxt/kaldi-aslp
iwaterxt/kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.
iwaterxt/kaldi-gop
Computes the Goodness of Pronunciation (GOP). Based on Kaldi.
iwaterxt/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
iwaterxt/kaldi-io-for-python
Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.
iwaterxt/Multi-band-WaveRNN
iwaterxt/neural_sp
End-to-end ASR/LM implementation with PyTorch.
iwaterxt/nn-vad
Simple DNN-based VAD
iwaterxt/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
iwaterxt/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
iwaterxt/polysody
iwaterxt/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
iwaterxt/python_kaldi_features
Python code to extract MFCC and FBANK speech features for Kaldi
iwaterxt/Socket-Programming-Python
Running client-server code, explained with comments.
iwaterxt/sparrowhawk
iwaterxt/sparse_image_warp_pytorch
PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition (https://arxiv.org/abs/1904.08779)
iwaterxt/wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
iwaterxt/xdecoder
Fast, portable, enhanced ASR decoder