kaituoxu
Speech recognition, punctuation prediction, speech separation and deep learning.
Northwestern Polytechnical UniversityXi'an
Pinned Repositories
FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
E6870
My solution to course E6870 (Speech Recognition) of Columbia University.
kaldi-ktnet1
Kaldi extended by Kaituo XU with new features in nnet1.
Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
MiniPascal-Compiler-AST
MiniPascal-compiler produce symbol table, quaterlist and abstract syntax tree.
Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
TasNet
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.
kaituoxu's Repositories
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
kaituoxu/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
kaituoxu/TasNet
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
kaituoxu/X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.
kaituoxu/Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
kaituoxu/E6870
My solution to course E6870 (Speech Recognition) of Columbia University.
kaituoxu/kaldi-ktnet1
Kaldi extended by Kaituo XU with new features in nnet1.
kaituoxu/MiniPascal-Compiler-AST
MiniPascal-compiler produce symbol table, quaterlist and abstract syntax tree.
kaituoxu/PyTorch-ASR-AM
A PyTorch implementation of ASR Acoustic Model training.
kaituoxu/IIPPython
This is my mini-projects to a Coursera class named An Introduction to Interactive Programming in Python.
kaituoxu/kaldi-aslp
kaituoxu/MiniPascal-Compiler
The compiler of MiniPascal, which produce symbol table and quaterlist.
kaituoxu/MOOC_DS
Codes of Data Structure on my Netease MOOC
kaituoxu/myself_lstm_tutorial
mine lstm tutorial
kaituoxu/Neural-Network-Python
Feedforward Neural Network implement in python.
kaituoxu/PAT
My solutions to PAT problems.
kaituoxu/python-gradient-check
Backpropagation gradient check workflow using python.
kaituoxu/stanford-cs231n-2016winter
stanford-cs231n-2016winter
kaituoxu/The-C-Programming-Language
My codes for K&R (The C Programming Language).