Pinned Repositories
3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
Audio-Sparse-Coding
For project "Sparse Codes for Speech Predict Spectrotemporal Receptive Fields in the Inferior Colliculus" of class Neural and Cognitive Computation by Prof. Xiaolin Hu (Tsinghua University).
Best_WERs
An attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
Chinese-Poetry-Dataset
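The most comprehensive database of classical Chinese poetry: nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems plus 260,000 Song poems, and 21,050 ci by 1,564 ci poets of the Northern and Southern Song periods.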
CTC-ASR
This is a working example of using CTC for phone recognition on TIMIT.
hlthu.github.io
Lu Huang's Homepage
kaldi
This is now the official location of the Kaldi project.
masr
Chinese speech recognition with pretrained models and a high recognition rate; Mandarin Automatic Speech Recognition.
Python-OpenCV-Learn
For learning OpenCV with Python.
Self-Contained-NN
For the implementation and testing of Self-Contained Networks using PyTorch.
hlthu's Repositories
hlthu/Python-OpenCV-Learn
For learning OpenCV with Python.
hlthu/kaldi
This is now the official location of the Kaldi project.
hlthu/Self-Contained-NN
For the implementation and testing of Self-Contained Networks using PyTorch.
hlthu/Audio-Sparse-Coding
For project "Sparse Codes for Speech Predict Spectrotemporal Receptive Fields in the Inferior Colliculus" of class Neural and Cognitive Computation by Prof. Xiaolin Hu (Tsinghua University).
hlthu/Best_WERs
An attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
hlthu/CTC-ASR
This is a working example of using CTC for phone recognition on TIMIT.
hlthu/Conv-TasNet-gantheory
Deep Neural Network for Speaker Separation
hlthu/Conv-TasNet-XuKaituo
A PyTorch implementation of Fully-Convolutional Time-domain Audio Separation Network (Conv-TasNet) with Permutation Invariant Training (PIT) for speech separation.
hlthu/DANet
TensorFlow-based implementation of Deep Attractor Network for Speech Separation
hlthu/DaNet-Tensorflow
TensorFlow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"
hlthu/espnet
End-to-End Speech Processing Toolkit
hlthu/LaTex-Template
Some useful LaTeX templates.
hlthu/libsvm
hlthu/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
hlthu/miscellanea
Miscellaneous projects from 2014-2018
hlthu/pit-speech-separation
hlthu/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
hlthu/PyTorch-Learn
For Learning with PyTorch
hlthu/ShadowsocksX-NG
Next Generation of ShadowsocksX
hlthu/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
hlthu/Speech_Analysis
Analyzes a signal and finds the fundamental frequency, HNR, etc.
hlthu/sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
hlthu/summer-review
The review notes prepared for the interview in Sept. 2017
hlthu/TASNET-ododoyo
Time-domain Audio Separation Network
hlthu/TasNet-XuKaituo
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
hlthu/video-classification-3d-cnn-pytorch
Video classification tools using 3D ResNet