timit

There are 34 repositories under timit topic.

mravanelli/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Language:Python2.4k 93 214447
mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
Language:Python1.1k 31 106260
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language:HTML358 44 528
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Language:Python313 34 18121
philipperemy/timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
279 8 4121
Diamondfan/CTC_pytorch
CTC end -to-end ASR for timit and 863 corpus.
Language:Python216 6 848
HawkAaron/RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Language:Python135 8 2631
WindQAQ/listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API of Tensorflow, which makes the training and evaluation truly end-to-end.
Language:Python90 15 1032
grausof/keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Language:Python72 4 027
hirofumi0810/asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
Language:Python68 5 523
matthijsvk/TIMITspeech
Speech recognition on the TIMIT (or any other) dataset
Language:Python39 7 112
mravanelli/pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Language:Perl38 4 112
AppleHolic/PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
Language:Python33 3 05
mravanelli/theano-kaldi-rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
Language:Perl32 7 013
zhaoyu611/Automatic_Speech_Recognition_with_Multi_Models
A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
Language:Python19 5 18
biyoml/PyTorch-End-to-End-ASR-on-TIMIT
Attention-based end-to-end ASR on TIMIT in PyTorch
Language:Python16 2 25
orbxball/timit-preprocessor
Extract mfcc vectors and phones from TIMIT dataset
Language:Shell15 3 00
anicolson/SPN-ASI
Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.
Language:Python11 1 12
colinator/timit_utils
Python/numpy/pandas convenience wrapper for the TIMIT database.
Language:Jupyter Notebook11 1 13
dingzeyuli/SpEAR-speech-database
A database of clean and noisy speech for audio research
11 4 05
WindQAQ/tensorflow-wavenet
Implementation of WaveNet network based on Tensorflow.
Language:Python9 3 12
haoxintong/gluon-audio
A toolkit providing deep learning based audio recognition algorithm powered by Mxnet Gluon. Now only Text-Independent Speaker Recognition is implemented.
Language:Python8 4 31
drkostas/bench-utils
A collection of benchmarking tools.
Language:Python7 2 00
KrishnaDN/LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
Language:Python7 2 03
jackyzha0/Speech2Braille
[🏆 Silver Medal at CWSF] Tensorflow Implementation of TIMIT Deep BLSTM-CTC with Tensorboard Support
Language:Python5 2 0
HanSeokhyeon/Speech_recognition_for_English_and_Korean
다양한 feature를 이용한 음성인식 LAS model입니다. (한국어는 개발예정)
Language:Python4 3 01
BradleyHe/TIMIT-Alignment
TIMIT forced alignment with the Montreal Forced Aligner
Language:Python2 2 00
BradleyHe/TIMIT-Phoneme-Mixer
Python project that mixes phonemes from the TIMIT dataset
Language:Python2 2 00
hammaad2002/SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
Language:Jupyter Notebook2 1 10
kipmccharen/sys6016_DL_project
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
Language:Python2 2 01
BradleyHe/TIMIT-Voice-Mixer
Python project which mixes and tests sentences from the TIMIT dataset using LAS
Language:Python1 1 00
freha-mezzoudj/Magister_works1
My magister (Bac+5+2) topic is about the Timit phonems multi_classification using GA and SVM. My works are presented here to help the research community, thanks !
1 1 00
AntonDemchenko/voiceprint_maker
Language:Python0 1 00
benivalotker/benchmarking_and_profiling
simple use for benchmarking and profiling module
Language:Python0 2 00

timit

mravanelli/pytorch-kaldi

mravanelli/SincNet

speechbrain/speechbrain.github.io

hirofumi0810/tensorflow_end2end_speech_recognition

philipperemy/timit

Diamondfan/CTC_pytorch

HawkAaron/RNN-Transducer

WindQAQ/listen-attend-and-spell

grausof/keras-sincnet

hirofumi0810/asr_preprocessing

matthijsvk/TIMITspeech

mravanelli/pytorch_MLP_for_ASR

AppleHolic/PytorchSR

mravanelli/theano-kaldi-rnn

zhaoyu611/Automatic_Speech_Recognition_with_Multi_Models

biyoml/PyTorch-End-to-End-ASR-on-TIMIT

orbxball/timit-preprocessor

anicolson/SPN-ASI

colinator/timit_utils

dingzeyuli/SpEAR-speech-database

WindQAQ/tensorflow-wavenet

haoxintong/gluon-audio

drkostas/bench-utils

KrishnaDN/LAS-Pytorch

jackyzha0/Speech2Braille

HanSeokhyeon/Speech_recognition_for_English_and_Korean

BradleyHe/TIMIT-Alignment

BradleyHe/TIMIT-Phoneme-Mixer

hammaad2002/SimpleASRmodel

kipmccharen/sys6016_DL_project

BradleyHe/TIMIT-Voice-Mixer

freha-mezzoudj/Magister_works1

AntonDemchenko/voiceprint_maker

benivalotker/benchmarking_and_profiling