/asr

Automatic speech recognition using neural networks

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

ASR - Automatic Speech Recognition

Automatic Speech Recognition using neural networks. This repo contains implementations of NVIDIA's Jasper and QuartzNet speech recognition architectures. Their approach doesn't rely on RNNs like most do, but rather opts for a fully convolutional approach.
NOTE: You'll have to unzip the ffmpeg binaries manually because we had to push them as zip archives due to Git LFS bandwith limitations. Just do Extract Here in their appropriate directories.

The research papers can be found on Arxiv:

🎓 Authors: