TsainGra/DeepSpeech
An end-to-end model for automatic speech recognition (ASR), trained on a small VoxForge dataset. It uses a CTC loss function and a single-layer bidirectional LSTM (B-LSTM) network. Training accuracy is around 87%; improving validation accuracy would require a much deeper network and much more data.
Python · GPL-3.0
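The description mentions a CTC loss. As a rough illustration of how a CTC-trained network's per-frame outputs turn into a transcript, here is a minimal greedy (best-path) decoding sketch. It is not taken from this repository: the function name `ctc_greedy_decode`, the blank index `0`, and the toy alphabet are all assumptions made for the example.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank symbol (hypothetical choice)


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Greedy CTC decoding (illustrative sketch, not the repository's code).

    frame_scores: list of per-frame score lists, one score per label.
    alphabet: sequence mapping label index -> character; the entry at
              `blank` is a placeholder and never emitted.
    """
    # 1. Pick the highest-scoring label for every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Collapse consecutive repeats of the same label.
    collapsed = [label for label, _ in groupby(best_path)]
    # 3. Remove blanks; a blank between two identical labels keeps both.
    return "".join(alphabet[label] for label in collapsed if label != blank)


# Toy example: 3 labels (blank, "a", "b") over 6 frames.
alphabet = ["_", "a", "b"]
frame_scores = [
    [0.1, 0.8, 0.1],
    [0.2, 0.7, 0.1],
    [0.9, 0.05, 0.05],
    [0.1, 0.6, 0.3],
    [0.1, 0.2, 0.7],
    [0.0, 0.1, 0.9],
]
print(ctc_greedy_decode(frame_scores, alphabet))
```

In this toy input the per-frame best labels are `a a _ a b b`; the blank in the middle is what lets the two `a` characters survive the repeat-collapsing step. In a real system, `frame_scores` would come from the B-LSTM's output layer, and a beam search is often used in place of greedy decoding.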
Issues
- 2
Empty Folder after download
#1 opened by jeffxsc - 0