django-deepspeech-server

This is Mozilla deepspeech server implemented in django. One can record sound in browser and submit it to get corresponding text. This is working prototype, focus is on implementation rather than design of user interface.

Acknowledgement

First of all, thanks to mozilla for such a awesome project. Speech to text is revolutionary technology that has huge scope in future and these type of open source efforts will definitely help nurture this tech. I have used wav-encoder to encode recorded sound in wav format and resampler to get 16000 Hz sample rate. Got some of my inspiration from deepspeech-server.

Installation

Download or clone this project. This project uses python3. To run this project you need to first install deepspeech. Depending on your system you can use the CPU package:

pip3 install deepspeech

Or the GPU package

pip3 install deepspeech-gpu

Install django.

Configuration

Enter path for your model, alphabet, lm and trie in speech-server-main/config/config.json file. Make change to file_name variable in speech_server_main/views.py, to match some valid path on your system.

Go to directory where manage.py is located and start server:

python3 manage.py runserver

Go to your browser and browse to http://127.0.0.1:8000/dsserver.

TODO

Make it web socket based.
Real time inference
Provide Google speech API like response, so that one only has to change websocket address.

License