django-deepspeech-server
This is Mozilla deepspeech server implemented in django. One can record sound in browser and submit it to get corresponding text. This is working prototype, focus is on implementation rather than design of user interface.
Acknowledgement
First of all, thanks to mozilla for such a awesome project. Speech to text is revolutionary technology that has huge scope in future and these type of open source efforts will definitely help nurture this tech. I have used wav-encoder to encode recorded sound in wav format and resampler to get 16000 Hz sample rate. Got some of my inspiration from deepspeech-server.
Installation
Download or clone this project. This project uses python3. To run this project you need to first install deepspeech. Depending on your system you can use the CPU package:
pip3 install deepspeech
Or the GPU package
pip3 install deepspeech-gpu
Install django.
Configuration
Enter path for your model, alphabet, lm and trie in speech-server-main/config/config.json file. Make change to file_name variable in speech_server_main/views.py, to match some valid path on your system.
Go to directory where manage.py is located and start server:
python3 manage.py runserver
Go to your browser and browse to http://127.0.0.1:8000/dsserver.
TODO
- Make it web socket based.
- Real time inference
- Provide Google speech API like response, so that one only has to change websocket address.
License
MIT(see LICENSE)