Simple api over mozilla deepspeech voice recognition engine.
POST /api/v1/stt - Just look at curl command below.
Speech data may be provided in whatever audio format which ffmpeg is able to convert to wav,
so you probably don't have to worry about this at all.
$ curl -X POST -F "speech=@speech.mp3" http://127.0.0.1:8000/api/v1/stt
{"text":"experience proves this","time":0.9638644850056153}
git clone git@github.com:zelo/deepspeech-rest-api.git
- Look at
requirements.pip
file and find what is the current deepspeech library version used by this api - Go to
https://github.com/mozilla/DeepSpeech/releases
and find release doc for this version. It should contain link to download data model. For version 0.7.1 it'shttps://github.com/mozilla/DeepSpeech/releases/download/v0.7.1/deepspeech-0.7.1-models.pbmm
- Download and extract model from above package to
<repository_root>/model.pbmm
- Enter
<repository_root>
- Run
docker-compose up
- Build image
docker build . --tag zelo/deepspeech-rest-api:0.7.1
- Run
docker run --rm --publish=127.0.0.1:8000:8000 --volume=$(pwd)/model.pbmm:/app/model.pbmm:ro zelo/deepspeech-rest-api:0.7.1
Just look at the content of Dockerfile
it contains complete instruction to setup app under debian