Golang bindings for Mozilla's DeepSpeech speech-to-text library.
As of now, astideepspeech
is only compatible with version v0.6.0
of DeepSpeech
.
Installation
Install DeepSpeech
- fetch an up-to-date
native_client.<your system>.tar.xz
matching your system from DeepSpeech's "releases" - extract its content to /tmp/deepspeech/lib
- download
deepspeech.h
from https://github.com/mozilla/DeepSpeech/raw/v0.6.0/native_client/deepspeech.h - copy it to /tmp/deepspeech/include
- export CGO_LDFLAGS="-L/tmp/deepspeech/lib/"
- export CGO_CXXFLAGS="-I/tmp/deepspeech/include/"
- export LD_LIBRARY_PATH=/tmp/deepspeech/lib/:$LD_LIBRARY_PATH
Install astideepspeech
Run the following command:
$ go get -u github.com/asticode/go-astideepspeech/...
Example
Get the pre-trained model
Run the following commands:
$ mkdir /tmp/deepspeech
$ cd /tmp/deepspeech
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.6.0/deepspeech-0.6.0-models.tar.gz
$ tar xvfz deepspeech-0.6.0-models.tar.gz
Get the audio files
Run the following commands:
$ cd /tmp/deepspeech
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.6.0/audio-0.6.0.tar.gz
$ tar xvfz audio-0.6.0.tar.gz
Use the client
Run the following commands (make sure $GOPATH/bin
is in your $PATH
):
$ cd /tmp/deepspeech
$ deepspeech -model models/output_graph.pb -audio audio/2830-3980-0043.wav -lm models/lm.binary -trie models/trie
Text: experience proves this
$ deepspeech -model models/output_graph.pb -audio audio/4507-16021-0012.wav -lm models/lm.binary -trie models/trie
Text: why should one halt on the way
$ deepspeech -model models/output_graph.pb -audio audio/8455-210777-0068.wav -lm models/lm.binary -trie models/trie
Text: your power is sufficient i said