DeepSpeech doesn't seem to be maintained anymore by Mozilla. Coqui is one of the projects that continued building on top of this awesome project. Check out the GO bindings for Coqui here.
Golang bindings for Mozilla's DeepSpeech speech-to-text library.
astideepspeech
is compatible with version v0.9.0
of DeepSpeech
.
- fetch an up-to-date
native_client.<your system>.tar.xz
matching your system from DeepSpeech's "releases" - extract its content to /tmp/deepspeech/lib
- download
deepspeech.h
from https://github.com/mozilla/DeepSpeech/raw/v0.9.0/native_client/deepspeech.h - copy it to /tmp/deepspeech/include
- export CGO_LDFLAGS="-L/tmp/deepspeech/lib/"
- export CGO_CXXFLAGS="-I/tmp/deepspeech/include/"
- export LD_LIBRARY_PATH=/tmp/deepspeech/lib/:$LD_LIBRARY_PATH
Alternatively, copy the downloaded libdeepspeech.so
and deepspeech.h
files
to directories that are searched by default, e.g. /usr/local/lib
and
/usr/local/include
, respectively.
Run the following command:
$ go get -u github.com/asticode/go-astideepspeech/...
Run the following commands:
$ mkdir /tmp/deepspeech
$ cd /tmp/deepspeech
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.0/deepspeech-0.9.0-models.pbmm
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.0/deepspeech-0.9.0-models.scorer
Run the following commands:
$ cd /tmp/deepspeech
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.0/audio-0.9.0.tar.gz
$ tar xvfz audio-0.9.0.tar.gz
Run the following commands (make sure $GOPATH/bin
is in your $PATH
):
$ cd /tmp/deepspeech
$ deepspeech -model deepspeech-0.9.0-models.pbmm -scorer deepspeech-0.9.0-models.scorer -audio audio/2830-3980-0043.wav
Text: experience proves this
$ deepspeech -model deepspeech-0.9.0-models.pbmm -scorer deepspeech-0.9.0-models.scorer -audio audio/4507-16021-0012.wav
Text: why should one hall on the way
$ deepspeech -model deepspeech-0.9.0-models.pbmm -scorer deepspeech-0.9.0-models.scorer -audio audio/8455-210777-0068.wav
Text: your power is sufficient i said