Generates text from speech audio
# apt install libsndfile-dev ffmpeg
- ASR using Jasper (from NemoToolkit )
To install the packages and its dependencies run.
python setup.py install
or with pip
pip install .[server]
The installation should work on Python 3.6 or newer. Untested on Python 2.7
from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav