/deep-speech-unity

A Unity implementation of DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices.

Primary LanguageC#Mozilla Public License 2.0MPL-2.0

deep-speech-unity

A Unity implementation of DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices. I was inspired to create this after seeing a different implementation by @voxell-tech

The sample scene includes two versions of voice recognition processing.

  • The ContinuousVoiceRecorder script feeds the audio into DeepSpeech realtime and processess the intermediate result.
  • The SpeechTextToText detects the users voice and processes the audio after the user stops talking.

Both examples run offline and can auto detect if the user is speaking using a volume threshold.

Developed using

  • Windows 64
  • Unity 2020.3.12

Note

This demo only runs on Windows 64 however, DeepSpeech does support other platforms and functionality can be expanded. Pull Requests are welcomed.