
react-native-spokestack

React Native wrapper for the Spokestack speech activity detection/automated speech recognition project.


Getting started

$ npm install react-native-spokestack --save

  • React Native: 0.60.0+
  • Android: Android SDK 26+
  • iOS: iOS 13+

(Mostly) automatic installation

Android

$ react-native link react-native-spokestack

iOS

Prerequisites

  1. iOS 13, Swift 5.0

Installation

  1. Edit YOUR_PROJECT's Podfile and add the following contents:
platform :ios, '13.0'

target 'YOUR_PROJECT' do

  use_frameworks!

  pod 'RNSpokestack', :path => '../node_modules/react-native-spokestack'

  use_native_modules!
  use_modular_headers!

end
  2. pod install

RN 0.58+ notes

  • In the Podfile remove jschelpers from the React subspec. (reference)
  • If using RN 0.58.0 - 0.58.4, an additional header path needs to be added to the jsiexecutor subspec in node_modules/react-native/React.podspec. You may use patch-package as a workaround. This issue was addressed and fixed in RN 0.58.5.
  • Remove all lib* files from Link Binary with Libraries under your project target in Xcode. This prevents dueling installations of React.

...thus we need to ensure that they reference the same React Native library, which you link to from the outer project.

Manual installation

Android

  1. Open up android/app/src/main/java/[...]/MainApplication.java (could also be called MainActivity.java or similar)
  • Add import io.spokestack.RNSpokestack.RNSpokestackPackage; to the imports at the top of the file
  • Add new RNSpokestackPackage() to the list returned by the getPackages() method (see the sketch after this list)
  2. Append the following lines to android/settings.gradle:
    include ':react-native-spokestack'
    project(':react-native-spokestack').projectDir = new File(rootProject.projectDir, '../node_modules/react-native-spokestack/android')

  3. Insert the following lines inside the dependencies block in android/app/build.gradle:
      implementation project(':react-native-spokestack')
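
For reference, a minimal sketch of the MainApplication.java edits from step 1, assuming the stock React Native template; the package name and surrounding boilerplate are placeholders, and only the two lines marked as additions come from the steps above:

package com.yourapp; // placeholder package name

import android.app.Application;
import com.facebook.react.ReactApplication;
import com.facebook.react.ReactPackage;
import com.facebook.react.shell.MainReactPackage;
import io.spokestack.RNSpokestack.RNSpokestackPackage; // addition: Spokestack import

import java.util.Arrays;
import java.util.List;

public class MainApplication extends Application implements ReactApplication {
  // ...template boilerplate elided...

  protected List<ReactPackage> getPackages() {
    return Arrays.asList(
        new MainReactPackage(),
        new RNSpokestackPackage() // addition: Spokestack package
    );
  }
}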
    

iOS (not using CocoaPods)

  • Currently only buildable on an x86_64 target

  • Drag the RNSpokestack.xcodeproj from the react-native-spokestack/ios folder to the Libraries group in Xcode in your project.

  • Click on your main project file (the one that represents the .xcodeproj), select Build Phases, and drag the static library, libRNSpokestack.a, from the Libraries/RNSpokestack.xcodeproj/Products folder to Link Binary With Libraries

Link the necessary libraries:
  • Project Build Phases

    • Link Binary with Libraries:

      • /node_modules/react-native-spokestack/Frameworks
      • AVFoundation
      • SpokeStack.framework
    • Copy Bundle Resources:

      • gRCPCertificate.bundle
  • General

    • Always Embed Swift Standard Libraries: Yes
    • Embedded Binaries:
      • /node_modules/react-native-spokestack/Frameworks
      • SpokeStack.framework
    • Linked Frameworks and Binaries:
      • /node_modules/react-native-spokestack/Frameworks
      • SpokeStack.framework

Android Support

android/build.gradle

Make sure the Google repo is listed first

buildscript {
    ...
    repositories {
        google()
        maven { url 'https://csspeechstorage.blob.core.windows.net/maven/' }
        //...
    }
}

allprojects {
    repositories {
        google()
        maven { url 'https://csspeechstorage.blob.core.windows.net/maven/' }
        //...
    }
}

android/app/build.gradle


...
android {
    ...
    packagingOptions {
        exclude 'project.properties'
        exclude 'META-INF/INDEX.LIST'
        exclude 'META-INF/DEPENDENCIES'
    }
}
...

android/app/src/main/AndroidManifest.xml

<!-- for wakeword & ASR -->
<uses-permission android:name="android.permission.RECORD_AUDIO" />
<!-- for TTS -->
<uses-permission android:name="android.permission.INTERNET" />

Usage

JavaScript

The example below uses the system-provided AndroidSpeechRecognizer as the default since it's free to use. It's not available on 100% of devices, though; see our ASR documentation for more information. If you use a different ASR provider, you'll also need to change the input line to:

input: "io.spokestack.spokestack.android.MicrophoneInput",

import Spokestack from "react-native-spokestack";

// Initialize the Spokestack pipeline.
//
// Spokestack configuration has five top-level keys: 'input', 'stages', and
// 'properties' for the speech pipeline, 'tts' for text-to-speech, and 'nlu'
// for natural language understanding. The 'tts' and 'nlu' keys may be omitted
// if your app does not require them.
//
// This example configures a voice-triggered speech recognizer.
// For additional examples, see https://github.com/spokestack/spokestack-android#configuration
Spokestack.initialize({
  input: "io.spokestack.spokestack.android.PreASRMicrophoneInput", // provides audio input into the pipeline
  stages: [
    "io.spokestack.spokestack.webrtc.VoiceActivityDetector", // voice activity detection
    "io.spokestack.spokestack.webrtc.VoiceActivityTrigger", // voice activity detection triggers speech recognition
    "io.spokestack.spokestack.ActivationTimeout", // speech recognition times out after a configurable interval when voice is no longer detected
    "io.spokestack.spokestack.android.AndroidSpeechRecognizer" // one of the three supported speech recognition services
    // "io.spokestack.spokestack.google.GoogleSpeechRecognizer"
    // "io.spokestack.spokestack.microsoft.AzureSpeechRecognizer"
  ],
  properties: {
    "locale": "en-US",
    "agc-compression-gain-db": 15,
    "google-credentials": YOUR_GOOGLE_VOICE_CREDENTIALS, // only set if using the GoogleSpeechRecognizer stage above
    "trace-level": Spokestack.TraceLevel.DEBUG // configurable logging level
  },
  tts: {
    "ttsServiceClass": "io.spokestack.spokestack.tts.SpokestackTTSService",
    // TTS API account properties. Only set these if you have a Spokestack account.
    "spokestack-id": "f0bc990c-e9db-4a0c-a2b1-6a6395a3d97e", // your Spokestack API ID
    "spokestack-secret": "5BD5483F573D691A15CFA493C1782F451D4BD666E39A9E7B2EBE287E6A72C6B6" // your Spokestack API secret
  },
  nlu: {
    // NLU settings. Only set these if you are calling Spokestack.classify.
    "nlu-model-path": YOUR_NLU_MODEL_PATH, // string filesystem path to the NLU model
    "nlu-metadata-path": YOUR_NLU_METADATA_PATH, // string filesystem path to the NLU metadata
    "wordpiece-vocab-path": YOUR_NLU_VOCABULARY_PATH // string filesystem path to the NLU vocabulary
  }
});

// Speech Pipeline

// Start and stop the speech pipeline. All methods can be called repeatedly.
Spokestack.start(); // start speech pipeline. can only start after initialize is called.
Spokestack.stop(); // stop speech pipeline
Spokestack.activate(); // manually activate the speech pipeline. The speech pipeline is now actively listening for speech to recognize.
Spokestack.deactivate(); // manually deactivate the speech pipeline. The speech pipeline is now passively waiting for an activation trigger.
// Binding to speech pipeline events
const logEvent = e => console.log(e);
Spokestack.onActivate = logEvent;
Spokestack.onDeactivate = logEvent;
Spokestack.onError = e => {
  Spokestack.stop();
  logEvent(e);
};
Spokestack.onTrace = e => { // subscribe to tracing events according to the trace-level property
  logEvent(e);
  console.log(e.message);
};
Spokestack.onRecognize = e => {
  logEvent(e);
  console.log(e.transcript); // "Hello Spokestack"

  // NLU and TTS

  // Classify the intent and slots of the transcript
  Spokestack.classify(e.transcript, {});
  // Get a URL to a real-time synthesis of the transcript
  Spokestack.synthesize({ "input": e.transcript, "format": Spokestack.TTSFormat.TEXT, "voice": "demo-male" });
};
// Receive the transcript classification result
Spokestack.onClassification = e => {
  logEvent(JSON.stringify(e));
  console.log(e.result.intent);
};
// Receive the real-time transcript synthesis result
Spokestack.onSuccess = e => {
  logEvent(JSON.stringify(e));
  console.log(e.url); // https://api.spokestack.io/stream/g2dkABVnYXRld2F5QDE3Mi4yNy4xMi4yNDQAACeUAAAAAgE
};

API

Methods

| Method Name | Description | Method Values | OS |
| --- | --- | --- | --- |
| Spokestack.initialize() | Initialize the speech pipeline; required for all other methods | | Android, iOS |
| Spokestack.start() | Starts the speech pipeline. The pipeline starts in the deactivated state. | | Android, iOS |
| Spokestack.stop() | Stops the speech pipeline | | Android, iOS |
| Spokestack.activate() | Manually activate the speech pipeline | | Android, iOS |
| Spokestack.deactivate() | Manually deactivate the speech pipeline | | Android, iOS |
| Spokestack.synthesize({'input': string, 'format': int, 'voice': string}) | Request a URL to an audio file of the specified voice speaking the input | format [0: text, 1: ssml, 2: speechmarkdown], voice ["demo-male"] | Android, iOS |
| Spokestack.classify(utterance: string, {}) | Classify the utterance with an intent/slot natural language understanding model | utterance: string, context: dictionary (currently unused, can be empty) | Android, iOS |
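
As a quick illustration of the two request methods above, a minimal sketch with parameter shapes taken from the table (the input strings are placeholders; results arrive via the events in the next section):

Spokestack.synthesize({
  "input": "Hello Spokestack",
  "format": Spokestack.TTSFormat.TEXT, // 0: text, 1: ssml, 2: speechmarkdown
  "voice": "demo-male"
}); // delivered via onSuccess (or onFailure on error)

Spokestack.classify("set a timer for one minute", {}); // delivered via onClassification; the context dictionary is currently unused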

Events

| Event Name | Property | Description | OS |
| --- | --- | --- | --- |
| onActivate(event) | null | Invoked when the speech pipeline is activated, which enables the speech recognizer and begins a new dialogue session | Android, iOS |
| onDeactivate(event) | null | Invoked when the speech pipeline has been deactivated | Android, iOS |
| onStart(event) | null | Invoked when the speech pipeline is started | Android, iOS |
| onStop(event) | null | Invoked when the speech pipeline has been stopped | Android, iOS |
| onRecognize(event) | transcript: string | Invoked when speech has been recognized | Android, iOS |
| onTimeout(event) | null | Invoked when no speech has been detected for wake-active-max after activation | Android, iOS |
| onTrace(event) | message: string | Invoked when a trace message becomes available | Android |
| onError(event) | error: string | Invoked upon an error in the speech pipeline execution | Android, iOS |
| onSuccess(ttsEvent) | url: string | Invoked upon a successful TTS synthesis request | iOS |
| onFailure(ttsEvent) | error: string | Invoked upon a failed TTS synthesis request | iOS |
| onClassification(nluEvent) | result: dictionary | Invoked upon a successful NLU utterance classification | iOS |
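
The Usage section above covers most of these handlers; for completeness, a sketch of the remaining ones (the log messages are illustrative):

Spokestack.onStart = e => console.log("pipeline started");
Spokestack.onStop = e => console.log("pipeline stopped");
Spokestack.onTimeout = e => console.log("activation timed out after wake-active-max");
Spokestack.onFailure = e => console.log(e.error); // failed TTS synthesis request (iOS)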

Dictionaries

nluEvent

| Key    | Value |
| ------ | ----- |
| result | dict  |

result

| Key        | Value  |
| ---------- | ------ |
| intent     | string |
| confidence | string |
| slots      | dict   |

slots

| Key   | Value  |
| ----- | ------ |
| type  | string |
| value | string |
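
Putting these tables together, a sketch of unpacking an nluEvent; this assumes the slots dictionary is keyed by slot name, with each entry carrying the type and value keys above:

Spokestack.onClassification = e => {
  const { intent, confidence, slots } = e.result;
  console.log(intent, confidence);
  for (const name of Object.keys(slots)) {
    const slot = slots[name]; // assumed slot layout: { type, value }
    console.log(name, slot.type, slot.value);
  }
};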

Enums

Trace

| TraceLevel | Value |
| ---------- | ----- |
| DEBUG      | 10    |
| PERF       | 20    |
| INFO       | 30    |
| NONE       | 100   |

Format

| TTSFormat      | Value |
| -------------- | ----- |
| TEXT           | 0     |
| SSML           | 1     |
| SPEECHMARKDOWN | 2     |

Gotchas

Android

  • Requires Android SDK 26 level support
  • Requires Gradle 3.0.1+ (classpath 'com.android.tools.build:gradle:3.0.1' in root build.gradle dependencies)
  • Add app setting for microphone permission

iOS

  • Add app setting for microphone permission (NSMicrophoneUsageDescription) and speech recognition (NSSpeechRecognitionUsageDescription)
  • Spokestack on iOS does not manage AudioSession settings. The client app is required to implement whatever AudioSession category and options are necessary. At minimum, the session category should allow for recording, e.g. AVAudioSessionCategoryRecord or AVAudioSessionCategoryPlayAndRecord. A simple AudioSession setting, suitable for insertion in AppDelegate.m, could be:
  [[AVAudioSession sharedInstance] setCategory:AVAudioSessionCategoryPlayAndRecord mode:AVAudioSessionModeDefault options:AVAudioSessionCategoryOptionDefaultToSpeaker  error:nil];
  [[AVAudioSession sharedInstance] setActive:YES error:nil];

Release

  1. Ensure that CocoaPods has been installed via gem, not via brew
  2. Increment version in package.json
  3. git commit -a -m 'YOUR_COMMIT_MESSAGE' && git tag YOUR_VERSION && git push && git push --tags
  4. pod spec lint --use-libraries --allow-warnings --use-modular-headers, which should pass all but one check (expect ERROR | [iOS] xcodebuild: Returned an unsuccessful exit code. You can use --verbose for more information.)
  5. edit /Library/Ruby/Gems/YOUR_RUBY_VERSION/gems/cocoapods-trunk-YOUR_COCOAPODS_VERSION/lib/pod/command/trunk/push.rb, comment out validate_podspec_files (https://github.com/CocoaPods/CocoaPods/blob/master/lib/cocoapods/command/repo/push.rb#L77)*
  6. pod trunk register YOUR_EMAIL --description='release YOUR_PODSPEC_VERSION'
  7. npm publish to release on NPM
  8. pod trunk push --use-libraries --allow-warnings --use-modular-headers
  • Since RNSpokestack iOS requires React Native headers but does not include any React Native dependencies, it will not compile by itself; it needs a client library that does include the React Native dependency.

License

Copyright 2020 Spokestack, Inc.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

  http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.