SUSI AI on Linux
SUSI.AI Linux an implementation of SUSI.AI capable to run on Linux computers and Linux devices in headless mode. It can be installed on smart speakers, desktop PCs, Linux capable IoT devices, car system, washing machines and more.
Project Overview
The whole SUSI.AI ecosystem consists of the following parts:
* server back-end
|_ susi_server (the brain of the infrastructure, a server which computes answers from queries)
|_ susi_skill_data (the knowledge of the brain, a large collection of skills provided by the SUSI.AI community)
* android front-end
|_ susi_android (Android application which is a client for the susi_server at https://api.susi.ai)
* iOS front-end
|_ susi_iOS (iOS application which is a client for the susi_server at https://api.susi.ai)
* Content Management System for the SUSI.AI Skills - Home of the SUSI.AI community
|_ accounts.susi.ai (React Application, User Account Management for the CMS)
|_ chat.susi.ai (React Application, a client for the susi_server at https://api.susi.ai)
|_ susi_skill_cms (React Application, the content management system for susi skills)
* Smart Speaker - Software to turn a Raspberry Pi into a Personal Assistant
| Several sub-projects come together in this device
|_ susi_installer (Framework which can install all parts on a RPi and Desktops, and also is able to create SUSIbian disk images)
|_ susi_python (Python API for the susi_server at https://api.susi.ai or local instance)
|_ susi_server (The same server as on api.susi.ai, hosted locally for maximum privacy. No cloud needed)
|_ susi_skill_data (The skills as provided by susi_server on api.susi.ai; pulled from the git repository automatically)
|_ susi_linux (a state machine in python which uses susi_python, Speech-to-text and Text-to-speech functions)
This project provides the following functionality:
- Hotword Detection works for hotword "Susi"
- Voice Detection working with Google Speech API / IBM Watson Speech to Text API.
- Voice Output working with Google TTS / IBM Watson TTS / Flite TTS.
- Susi AI response working through Susi AI API Python Wrapper
Installation
susi_linux
is normally installed via the SUSI Installer.
In this case there are binaries for configuration and starting and
others available in $HOME/SUSI.AI/bin
(under default installation settings).
In case of manual installations, the wrappers in wrapper
directory need to
be configured to point to the respective installation directories and location of
the config.json
file.
Setting up and configuring Susi on Linux / RaspberryPi
Configuration is done via the file config.json which normally
resides in $HOME/SUSI.AI/config.json
.
If correctly installed, susi-linux-configure
provides a GUI interface to the configuration
of susi_linux
, while susi-linux-config-generator
provides a CLI interface:
susi-linux-config-generator <stt> <tts> <hotword_detection> <wake_button>
where
stt
is the speech to text service, one of the following choices:google
- use Google STT serviceibm
- IBM/Watson STTsphinx
- PocketSphinx STT system, working offline
tts
is the text to speech service, one of the following choices:google
- use Google TTSibm
- IBM/Watson TTS (login credential necessary)flite
- flite TTS service working offline
hotword_detection
is the choice if you want to use snowboy detector as the hotword detection or noty
to use snowboyn
to use pocket sphinx
wake_button
is the choice if you want to use an external wake button or noty
to use an external wake buttonn
to disable the external wake button
Other interfaces for configuration are available for Android and iOS.
Manual configuration is possible, the allowed keys in config.json
are
Device
: the name of the current deviceWakeButton
: whether a wake button is available or notdefault_stt
: see above for possible settingsdefault_tts
: see above for possible settingsdata_base_dir
: directory where support files are installeddetection_bell_sound
: sound file that is played when detection starts, relative todata_base_dir
problem_sound
: sound file that is played on general errors, relative todata_base_dir
recognition_error_sound
: sound file that is played on detection errors, relative todata_base_dir
flite_speech_file_path
: flitevox speech file, relative todata_base_dir
hotword_engine
: see above for possible settingsusage_mode
: access mode toaccounts.susi.ai
, eitheranonymous
orauthenticated
room_name
: free form description of the roomwatson_tts_config
: a JSON array withusername
andpassword
as keys, providing the credentials for IBM/Watson
For details concerning installation, setup, and operation on RaspberryPi, see the documentation at SUSI Installer.
Information for developers
This section is intended for developer.
Important: Tests before making a new release
- The hotword detection should have a decent accuracy
- SUSI Linux shouldn't crash when switching from online to offline and vice versa (failing as of now)
- SUSI Linux should be able to boot offline when no internet connection available (failing as of now)
Roadmap
- Offline Voice Detection (if possible with satisfactory results)
General working of SUSI
- SUSI.AI follows a finite state system for the code architecture.
- Google TTS and STT services are used as default services but if the internet fails, a switch to offline services PocketSphinx (STT) and Flite (TTS) is made automatically
Run SUSI Linux for development purposes
If installed via the SUSI Installer, systemd unit files are installed:
ss-susi-linux.service
for the user bus, use as user withsystemctl --user start/enable ss-susi-linux
ss-susi-linux@.service
for the system bus, use asroot
user to start a job for a specific user, independent from whether the user is logged in or not:sudo systemctl start/enable ss-susi-linux@USER
By default, it is ran in production mode, where log messages are limited to error and warning only. In development, you may want to see more logs, to help debugging. You can switch it to "verbose" mode by 2 ways:
- Run it manually
- Stop systemd service by
sudo systemctl stop ss-susi-linux
- Use Terminal, cd to
susi_linux
directory and run
python3 -m main -v
or repeat v
to increase verbosity:
python3 -m main -vv
- Change command run by
systemd
- Edit the /lib/systemd/system/ss-susi-linux.service and change the command in
ExecStart
parameter:
ExecStart=/usr/bin/python3 -m main -v --short-log
-
Reload systemd daemon:
sudo systemctl daemon-reload
-
Restart the servive:
sudo systemctl restart ss-susi-linux
-
Now you can read the log via
journalctl
:journalctl -u ss-susi-linux
- or
journalctl -fu ss-susi-linux
to get updated when the log is continuously produced.
The -v
option is actually the same as the 1st method. The --short-log
option is to exclude some info which is already provided by journalctl
. For more info about logging
feature, see this GitHub issue.