Jarbas

Stable branch

This branch is a work in progress, all jarbas changes are being merged one by one to make this a stable version

Check the jarbas_docs folder for more info on each change as it is added

This should install in a virtual env just like mycroft, only debian scripts were updated

virtual env is named jarbas , it should be possible to have both mycroft and jarbas at same time, each one running in a different virtual env

whenever official mycroft documentation mentions "workon mycroft" use "workon jarbas"

not everything will work out of the box, expect bumps

no online config, must edit files
no mycroft stt, must configure Kaldi or other online service
no shared api keys (twitter/facebook/weather/wolfram...)
some skills not fully functional / added yet / configured for server

Want to help me work on mycroft full time?

Support me on Patreon

Or send me some magic internet money to 15B4ZQFY5UfopjRtSRQPx7ibRaFukE1Ntq

The idea

Personal assistants today are glorified voice activated script runners, with the bonus of mining your data for some company

Stick some hardware and they are glorified alarm clocks than run scripts instead

Jarbas fork intends to show that mycroft is much more than this by harnessing the power of open source, an AI assistant should do "AI stuff" on demand, where is the AI in mycroft/jarbas?

TODO links in each one

The essential parts that make it work, many options available for each

    - speech to text
    - text to speech
    - intent from text (text to action)

The "AI stuff" it can already do for you

    - Image classification
    - Object recognition
    - Face recognition
    - Voice recognition (wip)
    - Deep Dreaming
    - Generate names with Restricted Boltzman Machines
    - Generate Poetry with MarkovChains
    - Copy the style of any artist using Style Transfer
    - Colorize Black and White photos
    - Chatbot using AIML
    - Auto complete your stories with a Char-RNN
    - Learn and build a database of what it learns with LILACS

And dont forget the "not really AI" stuff

    - post to facebook / twitter / instagram
    - control a browser like a real human and do any kind of stuff with it
    - generate psychedelic images using mathematics
    - search the internet for answers
    - Search and play music from youtube (or spotify or more places!)
    - Stuff not worth mentioning like parrot back, dictation, tell jokes, hows
    the weather, which asteroids are near earth, how many sunspots currently
    visible, show you the latest image of earth from orbit..

In practice it can act as a vocal search engine, a helper for disabled people, a desktop automation/OS framework, an artificial pet, a Art assistant, a Writers Assistant, an educational toy for children, A work of art by its own merit...

Changes in this fork

        - no pairing
        - no config update from mycroft.ai
        - added "intent.start", "intent.end", "intent.failed" messages
        - skills/core.py SKILLS_DIR is now jarbas_skills folder instead of opt
        - do not remove articles in normalize (too much normalization IMO)
        - allow for multiple hot words of various kinds
        - offline stt (pocketpshinx) <- WIP missing model
        - wakeword/hotword/standupword snowboy support
        - fallback skill order override in config file
        - message_context update methods added to MycroftSkill
        - update_config method added to MycroftSkill
        - screen display service
        - speech messages set adapt context for LastSpeech and every metadata field
        - recognizer_loop:speak message type
        - runtime configuration file
        - ConfigurationManager.save checks for config existence before loading it
        - webchat client
        - many helper classes

        # temporary changes
        - no msm

Intent Layers

IntentLayers class, allows to activate and deactivate intents grouped in layers

Example usage in KonamiCode Skill

Services

Backend Service, A helper class to emit and wait for bus messages was added, needed in several skills

Used for inter skill communication in image recognition, face recognition, object recognition, deep dream, style transfer, vision, user manager...

MarkovChain

Markov Chain helper class, save and load functionality

Example usage in Poetry Skill

Art util

Generate random pictures utility based on this blog post

Example usage in Art Skill

PR#859

IntentParser class

        - helper class added to intent_service
        - intent from utterance
        - skill from intent
        - skill from utterance

PR#858

priority skills list

        - priority skills are loaded first
        - read from config

PR#860

needed for intent_layers

        - enable / disable intent can be requested in messagebus

PR#783

control skills externally, allow run-levels, external skill load and reload)

        - skills have internal id, internal id used instead of name in intent messages
        - skills main now listens for external messages for skill_manifest, skill_shutdown and skill_reload

PR#790

target bus messages

        - uses message context to target messages to source of utterance
        - uses message context to mute (not trigger tts)
        - uses message context to tell if more speech is expected
        - adds "metadata" field to speak method
        - clients check if they are the target of utterance before processing it
        - adds data to message context:
            - target = cli/speech/facebook - default = "all"
            - mute = trigger tts or not - default = "false"
            - more_speech = more speak messages expected- default = "False"
            - destinatary = used to track remote_client to send message to
            - source = source of message, default = source skill / source client

PR#556

stop speak messages

        - adds disable_speak_flag to speech and cli
        - this flag is a global mute switch, if set no tts is ever made
        - skill to mute/unmute not yet merged

Server/Client

jarbas networking

        - client / server (a "firewall" that connects to the mycroft messagebus )
        - client connects to server, by secure websockets
        - server sends public pgp key
        - client sends public pgp encrypted with server pgp
        - server sends AES key encrypted with client pgp, client pgp key authorizes client / creates account in server
        - all communication after is AES encrypted, if plaintext received client is kicked
        - server does all kinds of processing (receiving files, requesting stuff from clients, authorizing action per client)
        - on first run a new pgp key is automatically created (client and server)

client server basically connect jabas/mycroft instances, server can work as:

        - fallback mechanism
        - chat room
        - relay
        - ...

Essential Skills

Lots of skills pre-packaged

Browser Service

Objectives Service

TTS Control - change voice at runtime vocally

Control Center - shutdown/restart skills , enforce run levels

Vision Service - process webcam input

RBM Service - Generate new words

Object Recognition

Image Classification

Face Recognition

Dictation - Write to txt user speech

News - Plays news

Mute - Mute TTS

Again - Repeat last action

LILACS - Search wikipedia, wikidata, dbpedia, conceptnet, wordnik, wolframalpha and learn

TODO finish this

Blacklisted skills by default

some skills are blacklisted because they are either not common, heavy, or because i run them in server

browser service
server fallback
LILACS_core
LILACS_storage
LILACS_teach
LILACS_users
LILACS_curiosity
deep dream
image recognition
object recognition
colorize
vision
facebook
twitter
instagram

Skills that fallback to server

these skills have a flag in their init method to ask server or run locally

consider them broken, get them from server branch if you really want them

image recognition
face recognition
object recognition
deep dream
style transfer
colorize

these skills have a flag to ask client

vision service # wip

Currently working on

not updated often

Help Skill
R2D2 TTS
instagram skill
pocket sphinx offline stt
kaldi install scripts
class vizualization/deep draw in tensorflow
models download script (they auto download on first skill run which takes...long)

TH4NOS/JarbasAI