##The Care and Feeding of #SMC ##The State Machine Compiler
SMC is a Java application that translates a state transition table into a program that implements the described state machine. Output languages include Java, C, and C++. Adding other languages is trivial.
###Command Line
java -jar smc.jar -l <language> -o <directory> -f <flags>
<language>
is eitherC
,Cpp
, orJava
.<directory>
is the output directory. Your new state machine will be written there.<flags>
currently for Java only.package:package_name
will put the appropriatepackage
statement in the generated code.
###Syntax
The syntax for the state transition table is based on a simple state transition table. Here is a straightforward example that describes the logic of a subway turnstile. turnstile.sm
:
Initial: Locked
FSM: Turnstile
{
Locked Coin Unlocked unlock
Locked Pass Locked alarm
Unlocked Coin Unlocked thankyou
Unlocked Pass Locked lock
}
When this is run through SMC it produces the source code for a state machine named Turnstile
. That machine starts in the Locked
state, and follows the following logic:
- Given we are in the
Locked
state, when we get aCoin
event, then we transition to theUnlocked
state and invoke theunlock
action. - Given we are in the
Locked
state, when we get aPass
event, then we stay in theLocked
state and invoke thealarm
action. - Given we are in the
Unlocked
state, when we get aCoin
event, then we stay in theUnlocked
state and invoke thethankyou
action. - GIven we are in the
Unlocked
state, when we get aPass
event, then we transition to theLocked
state and invoke thelock
action.
###Opacity One of the goals of SMC is to produce code that the programmer never needs to look at, and does not check in to source code control. It is intended that SMC will generate the appropriate code during the pre-compile phase of your build.
The output of SMC is two sets of functions: The Event functions and the Actions functions. For most languages these functions will be arranged into an abstract class in which the Event functions are public, and the Action functions are protected and abstract.
The programmer derives an implementation class from the generated class, and implements all the action functions. The programmer then creates an instance of the implementation class and invokes the appropriate event functions as those events occur. The generated code will make sure that the appropriate action functions are called in response to those events.
The state of the generated state machine is opaque to the programmer and kept private to the generated code. From the programmer's point of view the generated code is a black box that translates events into actions.
Here is a UML diagram that depicts the situation. The programmer writes the user
and the Implementation
. The generated Turnstile
class is opaque.
+---------------+
+------+ | <<generated>> |
| user |------>| Turnstile |
+------+ +---------------+
| + Coin() |
| + Pass() |
+---------------+
A
|
|
+-------------------------+
| Implementation |
+-------------------------+
| # lock() |
| # unlock() |
| # thankyou() |
| # alarm() |
| # unhandledTransition() |
+-------------------------+
Here is the Java code that SMC will generate for this state machine. It is an abstract class that provides a simple nested switch/case implementation, a set of public event functions, and a set of protected abstract methods for the action functions.
public abstract class Turnstile {
public abstract void unhandledTransition(String state, String event);
private enum State {Locked,Unlocked}
private enum Event {Coin,Pass}
private State state = State.Locked;
private void setState(State s) {state = s;}
public void Coin() {handleEvent(Event.Coin);}
public void Pass() {handleEvent(Event.Pass);}
private void handleEvent(Event event) {
switch(state) {
case Locked:
switch(event) {
case Coin:
setState(State.Unlocked);
unlock();
break;
case Pass:
setState(State.Locked);
alarm();
break;
default: unhandledTransition(state.name(), event.name()); break;
}
break;
case Unlocked:
switch(event) {
case Coin:
setState(State.Unlocked);
thankyou();
break;
case Pass:
setState(State.Locked);
lock();
break;
default: unhandledTransition(state.name(), event.name()); break;
}
break;
}
}
protected abstract void thankyou();
protected abstract void unlock();
protected abstract void alarm();
protected abstract void lock();
}
It is often more convenient to express the abstract Action functions as an interface, or an abstract class. We can accomplish this by adding the Actions:
header to the state machine description.
Initial: Locked
FSM: Turnstile
Actions: TurnstileActions
{
...
}
The programer will write the TurnstileActions
interface to declare all the Action functions. SMC will generate code that implements that interface. Here how this looks in UML:
+-------------------------+
| <<interface>> |
| Implementation |
+-------------------------+
| + lock() |
| + unlock() |
| + thankyou() |
| + alarm() |
+-------------------------+
A
|
+---------------+
+------+ | <<generated>> |
| user |------>| Turnstile |
+------+ +---------------+
| + Coin() |
| + Pass() |
+---------------+
A
|
|
+-------------------------+
| Implementation |
+-------------------------+
| # lock() |
| # unlock() |
| # thankyou() |
| # alarm() |
| # unhandledTransition() |
+-------------------------+
NOTE: This is optional for Java, but necessary in C and C++. In C, the TurnstileActions
interface is implemented as a struct
holding pointers to functions to the Action functions. (See the test_cases directory).
While the syntax described so far is sufficient for any state machine, there are things we can do to make it more convenient. For example, the Turnstile can be expressed more simply by combining the transitions that share the same starting state:
Initial: Locked
FSM: Turnstile
{
Locked {
Coin Unlocked unlock
Pass Locked alarm
}
Unlocked {
Coin Unlocked thankyou
Pass Locked lock
}
}
Now let's add an Alarming
state that must be reset by a repairman:
Initial: Locked
FSM: Turnstile
{
Locked {
Coin Unlocked unlock
Pass Alarming alarmOn
Reset - {alarmOff lock}
}
Unlocked {
Reset Locked {alarmOff lock}
Coin Unlocked thankyou
Pass Locked lock
}
Alarming {
Coin - -
Pass - -
Reset Locked {alarmOff lock}
}
}
We use the dash (-
) character for two purposes. When used as an action it means that there are no actions to perform. When used as the next-state it means that the state does not change. Note: the star (*
) character can be used as a synonym for dash.
When more than one action should be performed, they can be grouped together in braces ({}
).
###Super States
Notice the duplication of the Reset
transition. In all three states the Reset
event does the same thing. It transitions to the Locked
state and it invokes the lock
and alarmOff
actions. This duplication can be eliminated by using a Super State as follows:
Initial: Locked
FSM: Turnstile
{
// This is an abstract super state.
(Resetable) {
Reset Locked {alarmOff lock}
}
Locked : Resetable {
Coin Unlocked unlock
Pass Alarming alarmOn
}
Unlocked : Resetable {
Coin Unlocked thankyou
Pass Locked lock
}
Alarming : Resetable { // inherits all it's transitions from Resetable.
}
}
The parenthesis around Resetable
indicate that it is an abstract state. The State machine will never actually be in this state. Rather, it exists to be used as a super-state by Locked
, Unlocked
, and Alarming
. Note the use of the colon (:
) character to denote that those three states derive from Resetable
.
A state with a super-state inherits all the transitions of that super-state. Super-state transitions can be overridden if necessary. What's more, a state can derive from more than one super-state by using additional colon operators as follows:
state : superstate1 : superstate2 {...
Super-states do not have to be abstract. A state can derive from any other state, whether abstract or not. However, if we mark a state as abstract, then SMC will ensure that it is never used as the target of a transition. The state machine will never be in that state.
A comment is any string beginning with two slashes, and ending with a line-end. They can be placed at the start of a line as in the example above; or they can be placed at the end of a line.
In the previous example, the fact that the alarm is turned on every time the Alarming
state is entered and is turned off every time the Alarming
state is exited, is hidden within the logic of several different transitions. We can make it explicit by using entry actions and exit actions.
Initial: Locked
FSM: Turnstile
{
(Resetable) {
Reset Locked -
}
Locked : Resetable <lock {
Coin Unlocked -
Pass Alarming -
}
Unlocked : Resetable <unlock {
Coin Unlocked thankyou
Pass Locked -
}
Alarming : Resetable <alarmOn >alarmOff - - -
}
The less-than (<
) character denote an entry-action. It is invoked whenever the state is entered. Likewise the greater-than (>
) character denotes an exit-action which is invoked whenever the state is exited.
In the above example, notice that nearly all the actions have been restated as entry- and exit-actions. You may find that this makes the state machine more readable.
The Entry- and Exit-actions of superstates are inherited by their derivative states.
Note also that there is a slight semantic difference between the last two examples. If we are in the Locked
state, and we get a Reset
event, then the lock
action will be invoked even though we are already in the locked state. This is because every transition invokes all the exit- and entry-actions, regardless of whether the state is actually changing. Thus, when we are in the Unlocked
state, and we get a Coin
event, even though we stay in the Unlocked
state, the unlock
action will be invoked.
The internal structure of SMC is a simple traditional compiler. Here is a picture:
+-------+ +--------+ +----------+ +-----------+ +-----------+ +--------------+
| Lexer |--->| Parser |--->| Semantic |--->| Optimizer |--->| Generator |--->| Implementing |
+-------+ +--------+ | Analyzer | +-----------+ +-----------+ | Visitor |
+----------+ +--------------+
This closely reflects the package structure of the java code.
- The Lexer translates the source code into a stream of lexical tokens which act as events going into the Parser.
- The Parser is a simple finite state machine that implements the Backus-Naur description of the source code (See below). That state machine is implemented as a simple state transition table held within a Java array of
Transition
objects. The actions of that parser state machine use the Builder pattern to create a Syntax Data Structure. - The Semantic Analyzer ensures that the Syntax Data Structure describes a true finite state machine, and if so, translates it into a Semantic Data Structure that can only hold true finite state machines.
- The Optimizer then translates the Semantic Data Structure into a simple state transition table. It reduces all the super state inheritance, and the entry- and exit-actions back into vanilla states, events, and actions.
- The Generator converts the optimized state transition table into a set of code-generation-nodes that represent a Nested Switch Case statement in a language agnostic way.
- Finally, the Implementing Visitors translate the code-generation-nodes into a true programming language, like Java.
The upshot of all this is that you can generate a new language, like C#, by simply writing a new Implementing Visitor, which is a relatively trivial task. (See Below)
The value of the -l
command line argument is used to find a class whose name is:
smc.generators.<LANGUAGE>CodeGenerator
.
This class decides which Implementing Visitor to create, and how the output files should be written.
You can create a generator for a new language by deriving that class from smc.generators.CodeGenerator
and putting it in the classpath. Check out the source code for the Java code generator. It's pretty straightforward.
The Backus-Naur form (BNF) of the SMC source code is:
<FSM> ::= <header>* <logic>
<header> ::= <name> ":" <name>
<logic> ::= "{" <transition>* "}"
<transition> ::= <state-spec> <subtransition>
| <state-spec> "{" <subtransition>* "}"
<state-spec> :== <state> <state-modifier>*
<state> ::= <name> | "(" <name> ")"
<state-modifier> :== ":" <name>
| "<" <name>
| ">" <name>
<subtransition> :: <event> <next-state> <action>
<action> ::= <name> | "{" <name>* "}" | "-"
<next-state> ::= <state> | "-"
<event> :: <name> | "-"
You may use this program and source code for any purpose at all at your own risk. It is not copyrighted. Have fun!