New: GitHub Discussions for this project is now active! Let's use that for general support questions, and reserve the GitHub Issues section for bugs and feature requests.
A low-memory, fast-switching, cooperative multitasking library using stackless coroutines on Arduino platforms.
This library is an implementation of the
ProtoThreads library for the
Arduino platform. It emulates a stackless coroutine that can suspend execution
using a yield()
or delay()
functionality to allow other coroutines to
execute. When the scheduler makes it way back to the original coroutine, the
execution continues right after the yield()
or delay()
.
There are only 3 classes in this library:
Coroutine
class provides the context variables for all coroutines,CoroutineScheduler
class optionally handles the scheduling,Channel
class allows coroutines to send messages to each other. This is an experimental feature whose API and feature may change considerably in the future.
The library provides a number of macros to help create coroutines and manage their life cycle:
COROUTINE()
: defines an instance of theCoroutine
class or an instance of a user-defined subclass ofCoroutine
COROUTINE_BEGIN()
: must occur at the start of a coroutine bodyCOROUTINE_END()
: must occur at the end of the coroutine bodyCOROUTINE_YIELD()
: yields execution back to the caller, oftenCoroutineScheduler
but not necessarilyCOROUTINE_AWAIT(condition)
: yield untilcondition
becomestrue
COROUTINE_DELAY(millis)
: yields back execution formillis
. Themillis
parameter is defined as auint16_t
.COROUTINE_DELAY_MICROS(micros)
: yields back execution formicros
. Themicros
parameter is defined as auint16_t
.COROUTINE_DELAY_SECONDS(seconds)
: yields back execution forseconds
. Theseconds
parameter is defined as auint16_t
.COROUTINE_LOOP()
: convenience macro that loops foreverCOROUTINE_CHANNEL_WRITE(channel, value)
: writes a value to aChannel
COROUTINE_CHANNEL_READ(channel, value)
: reads a value from aChannel
Here are some of the compelling features of this library compared to others (in my opinion of course):
- low memory usage
- each coroutine consumes only 15 bytes of RAM on 8-bit processors (AVR) and 28 bytes on 32-bit processors (ARM, ESP8266, ESP32)
- the
CoroutineScheduler
consumes only 2 bytes (8-bit) or 4 bytes (32-bit) no matter how many coroutines are active
- extremely fast context switching
- ~5.3 microseconds on a 16 MHz ATmega328P
- ~2.5 microseconds on a 48 MHz SAMD21
- ~1.8 microseconds on a 72 MHz STM32
- ~1.5 microseconds on a 80 MHz ESP8266
- ~0.4 microseconds on a 240 MHz ESP32
- ~1.0 microseconds on 96 MHz Teensy 3.2 (depending on compiler settings)
- uses the computed goto
feature of the GCC compiler (also supported by Clang) to avoid the
Duff's Device hack
- allows
switch
statements in the coroutines
- allows
- C/C++ macros eliminate boilerplate code and make the code easy to read
- the base
Coroutine
class is easy to subclass to add additional variables and functions - fully unit tested using AUnit
Some limitations are:
- A
Coroutine
cannot return any values. - A
Coroutine
is stackless and therefore cannot preserve local stack variables across multiple calls. Often the class member variables or function static variables are reasonable substitutes. - Coroutines are currently designed to be statically allocated, not dynamically created and destroyed. This is mostly because dynamic memory allocation on an 8-bit microcontroller with 2kB of RAM should probably be avoided. Dynamically created coroutines may be added in the future for 32-bit microcontrollers which have far more memory.
- A
Channel
is an experimental feature and has limited features. It is currently an unbuffered, synchronized channel. It can be used by only one reader and one writer.
After I had completed most of this library, I discovered that I had essentially
reimplemented the <ProtoThread.h>
library in the
Cosa framework. The difference is that
AceRoutine is a self-contained library that works on any platform supporting the
Arduino API (AVR, Teensy, ESP8266, ESP32, etc), and it provides a handful of
additional macros that can reduce boilerplate code.
Version: 1.2.4 (2021-01-22)
Changelog: CHANGELOG.md
Important Change in v1.1: This library now depends on the AceCommon library (https://github.com/bxparks/AceCommon). See the Installation section below.
- Hello Coroutines
- Installation
- Documentation
- Comparisons
- Resource Consumption
- System Requirements
- License
- Feedback and Support
- Authors
This is the HelloCoroutine.ino sample sketch which
uses the COROUTINE()
macro to automatically handle a number of boilerplate
code, and some internal bookkeeping operations. Using the COROUTINE()
macro
works well for relatively small and simple coroutines.
#include <AceRoutine.h>
using namespace ace_routine;
const int LED = LED_BUILTIN;
const int LED_ON = HIGH;
const int LED_OFF = LOW;
COROUTINE(blinkLed) {
COROUTINE_LOOP() {
digitalWrite(LED, LED_ON);
COROUTINE_DELAY(100);
digitalWrite(LED, LED_OFF);
COROUTINE_DELAY(500);
}
}
COROUTINE(printHelloWorld) {
COROUTINE_LOOP() {
Serial.print(F("Hello, "));
Serial.flush();
COROUTINE_DELAY(1000);
Serial.println(F("World"));
COROUTINE_DELAY_SECONDS(4);
}
}
void setup() {
delay(1000);
Serial.begin(115200);
while (!Serial); // Leonardo/Micro
pinMode(LED, OUTPUT);
}
void loop() {
blinkLed.runCoroutine();
printHelloWorld.runCoroutine();
}
The printHelloWorld
coroutine prints "Hello, ", waits 1 second, then prints
"World", then waits 4 more seconds, then repeats from the start. At the same
time, the blinkLed
coroutine blinks the builtin LED on and off, on for 100 ms
and off for 500 ms.
The HelloScheduler.ino sketch implements the same
thing using the CoroutineScheduler
:
#include <AceRoutine.h>
using namespace ace_routine;
... // same as above
void setup() {
delay(1000);
Serial.begin(115200);
while (!Serial); // Leonardo/Micro
pinMode(LED, OUTPUT);
CoroutineScheduler::setup();
}
void loop() {
CoroutineScheduler::loop();
}
The CoroutineScheduler
can automatically manage all coroutines defined by the
COROUTINE()
macro, which eliminates the need to itemize your coroutines in
the loop()
method manually.
The HelloManualCoroutine.ino program shows what
the code looks like without the convenience of the COROUTINE()
macro. For more
complex programs, with more than a few coroutines, especially if the coroutines
need to communicate with each other, this coding structure can be more powerful.
#include <Arduino.h>
#include <AceRoutine.h>
using namespace ace_routine;
const int LED = LED_BUILTIN;
const int LED_ON = HIGH;
const int LED_OFF = LOW;
class BlinkLedCoroutine: public Coroutine {
public:
int runCoroutine() override {
COROUTINE_LOOP() {
digitalWrite(LED, LED_ON);
COROUTINE_DELAY(100);
digitalWrite(LED, LED_OFF);
COROUTINE_DELAY(500);
}
}
};
class PrintHelloWorldCoroutine: public Coroutine {
public:
int runCoroutine() override {
COROUTINE_LOOP() {
Serial.print(F("Hello, "));
Serial.flush();
COROUTINE_DELAY(1000);
Serial.println(F("World"));
COROUTINE_DELAY(4000);
}
}
};
BlinkLedCoroutine blinkLed;
PrintHelloWorldCoroutine printHelloWorld;
void setup() {
delay(1000);
Serial.begin(115200);
while (!Serial); // Leonardo/Micro
pinMode(LED, OUTPUT);
blinkLed.setupCoroutine(F("blinkLed"));
printHelloWorld.setupCoroutine(F("printHelloWorld"));
CoroutineScheduler::setup();
}
void loop() {
CoroutineScheduler::loop();
}
The latest stable release is available in the Arduino IDE Library Manager. Two libraries need to be installed since v1.1:
- Search for "AceRoutine". Click Install.
- Search for "AceCommon". Click Install.
The development version can be installed by cloning the 2 git repos:
- AceRoutine (https://github.com/bxparks/AceRoutine)
- AceCommon (https://github.com/bxparks/AceCommon)
You can copy these directories to the ./libraries
directory used by the
Arduino IDE. (The result is a directory named ./libraries/AceRoutine
and
./libraries/AceCommon
). Or you can create symlinks from /.libraries
to these
directories.
The develop
branch contains the latest working version.
The master
branch contains the stable release.
The source files are organized as follows:
src/AceRoutine.h
- main header filesrc/ace_routine/
- implementation filessrc/ace_routine/testing/
- internal testing filestests/
- unit tests which depend on AUnitexamples/
- example programs
- README.md - this file
- Doxygen docs published on GitHub Pages
- USER_GUIDE.md
The following programs are provided under the examples
directory:
- Beginner Examples
- HelloCoroutine.ino
- HelloScheduler.ino: same as
HelloCoroutine
except using theCoroutineScheduler
instead of manually running the coroutines - HelloManualCoroutine.ino: same as
HelloCoroutine
except theCoroutine
subclasses and instances are created and registered manually
- Intermediate Examples
- BlinkSlowFastRoutine.ino: use coroutines to read a button and control how the LED blinks
- BlinkSlowFastManualRoutine.ino:
same as BlinkSlowFastRoutine but using manual
Coroutine
subclasses - CountAndBlink.ino: count and blink at the same time
- Delay.ino: validate the various delay macros
(
COROUTINE_DELAY()
,COROUTINE_DELAY_MICROS()
andCOROUTINE_DELAY_SECONDS()
)
- Advanced Examples
- SoundManager: Use a sound manager coroutine to
control the sounds made by a sound generator coroutine, using the
reset()
function to interrupt the sound generator.
- SoundManager: Use a sound manager coroutine to
control the sounds made by a sound generator coroutine, using the
- Channels
- Pipe.ino: uses a
Channel
to allow a Writer to send messages to a Reader through a "pipe" (unfinished) - Task.ino: uses a
Channel
to allow a Writer to send messages to a Reade (unfinished) - a working example of Channels can be found in the CommandLineInterface package in the AceUtils library (https://github.com/bxparks/AceUtils).
- Pipe.ino: uses a
- Benchmarks
- Internal programs to extract various CPU and memory benchmarks.
- AutoBenchmark.ino: performs CPU benchmarking
- MemoryBenchmark.ino: determines the flash and static memory consumptions of certain AceRoutine features
- ChannelBenchmark.ino: determines the amount
of CPU overhead of a
Channel
by using 2 coroutines to ping-pong an integer across 2 channels
There are several interesting and useful multithreading libraries for Arduino. I'll divide the libraries in to 2 camps:
- tasks
- threads or coroutines
Task managers run a set of tasks. They do not provide a way to resume
execution after yield()
or delay()
.
In order of increasing complexity, here are some libraries that provide broader abstraction of threads or coroutines:
- Littlebits coroutines
- Implemented using Duff's Device which means that nested
switch
statements don't work. - The scheduler has a fixed queue size.
- The context structure is exposed.
- Implemented using Duff's Device which means that nested
- Arduino-Scheduler
- Overrides the system's
yield()
for a seamless experience. - Uses
setjmp()
andlongjmp()
. - Provides an independent stack to each coroutine whose size is configurable at runtime (defaults to 128 for AVR, 1024 for 32-bit processors).
- ESP8266 or ESP32 not supported (or at least I did not see it).
- Overrides the system's
- Cosa framework
- A full-featured, alternative development environment using the Arduino IDE, but not compatible with the Arduino API or libraries.
- Installs as a separate "core" using the Board Manager.
- Includes various ways of multi-tasking (Events, ProtoThreads, Threads, Coroutines).
- The
<ProtoThread.h>
library in the Cosa framework uses basically the same technique as thisAceRoutine
library.
The AceRoutine library falls in the "Threads or Coroutines" camp. The
inspiration for this library came from
ProtoThreads and
Coroutines in C
where an incredibly brilliant and ugly technique called
Duff's Device
is used to perform labeled goto
statements inside the "coroutines" to resume
execution from the point of the last yield()
or delay()
. It occurred to me
that I could make the code a lot cleaner and easier to use in a number of ways:
- Instead of using Duff's Device, I could use the GCC language extension
called the
computed goto.
I would lose ANSI C compatbility, but all of the Arduino platforms
(AVR, Teensy, ESP8266, ESP32) use the GCC compiler and the Arduino
software already relies on GCC-specific features (e.g. flash strings using
PROGMEM
attribute). In return,switch
statements would work inside the coroutines, which wasn't possible using the Duff's Device. - Each "coroutine" needs to keep some small number of context variables.
In the C language, this needs to be passed around using a
struct
. It occurred to me that in C++, we could make the context variables almost disappear by making "coroutine" an instance of a class and moving the context variables into the member variables. - I could use C-processor macros similar to the ones used in AUnit to hide much of the boilerplate code and complexity from the user
I looked around to see if there already was a library that implemented these
ideas and I couldn't find one. However, after writing most of this library, I
discovered that my implementation was very close to the <ProtoThread.h>
module
in the Cosa framework. It was eerie to see how similar the 2 implementations had
turned out at the lower level. I think the AceRoutine library has a couple of
advantages:
- it provides additional macros (i.e.
COROUTINE()
andEXTERN_COROUTINE()
) to eliminate boilerplate code, and - it is a standalone Arduino library that does not depend on a larger framework.
All objects are statically allocated (i.e. not heap or stack).
- 8-bit processors (AVR Nano, UNO, etc):
sizeof(Coroutine)
: 15sizeof(CoroutineScheduler)
: 2sizeof(Channel<int>)
: 5
- 32-bit processors (e.g. Teensy ARM, ESP8266, ESP32)
sizeof(Coroutine)
: 28sizeof(CoroutineScheduler)
: 4sizeof(Channel<int>)
: 12
In other words, you can create 100 Coroutine
instances and they would use only
1400 bytes of static RAM on an 8-bit AVR processor.
The CoroutineScheduler
consumes only 2 bytes of memory no matter how many
coroutines are created. That's because it depends on a singly-linked list whose
pointers live on the Coroutine
object, not in the CoroutineScheduler
. But
the code for the class increases flash memory usage by about 150 bytes.
The Channel
object requires 2 copies of the parameterized <T>
type so its
size is equal to 1 + 2 * sizeof(T)
, rounded to the nearest memory alignment
boundary (i.e. a total of 12 bytes for a 32-bit processor).
The examples/MemoryBenchmark program gathers flash and memory consumption numbers for various boards (AVR, ESP8266, ESP32, etc) for a handful of AceRoutine features. Here are some highlights:
- AVR (e.g. Nano)
- 1 Coroutine: 1098 bytes
- 2 Coroutines: 1326 bytes
CoroutineScheduler()
+ 1 Coroutine: 1240 bytesCoroutineScheduler()
+ 2 Coroutines: 1390 bytes
- ESP8266
- 1 Coroutine: 680 bytes
- 2 Coroutines: 908 bytes
CoroutineScheduler()
+ 1 Coroutine: 808 bytesCoroutineScheduler()
+ 2 Coroutines: 908 bytes
See examples/AutoBenchmark. In summary, the overhead of AceRoutine context switching is about 5 micros on an 8-bit AVR, to as low as 0.41 micros on a 32-bit ESP32.
The library has been extensively tested on the following boards:
- Arduino Nano clone (16 MHz ATmega328P)
- SparkFun Pro Micro clone (16 MHz ATmega32U4)
- SAMD21 M0 Mini (48 MHz ARM Cortex-M0+)
- STM32 Blue Pill (STM32F103C8, 72 MHz ARM Cortex-M3)
- NodeMCU 1.0 (ESP-12E module, 80 MHz ESP8266)
- WeMos D1 Mini (ESP-12E module, 80 MHz ESP8266)
- ESP32 dev board (ESP-WROOM-32 module, 240 MHz dual core Tensilica LX6)
- Teensy 3.2 (96 MHz ARM Cortex-M4)
I will occasionally test on the following hardware as a sanity check:
- Arduino Pro Mini clone (16 MHz ATmega328P)
- Mini Mega 2560 (Arduino Mega 2560 compatible, 16 MHz ATmega2560)
- Teensy LC (48 MHz ARM Cortex-M0+)
This library was developed and tested using:
- Arduino IDE 1.8.13
- Arduino CLI 0.14.0
- Arduino AVR Boards 1.8.3
- Arduino SAMD Boards 1.8.9
- SparkFun AVR Boards 1.1.13
- SparkFun SAMD Boards 1.8.1
- STM32duino 1.9.0
- ESP8266 Arduino 2.7.4
- ESP32 Arduino 1.0.4
- Teensydino 1.53
It should work with PlatformIO but I have not tested it.
The library works on Linux or MacOS (using both g++ and clang++ compilers) using the EpoxyDuino emulation layer.
I use Ubuntu 18.04 and 20.04 for most of my development and sometimes do sanity checks on MacOS 10.14.5.
If you find this library useful, consider starring this project on GitHub. The stars will let me prioritize the more popular libraries over the less popular ones.
If you have any questions, comments and other support questions about how to use this library, please use the GitHub Discussions for this project. If you have bug reports or feature requests, please file a ticket in GitHub Issues. I'd love to hear about how this software and its documentation can be improved. I can't promise that I will incorporate everything, but I will give your ideas serious consideration.
Please refrain from emailing me directly unless the content is sensitive. The problem with email is that I cannot reference the email conversation when other people ask similar questions later.
Created by Brian T. Park (brian@xparks.net).