reference-system-autoware

With the distributed development of ROS across many different organizations it is sometimes hard to benchmark and concretely show how a certain change to a certain system improves or reduces the performance of that system. For example did a change from one executor to another actually reduce the CPU or was it something else entirely?

In order to try and address this problem we at Apex.AI would like to propose a definition of a reference system that simulates a real world scenario - in this case Autoware.Auto and its lidar data pipeline - that can be repeated no matter the underlying change of any piece of the full stack (i.e. executor, DDS or even RMW).

Future reference systems could be proposed that are more complex using the same basic node building blocks developed for this first scenario.

Reference System

A reference system is defined by:

A platform is defined by:
- Hardware (e.g. an off-the-shelf single-board computer, embedded ECU, etc.)
  - if there are multiple configurations available for such hardware, ensure it is specified
- Operating System (OS) like RT linux, QNX, etc. along with any special configurations made
for simplicity and ease of benchmarking, all nodes must run on a single process
a fixed number of nodes
- each node with:
  - a fixed number of publishers and subscribers
  - a fixed processing time or a fixed publishing rate
a fixed message type of fixed size to be used for every node.

With these defined attributes the reference system can be replicated across many different possible configurations to be used to benchmark each configuration against the other in a reliable and fair manner.

With this approach unit tests can also be defined to reliably confirm if a given reference system meets the requirements.

Supported Platforms

To enable as many people as possible to replicate this reference system, the platform(s) were chosen to be easily accessible (inexpensive, high volume), have lots of documentation, large community use and will be supported well into the future.

Platforms were not chosen for performance of the reference system - we know we could run “faster” with a more powerful CPU or GPU but then it would be harder for others to validate findings and test their own configurations. Accessibility is the key here and will be considered if more platforms want to be added to this benchmark list.

Platforms:

Raspberry Pi 4B:
- 4 GB RAM version is the assumed default
  - other versions could also be tested / added by the community
- real-time linux kernel

Note: create an issue to add more platforms to the list, keeping in mind the above criteria

Concept Overview

Rather than trying to write code to cover all potential variations of executors, APIs, and future features we cannot even imagine today we have chosen instead to define what we call a “reference system” based on part of a real-world system, Autoware.Auto.

The above node graph can be boiled down to only a handful of node "types" that are replicated to make this complex system:

Node Types:

Sensor Node
- input node to system
- publishes message cyclically at some fixed frequency
Transformer Node
- one subscriber, one publisher
- starts processing for N milliseconds after a message is received
- publishes message after processing is complete
Fusion Node
- two subscribers, one publisher
- starts processing for N milliseconds after a message is received from all subscriptions
- publishes message after processing is complete
Reactor Node
- 0..N subscribers, one publisher
- cyclically publishes at least one message or one for every received message
Command Node
- prints output stats everytime a message is received

These basic building-block nodes can be mixed-and-matched to create quite complex systems that replicate real-world scenarios to benchmark different configurations against each other.

Reference Systems Overview

The first reference system benchmark proposed is based on the Autoware.Auto lidar data pipeline as stated above and shown in the node graph image above as well.

Reference System Autoware.Auto
- ROS2:
  - Executors:
    - Default:

Results below show various characteristics of the same simulated system (Autoware.Auto).

To add your own executor / middleware / configuration to the list above follow the Contributing section below.

Benchmark Results

Results will be added to different tagged releases along with the specific configurations ran during the tests.

Unit Testing

Unit tests can be written for the reference system to check if all nodes, topics and other requirements are met before accepting test results.

Tests are provided to automatically generate results for you by running colcon test on a supported platform above.

To run the test, simply run the following command from your workspace:

colcon test  --packages-up-to reference_system_autoware

Alternatively if for some reason you do not or cannot use colcon the tests are simple gtests that can be ported and ran on any configuration.

Contributing

If you see a missing configuration on the list above that you would like to see benchmarked against please follow the steps below to request it to be added.

look over the open / closed issues to make sure there isn't already an open ticket for the configuration you are looking for create include/reference_system/MY_EXECUTOR_NAME_nodes

Howto Implement Your Custom Executor

Read over the above documentation on the base node types
Review the base rclcpp nodes that are provided and determine if your executor can use them
If you cannot, implment your own version of each base node type and place the source in include/reference_system/nodes
Add your new nodes as a seperate node system in include/reference_system/system/systems.hpp
Copy one of the provided example .cpp files from the src/ros2/executor directory and replace the create_autoware_nodes template type with your new node system which should be in the system/systems.hpp file already included
Add new .cpp source file as a new executable in the CMakelist.txt
Add new executable to test wtihin the CMakelist.txt
Build and run tests!

ApexAI/reference-system-autoware