/libreactorng

libreactor is a high performance, robust and secure, generic event-driven application framework for Linux

Primary LanguageCMIT LicenseMIT

CodeQL Build

libreactor

About

libreactor is a high performance, robust and secure, generic event-driven application framework for Linux. The design goal is to minimize software overhead, cpu usage, energy consumption and environmental footprint. libreactor is directly built on top of the Linux kernel io_uring system call interface, offering both much simplified access to low level asynchronous kernel calls, as well as high level event-driven abstractions such as HTTP servers. Furthermore libreactor is built completely without third-party dependencies, minimizing supply chain risk.

Key Features

Performance

The current version of libreactor is completely refactored to use the Linux kernel io_uring system call interface, achieving a clear performance jump from the previous epoll()-based version that has been a top contender of the Techempower benchmark for many years (the benchmark is in itself flawed in many ways but should still give an indication of performance potential).

A fast framework can help you achieve performance, but it can not replace the need for optimized software design and architecture. If your software design is flawed from a performance perspective you will not achieve high performance and high availability, regardless of implementation details.

Security

The libreactor pipeline is built with unit-tests that require 100% line coverage, and 100% branch coverage (the latter probably in itself indicates an OCD-diagnosis) to succeed. The above tests are also completed using Valgrind to ensure memory management hygiene. Static analysis is done using CodeQL.

Although this is a refactored version of libreactor, the previous version has been running in production as web API servers 24/7/365 for almost 10 years serving many millions of unique end-users, at consistently very high RPS rates, actually with perfect availability so far. Since there are few moving parts, no real third-party dependencies, and the kernel API is very stable (perhaps less so with io_uring) change management is typically very simple.

Installation

git clone https://github.com/fredrikwidlund/libreactorng.git
cd libreactorng/
./autogen.sh
./configure
make && sudo make install

Usage

cat > main.c <<EOF
#include <reactor.h>

int main()
{
  buffer_t b;

  buffer_construct(&b);
  buffer_load(&b, "/etc/motd");
  buffer_prepend(&b, data_string("--- first line ---\n"));
  buffer_append(&b, data_string("--- last line ---\n"));
  buffer_save(&b, "/tmp/motd");
  buffer_destruct(&b);
}
EOF
gcc -Wall -o main main.c `pkg-config --libs --cflags libreactor`
./main

Data types

Data vectors

Data vectors are generic data containers reducing the need for the common use case of handling separate variables for pointers and size. Data vectors also remove the need for zero-terminated strings, and help reduce strlen() runtime usage.

Example

void out(data_t d)
{
  write(1, data_base(d), data_size(d));
}

int main()
{
  data_t d = data_string("compiled with \"-flto -O2\" this application will not call strlen() later down the call stack\n");
  out(d);
}                                                                                                                                                    ```

Buffers

Buffers offers generic data containers with dynamic memory allocation. Buffers can inserted into, erased from, resized and compacted, saved to and loaded from files.

Example

int main()
{
  buffer_t b;

  buffer_construct(&b);
  buffer_load(&b, "/etc/motd");
  buffer_prepend(&b, data_string("--- first line ---\n"));
  buffer_append(&b, data_string("--- last line ---\n"));
  buffer_save(&b, "/tmp/motd");
  buffer_destruct(&b);
}

Lists

Lists are doubly linked sequence containers, similar to C++ std::list, with O(1) inserts (given a known position) and deletes.

Example

Naive example printing primes up to n (10000).

int main()
{
  list_t l;
  int i, n = 10000, *p_first, *p, *p_next;

  list_construct(&l);
  /* add all integers to n */
  for (i = 1; i < n; i++)
    list_push_back(&l, &i, sizeof i);
  /* skip first integer 1 */
  p_first = list_next(list_front(&l));
  /* remove all multiples of first prime in list */
  while (p_first != list_end(&l) && *p_first <= n / 2)
  {
    p = list_next(p_first);
    while (p != list_end(&l))
    {
      p_next = list_next(p);
      if (*p % *p_first == 0)
        list_erase(p, NULL);
      p = p_next;
    }
    p_first = list_next(p_first);
  }
  /* print remaining integers */
  list_foreach(&l, p)
    printf("%d\n", *p);
  list_destruct(&l, NULL);
}

Vectors

Vectors are dynamically resized arrays, similar to C++ std::vector, with O(1) random access, and O(1) inserts and removals at the end.

Example

Create a vector with the first 50 Fibonacci integers.

void fib(vector_t *v, size_t a, size_t b, int remaining)
{
  if (!remaining)
    return;
  vector_push_back(v, &a);
  fib(v, b, a + b, remaining - 1);
}

int main()
{
  vector_t v;
  int i;

  vector_construct(&v, sizeof (size_t));
  fib(&v, 0, 1, 50);
  for (i = 0; i < vector_size(&v); i++)
    printf("%lu\n", *(size_t *) vector_at(&v, i));
  vector_destruct(&v, NULL);
}

Maps

Maps are associative containers that contains key-value pairs with unique keys, similar to std::unordered_map, with average O(1) random access, inserts, and removals.

Maps come in a generic map_t flavour, as well as mapi_t for integer keys and maps_t for string keys.

Example

Constant lookups against a Fibonacci table.

void fib(mapi_t *m, size_t a, size_t b, int remaining)
{
  if (!remaining)
    return;
  mapi_insert(m, a, 1, NULL);
  fib(m, b, a + b, remaining - 1);
}

int main()
{
  mapi_t m;
  size_t test[] = {233, 234, 377, 378, 610, 611};
  int i;

  mapi_construct(&m);
  fib(&m, 0, 1, 50);
  for (i = 0; i < sizeof test / sizeof *test; i++)
    printf("%lu %s a Fibonacci number\n", test[i], mapi_at(&m, test[i]) ? "is" : "is not");
  mapi_destruct(&m, NULL);
}

UTF-8 strings

UTF-8 compliant strings store an explicit string length to avoid zero termination issues.

Example

Prints a simple unicode string including the zero character.

int main()
{
  string_t s = string_utf8_decode("unicode character \\ud83d\\ude03, zero character \\u0000, some more text...\\n", NULL);

  printf("string length: %lu\n", string_utf8_length(s));
  fwrite(string_base(s), string_size(s), 1, stdout);
}

JSON

JSON support includes an opaque complex value_t container that can dynamic generic data structures runtime, as well as serialize to, and deserialize from fully compliant JSON notation. The value_t type is extendable to support non standard JSON values such as binary data and references.

Example

Creates and serializes a simple value_t object.

int main()
{
  value_t *v, *k;
  string_t s;

  v = value_object();
  value_object_set_release(v, string("a smiley"), value_string(string("\U0001F600")));
  k = value_array();
  value_object_set_release(v, string("some interesting numbers"), k);
  value_array_append_release(k, value_number(184467440737095516.0L));
  value_array_append_release(k, value_number(123.456e-789L));
  value_array_append_release(k, value_number(3.14L));

  s = value_encode(v, 1);
  fwrite(string_base(s), string_size(s), 1, stdout);
  string_release(s);

  value_release(v);
}

Memory pools

Memory pools improves performance when frequently allocating and deallocating objects of a predetermined size.

Example

Allocates an integer and returns it to the pool.

int main()
{
  pool_t p;
  int *x;

  pool_construct(&p, sizeof (int));
  x = pool_malloc(&p);
  *x = 42;
  pool_free(&p, x);
  pool_destruct(&p);
}

Events

Event driven architecture is a core design pattern of a libreactor application and implies using asyncronous operations to avoid blocking.

The simplest possible (do nothing) application is shown below. It will initialize the reactor, process all events, and then release all reasources required for the reactor. Since there is nothing triggering events it will terminate without side expressions.

int main()
{
  reactor_construct();
  reactor_loop();
  reactor_destruct();
}

An event is a value created asyncronously by a producer, for example as a result of a asyncronous kernel syscall, or a high order abstract object.

To define how to receive events a callback and a state is typically prefixed to a low level syscall, or as arguments when constructing an object. The state and data are sent as part of the resulting event (event->state and event->data). Below is a simple application that asyncronously will read from stdin into a buffer. When the reactor_read command is completed the loop will exit.

void callback(reactor_event_t *event)
{
  printf("read returned %d\n", (int) event->data);
}

int main()
{
  char buffer[1024];

  reactor_construct();
  reactor_read(callback, NULL, 0, buffer, sizeof buffer, 0);
  reactor_loop();
  reactor_destruct();
}

The power of the event driven pattern is that actors can operate concurrently in the same thread, with a shared state, without a need for locks, mutexes and semaphores, and without a need for context switching. This allows for highly scalable and performance optimized applications, while also minimizing the risk for race conditions.

Event driven applications tend to be flexible, loosely-coupled and scalable, easier to develop and amenable to change and significantly more tolerant of failure.

Threads

Threads can be spawned and results collected using an event driven pattern.

Example

void async_callback(reactor_event_t *event)
{
  if (event->type == REACTOR_CALL)
  {
    printf("separate thread that can block and manipulate state\n");
    sleep(1);
    printf("exit\n");
  }
}

int main()
{
  reactor_construct();
  reactor_async(async_callback, NULL);
  reactor_loop();
  reactor_destruct();
}

IO Uring

libreactor is built directly on top of the io_uring asynchronous system call interface which is quite complex to work with. Using liburing simplifies the task somewhat but using it to build large applications is still cumbersome. libreactor presents a streamlined and very simple interface to work with.

Besides the usual event loop construction, libreactor defines a simple function call for each io_uring syscall, prefixed with callback and state variables that are used when handling the result.

Instead of the syncronous connect() call below

fd = connect(sock, &sa, sizeof sa);

The asyncronous reactor_connect() version is

id = reactor_connect(callback, state, sock, &sa, sizeof sa);

id is an identifier for the asyncronous syscall and can be used to abort the operation.

The same pattern is used for all io_uring syscalls. For a a playful example look at shufflecat that asyncronously opens N files, and concurrently streams from all files to stdout, one byte at the time. All operations are non-blocking and concurrent in the same thread.

Signals

Polled signals sent between actors or threads, based on the eventfd interface.

Example

Spawns a separate thread and waits for a signal.

void callback(reactor_event_t *event)
{
  signal_t *signal = event->state;

  printf("signal received\n");
  signal_close(signal);
}

void producer(reactor_event_t *event)
{
  signal_t *signal = event->state;

  if (event->type == REACTOR_CALL)
    signal_send_sync(signal);
}

int main()
{
  signal_t signal;

  reactor_construct();
  signal_construct(&signal, callback, &signal);
  signal_open(&signal);
  reactor_async(producer, &signal);
  reactor_loop();
  signal_destruct(&signal);
  reactor_destruct();
}

Timers

Timers triggers alerts at a set point in time and optionally continuously with a delay.

Example

Trigger an alert a second (1000000000ns) from now.

void callback(__attribute__((unused)) reactor_event_t *event)
{
  printf("timeout\n");
}

int main()
{
  timeout_t t;

  reactor_construct();
  timeout_construct(&t, callback, &t);
  timeout_set(&t, reactor_now() + 1000000000, 0);
  reactor_loop();
  reactor_destruct();
}

Notify

Monitoring filesystem events based on inotify.

Example

Displays the first event for each file given as argument to the program.

void callback(reactor_event_t *event)
{
  notify_t *notify = event->state;
  notify_event_t *e = (notify_event_t *) event->data;

  fprintf(stdout, "%d:%04x:%s:%s\n", e->id, e->mask, e->path, e->name);
  notify_remove_path(notify, e->path);
}

int main(int argc, char **argv)
{
  notify_t notify;
  int i, e;

  reactor_construct();
  notify_construct(&notify, callback, &notify);
  notify_open(&notify);
  for (i = 1; i < argc; i++)
  {
    e = notify_add(&notify, argv[i], IN_ALL_EVENTS);
    if (e == -1)
      err(1, "notify_add");
  }
  reactor_loop();
  notify_destruct(&notify);
  reactor_destruct();
}

Streams

Buffered streams with read/write/flush semantics on top of file descriptors and sockets.

Example

Stream from stdin.

static void input(reactor_event_t *event)
{
  stream_t *stream = event->state;
  data_t data;

  switch (event->type)
  {
  case STREAM_READ:
    data = stream_read(stream);
    stream_consume(stream, data_size(data));
    fwrite(data_base(data), data_size(data), 1, stdout);
    break;
  case STREAM_CLOSE:
    stream_destruct(stream);
    break;
  }
}

int main()
{
  stream_t s;

  reactor_construct();
  stream_construct(&s, input, &s);
  stream_open(&s, 0, 0);
  reactor_loop();
  reactor_destruct();
}

Servers

Simple network server routines.

Example

Create 100 concurrent TCP servers on ports 10000 to 10099.

 void callback(reactor_event_t *event)
{
  int fd;

  if (event->type == NETWORK_ACCEPT)
  {
    fd = event->data;
    printf("connection on fd %d\n", fd);
    reactor_close(NULL, NULL, fd);
  }
}

int main(int argc, char **argv)
{
  int i;

  reactor_construct();
  for (i = 10000; i < 10100; i ++)
    network_accept(callback, NULL, NULL, i, NETWORK_REUSEADDR);
  reactor_loop();
  reactor_destruct();
}

Clients

Simple network client routines.

Example

Scan 50 most common TCP ports (For a slightly more advanced version checkout scan.c).

const int top50[] = {
  21,22,23,25,26,53,80,81,110,111,113,135,139,143,179,199,443,445,465,514,515,
  548,554,587,646,993,995,1025,1026,1027,1433,1720,1723,2000,2001,3306,3389,
  5060,5666,5900,6001,8000,8008,8080,8443,8888,10000,32768,49152,49154
};

static void callback(reactor_event_t *event)
{
  if (event->type == NETWORK_CONNECT)
    printf(" %d", *(int *) event->state);
}

int main(int argc, char **argv)
{
  int i, j;

  reactor_construct();
  for (i = 1; i < argc; i ++)
  {
    printf("[%s]", argv[i]);
    for (j = 0; j < sizeof top50 / sizeof *top50; j++)
      network_connect(callback, (void *) &top50[j], argv[i], top50[j]);
    reactor_loop();
    printf("\n");
  }
  reactor_destruct();
}

Resolvers

Network address and service translation. The resolver will cache results for 10 seconds (10^10ns), configurable with network_expire(). Identical requests will be consolidated and queued to avoid thundering herd issues.

Example

Lookup the IP address for www.google.com.

void callback(reactor_event_t *event)
{
  struct addrinfo *ai = (struct addrinfo *) event->data;
  char host[NI_MAXHOST];

  while (ai)
    {
      getnameinfo(ai->ai_addr, ai->ai_addrlen, host, sizeof host, NULL, 0, NI_NUMERICHOST);
      printf("%s\n", host);
      ai = ai->ai_next;
    }
}

int main(int argc, char **argv)
{
  reactor_construct();
  network_resolve(callback, NULL, "www.google.com");
  reactor_loop();
  reactor_destruct();
}

HTTP

Server framework for creating webservers and API services.

Example

Simple HTTP server.

static void handle_request(reactor_event_t *event)
{
  server_plain((server_session_t *) event->data, string("hello"), NULL, 0);
}

int main()
{
  server_t server;

  reactor_construct();
  server_construct(&server, handle_request, NULL);
  server_open(&server, "127.0.0.1", 80);
  reactor_loop();
  reactor_destruct();
}

Queues

Message queues suitable for communication between different reactor threads.

Example

Single-producer single-consumer example.

static void sender(reactor_event_t *event)
{
  queue_t *queue = event->state;
  queue_producer_t producer;
  int i;

  if (event->type == REACTOR_RETURN)
    return;

  reactor_construct();
  queue_producer_construct(&producer);
  queue_producer_open(&producer, queue);
  for (i = 10; i >= 0; i--)
    queue_producer_push(&producer, (int []){i});
  reactor_loop();
  queue_producer_destruct(&producer);
  reactor_destruct();
}

static void receiver_event(reactor_event_t *event)
{
  int *element = (int *) event->data;

  printf("%d\n", *element);
  if (*element == 0)
    queue_consumer_close(event->state);
}

static void receiver(reactor_event_t *event)
{
  queue_t *queue = event->state;
  queue_consumer_t consumer;

  if (event->type == REACTOR_RETURN)
    return;

  reactor_construct();
  queue_consumer_construct(&consumer, receiver_event, &consumer);
  queue_consumer_open(&consumer, queue, 1);
  queue_consumer_pop(&consumer);
  reactor_loop();
  queue_consumer_destruct(&consumer);
  reactor_destruct();
}

int main()
{
  queue_t queue;

  reactor_construct();
  queue_construct(&queue, sizeof(int));
  reactor_async(sender, &queue);
  reactor_async(receiver, &queue);
  reactor_loop();
  queue_destruct(&queue);
  reactor_destruct();
}