Nanolog is an extremely performant nanosecond scale logging system for C++ that exposes a simple printf-like API and achieves over 80 million logs/second at a median latency of just over 7 nanoseconds.
How it achieves this insane performance is by extracting static log information at compile-time, only logging the dynamic components in runtime hotpath, and deferring formatting to an offline process. This basically shifts work out of the runtime and into the compilation and post-execution phases.
More information about the techniques used in this logging system can be found in the NanoLog Paper published in the 2018 USENIX Annual Technical Conference.
This section shows the performance of NanoLog with existing logging systems such as spdlog v1.1.0, Log4j2 v2.8, Boost 1.55, glog v0.3.5, and Windows Event Tracing with Windows Software Trace Preprocessor on Windows 10 (WPP).
Maximum throughput measured with 1 million messages logged back to back with no delay and 1-16 logging threads (NanoLog logged 100 million messages to generate a log file of comparable size). ETW is "Event Tracing for Windows." The log messages used can be found in the Log Message Map below.
Measured in nanoseconds and each cell represents the 50th / 99.9th tail latencies. The log messages used can be found in the Log Message Map below.
Message | NanoLog | spdlog | Log4j2 | glog | Boost | ETW |
---|---|---|---|---|---|---|
staticString | 7/ 37 | 214/ 2546 | 174 / 3364 | 1198/ 5968 | 1764/ 3772 | 161/ 2967 |
stringConcat | 7/ 36 | 279/ 905 | 256 / 25087 | 1212/ 5881 | 1829/ 5548 | 191/ 3365 |
singleInteger | 7/ 32 | 268/ 855 | 180 / 9305 | 1242/ 5482 | 1914/ 5759 | 167/ 3007 |
twoIntegers | 8/ 62 | 437/ 1416 | 183 / 10896 | 1399/ 6100 | 2333/ 7235 | 177/ 3183 |
singleDouble | 8/ 43 | 585/ 1562 | 175 / 4351 | 1983/ 6957 | 2610/ 7079 | 165/ 3182 |
complexFormat | 8/ 40 | 1776/ 5267 | 202 / 18207 | 2569/ 8877 | 3334/ 11038 | 218/ 3426 |
Log messages used in the benchmarks above. Italics indicate dynamic log arguments.
Message ID | Log Message Used |
---|---|
staticString | Starting backup replica garbage collector thread |
singleInteger | Backup storage speeds (min): 181 MB/s read |
twoIntegers | buffer has consumed 1032024 bytes of extra storage, current allocation: 1016544 bytes |
singleDouble | Using tombstone ratio balancer with ratio = 0.4 |
complexFormat | Initialized InfUdDriver buffers: 50000 receive buffers (97 MB), 50 transmit buffers (0 MB), took 26.2 ms |
stringConcat | Opened session with coordinator at basic+udp:host=192.168.1.140,port=12246 |
NanoLog depends on the following:
- C++17 Compiler: GNU g++ 7.5.0 or newer
- GNU Make 4.0 or greater
- Python 3.4.2 or greater
- POSIX AIO and Threads (usually installed with Linux)
The NanoLog system enables low latency logging by deduplicating static log metadata and outputting the dynamic log data in a binary format. This means that log files produced by NanoLog are in binary and must be passed through a separate decompression program to produce the full, human readable ASCII log.
There are two versions of NanoLog (Preprocessor version and C++17 version) and you must chose one to use with your application as they’re not interoperable. The biggest difference between the two is that the Preprocessor version requires one to integrate a Python script in their build chain while the C++17 version is closer to a regular library (simply build and link against it). The benefit of using the Preprocessor version is that it performs more work at compile-time, resulting in a slightly more optimized runtime.
If you don’t know which one to use, go with C++17 NanoLog as it’s easier to use.
The C++17 version of NanoLog works like a traditional library; just #include "NanoLogCpp17.h"
and link against the NanoLog library. A sample application can be found in the sample directory.
To build the C++17 NanoLog Runtime library, go in the runtime directory and invoke make
. This will produce ./libNanoLog.a
to against link your application and a ./decompressor
application that can be used to re-inflate the binary logs.
When you compile your application, be sure to include the NanoLog header directory (-I ./runtime
), and link against NanoLog, pthreads, and POSIX AIO (-L ./runtime/ -lNanoLog -lrt -pthread
). Sample g++ invocations can be found in the sample GNUmakefile.
After you compile and run the application, the log file generated can then be passed to the ./decompressor
application to generate the full human-readable log file (instructions below).
The Preprocessor version of NanoLog requires a tighter integration with the user build chain and is only for advanced/extreme users.
It requires the user's GNUmakefile to include the NanoLogMakeFrag, declare USR_SRCS and USR_OBJS variables to list all app’s source and object files respectively, and use the pre-defined run-cxx
macro to compile ALL the user .cc files into .o files instead of g++
. See the preprocessor sample GNUmakefile for more details.
Internally, the run-cxx
invocation will run a Python script over the source files and generate library code that is specific to each compilation of the user application. In other words, the compilation builds a version of the NanoLog library that is non-portable, even between compilations of the same application and each make
invocation rebuilds this library.
Additionally, the compilation should also generate a ./decompressor
executable in the app directory and this can be used to reconstitute the full human-readable log file (instructions below).
To use the NanoLog system in the code, one just has to include the NanoLog header (either NanoLogCpp17.h for C++17 NanoLog or NanoLog.h for Preprocessor NanoLog) and invoke the NANO_LOG()
function in a similar fashion to printf, with the exception of a log level before it. Example below:
#include "NanoLogCpp17.h"
using namespace NanoLog::LogLevels;
int main()
{
NANO_LOG(NOTICE, "Hello World! This is an integer %d and a double %lf\r\n", 1, 2.0);
return 0;
}
Valid log levels are DEBUG, NOTICE, WARNING, and ERROR and the logging level can be set via NanoLog::setLogLevel(...)
The rest of the NanoLog API is documented in the NanoLog.h header file.
The execution of the user application should generate a compressed, binary log file (default locations: ./compressedLog or /tmp/logFile). To make the log file human-readable, simply invoke the decompressor
application with the log file.
./decompressor decompress ./compressedLog
After building the NanoLog library, the decompressor executable can be found in either the ./runtime directory (for C++17 NanoLog) or the user app directory (for Preprocessor NanoLog).