nimsync

Lock-free SPSC and MPSC channels for Nim with production-grade performance validation

nimsync v1.1.0 is production-ready for SPSC and MPSC channels, with a benchmark suite modeled on the methodologies used by Tokio, Go, and LMAX Disruptor. Performance: 558M ops/sec SPSC micro-benchmark (~35M realistic), 15M ops/sec MPSC (2 producers), 31ns SPSC P99 latency. All numbers are verified in CI.

Features

High throughput: 558M ops/sec SPSC micro (~35M realistic), 15M ops/sec MPSC (2 producers), 512K ops/sec async - See benchmarks
Production-validated: Comprehensive benchmark suite (throughput, latency, burst, stress, sustained)
Industry-standard testing: Following Tokio, Go, Rust Criterion, LMAX Disruptor methodologies
SPSC and MPSC modes: Single-producer or multi-producer with single consumer
Lock-free ring buffer with atomic operations
Zero GC pressure with ORC memory management
Cache-line aligned (64 bytes) to prevent false sharing
Wait-free MPSC algorithm (based on dbittman + JCTools patterns)
Power-of-2 sizing for efficient operations
Non-blocking trySend/tryReceive
Async send/recv wrappers for Chronos
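To make the lock-free, cache-line-aligned, power-of-2 items above more concrete, here is a minimal single-producer/single-consumer ring sketch built only on std/atomics. It is not nimsync's implementation: IntRing, tryPush, tryPop, and the fixed capacity of 8 are hypothetical names and values chosen for illustration.

```nim
import std/atomics

const
  Capacity = 8           # hypothetical size; must be a power of 2
  Mask = Capacity - 1    # lets "position mod Capacity" become a bit mask

type
  IntRing = object
    # Each position sits on its own 64-byte boundary so the producer
    # and the consumer do not write to the same cache line.
    head {.align(64).}: Atomic[int]   # next position to read (consumer side)
    tail {.align(64).}: Atomic[int]   # next position to write (producer side)
    slots: array[Capacity, int]

proc tryPush(r: var IntRing, value: int): bool =
  ## Producer side: returns false when the ring is full.
  let t = r.tail.load(moRelaxed)
  if t - r.head.load(moAcquire) == Capacity:
    return false
  r.slots[t and Mask] = value
  r.tail.store(t + 1, moRelease)   # publish the slot to the consumer
  true

proc tryPop(r: var IntRing, value: var int): bool =
  ## Consumer side: returns false when the ring is empty.
  let h = r.head.load(moRelaxed)
  if h == r.tail.load(moAcquire):
    return false
  value = r.slots[h and Mask]
  r.head.store(h + 1, moRelease)   # hand the slot back to the producer
  true

var ring: IntRing
for i in 0 ..< Capacity:
  doAssert ring.tryPush(i * 10)
doAssert not ring.tryPush(999)     # the ring is full at this point

var item: int
while ring.tryPop(item):
  echo item
```

The demo runs on one thread for brevity. The acquire/release pairing on head and tail is what would let one producer thread and one consumer thread use the ring without locks.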

Installation

Requirements

Nim 2.0.0+ (required)
Chronos 4.0.0+

Via Nimble

nimble install nimsync
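If you would rather declare nimsync as a project dependency than install it globally, a .nimble file can list it. This is a sketch for a hypothetical project file named myapp.nimble; the version bounds are taken from the requirements above, and the chronos line is only needed if your own code imports Chronos directly.

```nim
# myapp.nimble (hypothetical project file)
requires "nim >= 2.0.0"
requires "chronos >= 4.0.0"
requires "nimsync >= 1.1.0"
```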

From Source

git clone https://github.com/codenimja/nimsync.git
cd nimsync
nimble install

Quick Start

Basic Usage (SPSC)

import nimsync

# Create SPSC channel with 16 slots
let chan = newChannel[int](16, ChannelMode.SPSC)

# Non-blocking operations
if chan.trySend(42):
  echo "Sent successfully"

var value: int
if chan.tryReceive(value):
  echo "Received: ", value

Multi-Producer Usage (MPSC)

import nimsync
import std/threadpool  # provides spawn and sync

# Create MPSC channel for multiple producers
let chan = newChannel[int](1024, ChannelMode.MPSC)

# Multiple producer threads
proc producer(ch: Channel[int], id: int) =
  for i in 0..<1000:
    while not ch.trySend(id * 1000 + i):
      discard  # Spin until space available

# Start multiple producers
spawn producer(chan, 1)
spawn producer(chan, 2)
spawn producer(chan, 3)

# Single consumer
var count = 0
var value: int
while count < 3000:
  if chan.tryReceive(value):
    echo "Received: ", value
    count.inc

sync()

Async Operations

import nimsync
import chronos

proc producer(ch: Channel[int]) {.async.} =
  for i in 1..10:
    await ch.send(i)

proc consumer(ch: Channel[int]) {.async.} =
  for i in 1..10:
    let value = await ch.recv()
    echo "Received: ", value

proc main() {.async.} =
  let ch = newChannel[int](16, ChannelMode.SPSC)
  await allFutures([producer(ch), consumer(ch)])

waitFor main()

API Reference

Channel Creation

proc newChannel[T](size: int, mode: ChannelMode): Channel[T]

Creates a channel with specified size (rounded to next power of 2).

Modes:

ChannelMode.SPSC: Single Producer Single Consumer (fastest)
ChannelMode.MPSC: Multi-Producer Single Consumer (wait-free producers)
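The rounding rule can be previewed without the library. The sketch below uses nextPowerOfTwo from std/math; roundedCapacity is a hypothetical helper, not part of nimsync, and it only mirrors the documented behavior.

```nim
import std/math

# Hypothetical helper mirroring the documented "rounded to next power of 2" rule.
proc roundedCapacity(requested: int): int =
  nextPowerOfTwo(requested)

for requested in [10, 16, 1000]:
  echo requested, " -> ", roundedCapacity(requested)
# 10 -> 16
# 16 -> 16
# 1000 -> 1024
```

A power-of-2 capacity lets a ring buffer wrap positions with a bit mask (position and (capacity - 1)) instead of a modulo.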

Non-Blocking Operations

proc trySend[T](channel: Channel[T], value: T): bool
proc tryReceive[T](channel: Channel[T], value: var T): bool

Both return true on success. trySend returns false when the channel is full; tryReceive returns false when it is empty. Use these for maximum performance (sub-100ns operations).
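A short sketch of the return-value contract, using only the calls documented above: fill the channel until trySend reports full, then drain it until tryReceive reports empty. The slot count of 8 is arbitrary.

```nim
import nimsync

let chan = newChannel[int](8, ChannelMode.SPSC)

# Fill until trySend reports that the channel is full.
var sent = 0
while chan.trySend(sent):
  inc sent

# Drain until tryReceive reports that the channel is empty.
var item: int
var received = 0
while chan.tryReceive(item):
  inc received

echo "sent ", sent, ", received ", received
```

Every value that was accepted by trySend should come back out, so the two counters are expected to match.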

Async Operations

proc send[T](channel: Channel[T], value: T): Future[void] {.async.}
proc recv[T](channel: Channel[T]): Future[T] {.async.}

Async wrappers using Chronos. Note: they poll internally with exponential backoff, starting at 1ms and capping at 100ms (see Limitations).
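The Limitations section describes the polling as exponential backoff that starts at 1ms and backs off to 100ms. The sketch below prints one possible delay schedule. The doubling factor and the nextDelayMs helper are assumptions for illustration; the actual growth factor is not documented here.

```nim
# Hypothetical backoff step: double the delay, capped at maxMs.
proc nextDelayMs(current: int, maxMs = 100): int =
  min(current * 2, maxMs)

var delay = 1            # documented starting delay, in milliseconds
var schedule: seq[int]
for _ in 0 ..< 9:
  schedule.add delay
  delay = nextDelayMs(delay)

echo schedule
# @[1, 2, 4, 8, 16, 32, 64, 100, 100]
```

Under this assumption, a receiver that finds the channel empty several times in a row can wait up to 100ms before noticing a new item, which is why trySend/tryReceive are recommended for latency-sensitive paths.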

Utilities

proc isEmpty[T](channel: Channel[T]): bool
proc isFull[T](channel: Channel[T]): bool
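A minimal sketch of how these checks might be combined with trySend, assuming isEmpty and isFull report the state their names suggest. The slot count of 4 is arbitrary.

```nim
import nimsync

let chan = newChannel[int](4, ChannelMode.SPSC)
echo "empty at start: ", chan.isEmpty

# Queue items until the channel reports full (or rejects a send).
var queued = 0
while not chan.isFull and chan.trySend(queued):
  inc queued

echo "queued ", queued, " items, full: ", chan.isFull
```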

Performance

SPSC vs MPSC (Verified Benchmarks)

Mode	Producers	Micro-benchmark	Realistic Threaded	P99 Latency	Use Case
SPSC	1	558M ops/sec	~35M ops/sec	31ns	Maximum performance, single-producer pipeline
SPSC	1 (async)	512K ops/sec	—	—	Chronos integration
MPSC	2	15M ops/sec	~15M ops/sec	~64ns	Multi-threaded producers
MPSC	4	8.5M ops/sec	—	~117ns	High concurrency
MPSC	8	5.3M ops/sec	—	~256ns	Memory-bandwidth limited

Key insights:

SPSC is roughly 2.3× faster than MPSC in realistic threaded workloads (~35M vs ~15M ops/sec with 2 producers)
Micro-benchmarks show peak potential; realistic workloads include thread scheduling overhead
MPSC throughput is highest with 2 producers (15M ops/sec) and drops as producers are added (8.5M with 4, 5.3M with 8)
Use SPSC when you have a single producer; use MPSC when multiple threads must produce concurrently
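Throughput figures are easier to compare when converted to an average time per operation (1e9 divided by ops/sec). The sketch below does that for three rows of the table above; nsPerOp is a hypothetical helper, and the result is a mean cost, not a latency percentile.

```nim
import std/strformat

# Average nanoseconds per operation for a given throughput.
proc nsPerOp(opsPerSec: float): float =
  1e9 / opsPerSec

for (label, rate) in [("SPSC micro", 558e6),
                      ("SPSC realistic", 35e6),
                      ("MPSC, 2 producers", 15e6)]:
  echo fmt"{label}: {nsPerOp(rate):.2f} ns/op"
# SPSC micro: 1.79 ns/op
# SPSC realistic: 28.57 ns/op
# MPSC, 2 producers: 66.67 ns/op
```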

Methodology:

All benchmarks run in CI on 4-core Intel/AMD systems
Micro-benchmarks: tight loops, zero external overhead
Realistic threaded: actual thread spawning, OS scheduling, cache effects
Compiled with -d:danger --opt:speed --mm:orc
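For a sense of what a tight-loop micro-benchmark looks like, here is a rough single-threaded sketch that times paired trySend/tryReceive calls with std/monotimes. It is not the repository's benchmark code: the iteration count and channel size are arbitrary assumptions, and the result will vary with CPU and compiler flags.

```nim
import std/[monotimes, times]
import nimsync

const Ops = 1_000_000   # arbitrary iteration count for this sketch

let chan = newChannel[int](1024, ChannelMode.SPSC)
var item: int
var completed = 0

let start = getMonoTime()
for i in 0 ..< Ops:
  # One send followed by one receive; the channel never fills up.
  if chan.trySend(i) and chan.tryReceive(item):
    inc completed
let elapsed = getMonoTime() - start

let seconds = elapsed.inNanoseconds.float / 1e9
echo "completed: ", completed
echo "ops/sec:   ", completed.float / seconds
```

Compile it with the same flags listed above (-d:danger --opt:speed --mm:orc) before comparing against the published numbers.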

Comprehensive Benchmark Suite

nimsync includes 7 SPSC benchmarks + 1 MPSC suite following industry best practices:

Benchmark	Metric	SPSC Result	Methodology
Throughput	Peak ops/sec	558M	Go channels
Latency	p50/p99/p99.9	20ns/31ns/50ns	Tokio/Cassandra
Burst Load	Stability	385M ops/sec, 18% variance	Redis
Buffer Sizing	Optimal size	4096 slots, 557M ops/sec	LMAX Disruptor
Stress Test	Contention	0% at 500K ops	JMeter/Gatling
Sustained	Long-duration	Stable over 10s	Cassandra/ScyllaDB
Async	Overhead	512K ops/sec	Standard async
MPSC	2/4/8 producers	15M/8.5M/5.3M	Multi-threaded validation
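The latency row reports p50/p99/p99.9. As a reminder of how such percentiles are read off a set of samples, here is a self-contained nearest-rank sketch. The percentile helper and the synthetic 1..1000 ns samples are illustrative assumptions, not the suite's actual code or data.

```nim
import std/algorithm

proc percentile(samples: seq[int], permille: int): int =
  ## Nearest-rank percentile; permille is the percentile times 10 (p99.9 -> 999).
  doAssert samples.len > 0 and permille in 1 .. 1000
  let ordered = samples.sorted()
  let rank = (permille * ordered.len + 999) div 1000   # ceiling division
  ordered[rank - 1]

# Synthetic latencies of 1..1000 ns, deliberately added in reverse order.
var latenciesNs: seq[int]
for ns in countdown(1000, 1):
  latenciesNs.add ns

echo "p50:   ", percentile(latenciesNs, 500), " ns"   # 500 ns
echo "p99:   ", percentile(latenciesNs, 990), " ns"   # 990 ns
echo "p99.9: ", percentile(latenciesNs, 999), " ns"   # 999 ns
```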

Quick Run

# Run complete SPSC suite (~18 seconds)
./tests/performance/run_all_benchmarks.sh

# Run MPSC benchmark
nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_mpsc.nim
./tests/performance/benchmark_mpsc

Full documentation: tests/performance/README.md

Third-Party Verification

Want to verify these claims yourself?

Reproduction Guide: See BENCHMARKS.md and tests/performance/README.md
CI Benchmarks: Automatic benchmarks on every commit → GitHub Actions
Expected Range: 20M-600M ops/sec for SPSC, depending on CPU, benchmark type, and system load

Limitations

Single Consumer Only - All modes require a single consumer
- SPSC: ONE sender, ONE receiver (fastest)
- MPSC: MULTIPLE senders, ONE receiver (wait-free)
- SPMC/MPMC not implemented
No close() - Channels don't have a close operation
- Use sentinel values for shutdown signaling
Power-of-2 sizing - Size rounded up
- newChannel[int](10, ChannelMode.SPSC) creates a 16-slot channel
Async polling - send/recv use exponential backoff polling
- Starts at 1ms, backs off to 100ms max
- Use trySend/tryReceive to avoid the polling delay
MPSC contention - Throughput is highest with 2 producers
- Total throughput falls as producers are added (15M, 8.5M, 5.3M ops/sec with 2, 4, 8 producers) due to contention
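Because channels have no close operation, shutdown has to be signaled in-band. The sketch below shows the sentinel approach mentioned above, using only trySend/tryReceive. The Shutdown constant and its value of -1 are assumptions for illustration; pick a value that can never be a real payload. It runs on one thread for brevity and assumes items are delivered in FIFO order.

```nim
import nimsync

const Shutdown = -1   # hypothetical sentinel; must never be a real payload

let chan = newChannel[int](16, ChannelMode.SPSC)

# Producer side: send the work items, then the sentinel.
for job in [10, 20, 30]:
  doAssert chan.trySend(job)
doAssert chan.trySend(Shutdown)

# Consumer side: process items until the sentinel arrives.
var processed: seq[int]
var item: int
while true:
  if chan.tryReceive(item):
    if item == Shutdown:
      break
    processed.add item

echo processed
```

For payload types where no value can be reserved, a wrapper such as Option[T] or a small variant object is a common alternative, provided the channel accepts that element type.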

Development

Testing

nim c -r tests/unit/test_channel.nim              # Basic SPSC tests
nim c -r tests/unit/channels/test_mpsc_channel.nim  # MPSC tests
nim c -r tests/unit/test_basic.nim                # Version check

Benchmarking

nimble bench  # Run all benchmarks

Code Quality

nimble fmt   # Format code
nimble lint  # Static analysis
nimble ci    # Full CI checks

Internal/Experimental Code

This repository contains experimental implementations of:

TaskGroups (structured concurrency)
Actors (with supervision)
Streams (backpressure-aware)
Work-stealing scheduler
NUMA optimizations

These are NOT production-ready and not exported in the public API. They exist as research code for future releases. See internal modules in src/nimsync/ if interested.

Roadmap

✅ v1.0.0: Production SPSC channels (DONE!)
✅ v1.1.0: MPSC channels (DONE!)
v1.2.0: TaskGroup fixes + Production-ready Streams
v2.0.0: Full async runtime with actors

Known Issues

See GitHub Issues for experimental features and known limitations:

Async wrappers use polling - exponential backoff (1ms-100ms); use trySend/tryReceive to avoid the polling delay
TaskGroup has bugs - nested async macros fail (not exported) - See issue template
NUMA untested - cross-socket performance unknown - See issue template

These are known limitations, not intended behavior. Contributions to fix them are welcome!

Contributing

Contributions welcome! Priority areas:

Fix TaskGroup nested async bug - Details
Validate NUMA performance - Details
Cross-platform support (macOS/Windows)
SPMC/MPMC channel implementations

See issue templates for detailed specifications and acceptance criteria.

License

MIT License - see LICENSE for details.

Status: Production-ready SPSC and MPSC channels with comprehensive validation. Other features (TaskGroup, actors, streams) are experimental - see GitHub Issues for contributor opportunities.

Disclaimer

nimsync v1.1.0 is production-ready for SPSC and MPSC channels.

✅ SPSC channels verified - 558M ops/sec micro (~35M realistic), 31ns P99 latency, 7-benchmark suite validation
✅ MPSC channels verified - 15M ops/sec (2 producers), wait-free algorithm, comprehensive stress testing
⚠️ Experimental features - TaskGroup, actors, streams not yet production-ready (help wanted)

We document performance honestly. We benchmark rigorously. We're transparent about limitations.

Open source async runtime built with Nim. Contributions welcome - see issues for high-impact areas.