/flod

Stream processing pipelines in D

Primary LanguageDBoost Software License 1.0BSL-1.0

Build Status

flod

flod is a D library implementing the pipeline pattern with compile-time pipeline composition.

Goals

  • UFCS-based filter chaining syntax, similar to how ranges are composed. Also includes converting between pipelines and ranges where it makes sense.

    read("file.gz").inflate(Format.gzip).byLine
        .map!(a => a.stripRight).join("\n")
        .deflate(Format.gzip).write("file-without-trailing-whitespaces.gz");
  • No single interface for passing data is imposed on pipe implementors. Push or pull, with your own buffer or requiring an external buffer, it shouldn't matter.

  • No gratuitous indirections – a pipeline is a single struct instance, composed of several struct template instances, allowing the compiler to show its inlining capabilities.

  • Deterministic resource management – stages are constructed in place, and destroyed in exactly the reverse order. Just make a stage struct non-copyable and release all resources (file handles, C library contexts, memory-mapped hardware, etc.) in the destructor and you're safe.

  • Support for out-of-band data, such as file names or information about the stream encoding, e.g. image size and color depth, or audio stream sample rate.

  • Liberal licensing – the library should be free to use, modify, extend, distribute and embed in proprietary and/or commercial products without restrictions.

Progress

  • Can connect stages with incompatible interfaces (buffered vs. unbuffered, push vs. pull).
  • Can read from input ranges and built-in arrays.
  • Can write to output ranges.
  • Input range interface to read from a pipeline by element.
  • Input range interface to read from a pipeline by chunk.
  • Input range interface to read from a pipeline by line.
  • Output range interface (put to pipeline).
  • Reading from and writing to files.
  • Metadata support.
  • Hints for best chunk size.
  • Stream sequences (e.g. archives of multiple files).
  • Multiplexing and demultiplexing.
  • Parallelization.
  • Dynamic (run-time polymorphic) stage interface.
  • Stable API.

External packages

The flod package provides only basic building blocks. Pipes doing anything more interesting than just passing unchanged data around will be implemented in separate packages, either natively in D or as wrappers over libraries written in other languages, mainly C. Some of them are already under development:

  • flod-curl – download files from the web using libcurl.
  • flod-zlib – inflate and deflate streams of data.
  • flod-mad – decode MPEG-compressed audio to raw PCM streams.
  • flod-alsa – replay PCM streams using default audio output.

Examples

To see what can be done with flod, see the examples found in this repo.

Similar libraries

There are several libraries in D and other languages which share some goals with flod.

  • iopipe – Steven Schveighoffer's D library for modular IO.
  • IO Streams – Jason White's IO stream library for D.
  • Boost.Iostreams – composable streams in C++.
  • GStreamer – C library for constructing graphs of media-handling components.
  • RaftLib – Jonathan Beard's C++ library for Stream Parallel Processing.

Release history

  • v0.1.0 (2016-05-xx)

    • Input range interface to read from pipeline by element, by line or by chunk.
    • Output range interface to write to pipeline.
    • Support for passing metadata down the pipeline.
    • Stages are now allowed to implement multiple methods (sources, sinks) or method pairs (filters). The actual method is chosen based on the methods supported by other stages.
  • v0.0.2 (2016-03-15)

    • Can use built-in arrays and input ranges as pipeline sources.
    • Can use output ranges as pipeline sinks.
  • v0.0.1 (2016-03-13)

    • First release – can connect pipes with any interfaces (even incompatible).

License

flod is licensed under Boost Software License v1.0, see the file COPYING for details. External packages may use different licenses, so be careful when using them, especially when combining pipes distributed under different licenses.