/winton-kafka-streams

A Python implementation of Apache Kafka Streams

Primary LanguagePythonApache License 2.0Apache-2.0

Winton Kafka Streams

Build Status

Implementation of Apache Kafka's Streams API in Python.

What and why?

Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation written in Scala and Java. Kafka has Streams API added for building stream processing applications using Apache Kafka. Applications built with Kafka's Streams API do not require any setup beyond the provision of a Kafka cluster.

Winton Kafka Streams is a Python implementation of Apache Kafka's Streams API. It builds on Confluent's librdkafka (a high performance C library implementing the Kafka protocol) and the Confluent Python Kafka library to achieve this.

The power and simplicity of both Python and Kafka's Streams API combined opens the streaming model to many more people and applications.

Getting started

Installing

The minimum Python version is currently 3.6 and a working Kafka cluster (a single replica is sufficient for testing) are required.

Confluent Python Kafka is also required and it should install as a dependency by pip. If it fails during installation, then we recommend installing librdkafka with HomeBrew and setting CFLAGS=-I/usr/local/include and LDFLAGS=-L/usr/local/lib is recommended.

Cloning the Winton Kafka Streams repository from GitHub is recommended if you want to contribute to the project. Use: pip install --editable <path/to/winton_kafka_streams>[develop] to install as an editable workspace with additional dependencies required for development.

If you want to install the code and get a feel for it as a user then we recommend using pip install which will pull the package from PyPI or your local mirror.

Running tests

Tests will run when py.test is called in the root of the repository.

Running examples

To run examples, you must have cloned the code locally from GitHub.

The debug and wordcount examples will run without further additional requirements.

The Jupyter notebook in the binning example requires some additional packages. Install these with the command:

pip install <path/to/winton_kafka_streams>[binning_example]

Contributing

Please see the CONTRIBUTING.md document for more details on getting involved.

Contact