/docker-druid

Docker image for Druid

Primary LanguageShellApache License 2.0Apache-2.0

Docker Druid

Docker Build Status Docker Image Stats Docker Pulls Tags:

What is Druid?

Druid is an open-source analytics data store designed for business intelligence (OLAP) queries on event data. Druid provides low latency (real-time) data ingestion, flexible data exploration, and fast data aggregation. Existing Druid deployments have scaled to trillions of events and petabytes of data. Druid is most commonly used to power user-facing analytic applications.

How to use?

Druid being a complex system, the best way to get up and running with a cluster is to use the docker-compose file provided.

Clone our public repository:

git clone git@github.com:rusnyder/docker-druid.git

and run :

docker-compose up

The compose file is going to launch :

and the following druid services :

  • 1 broker
  • 1 overlord
  • 1 middlemanager
  • 1 historical

as well as the following UIs :

  • Swiv
  • Metabase
  • Superset

The image contains the full druid distribution and use the default druid cli. If no command is provided the image will run as a broker.

If you plan to use this image on your local machine, be carefull with the JVM heap spaces required by default (some services are launched with 15gb heap space).

The docker-compose file is setup to run on a macbook.

Documentation

Work in progress

Configuration

Available environment options:

  • DRUID_XMX '-'
  • DRUID_XMS '-'
  • DRUID_NEWSIZE '-'
  • DRUID_MAXNEWSIZE '-'
  • DRUID_HOSTNAME '-'
  • DRUID_PORT '-'
  • DRUID_ZK_HOST '-'

Authors