Kudu is an open source storage engine for structured data which supports low-latency random access together with effi- cient analytical access patterns. Kudu distributes data using horizontal partitioning and replicates each partition using Raft consensus, providing low mean-time-to-recovery and low tail latencies. Kudu is designed within the context of the Hadoop ecosystem and supports many modes of access via tools such as Cloudera Impala, Apache Spark, and MapReduce.
docker-compose up -d
docker run --rm -it --link kududocker_kudu-tserver_1:kudu_tserver -e KUDU_TSERVER=kudu_tserver nemccarthy/docker-kudu cli status
docker pull nemccarthy/docker-kudu
docker build -t nemccarthy/docker-kudu .
docker create --name kudu-master-data -v /data/kudu-master nemccarthy/docker-kudu
docker create --name kudu-tserver-data -v /data/kudu-tserver nemccarthy/docker-kudu
docker run -d --name kudu-master -p 8051:8051 nemccarthy/docker-kudu master
docker run -d --name kudu-tserver -p 8050:8050 --link kudu-master \
-e KUDU_MASTER=kudu-master kudu tserver
docker logs -f kudu-master
docker logs -f kudu-tserver
docker run --rm -it --link kudu_tserver -e KUDU_TSERVER=kudu_tserver nemccarthy/docker-kudu cli status
Each component provide its own web UI. Open you browser at one of the URLs below, where dockerhost
is the name / IP of the host running the docker daemon. If using Linux, this is the IP of your linux box. If using OS X or Windows (via Docker-Machine), you can find out your docker host by typing docker-machine ip default
.
Component | Port |
---|---|
Master | http://dockerhost:8051 |
TabletServer | http://dockerhost:8050 |
- This docker image (and README) inspired by https://github.com/bigdatafoundation/docker-kudu
- https://github.com/cloudera/kudu-examples/wiki/Docker-based-tutorial