/docker-zeppelin

Docker image for starting Apache Zeppelin

Primary LanguageShell

Docker Zeppelin

Build Status Docker Pulls Image Size

Description

Docker image for starting Apache Zeppelin.

Usage

You can either start the image directly with Docker, or use the Nomad-Docker-Wrapper if you are running your containers on Nomad.

docker run -p 8080:8080 \
  -e ZEPPELIN_SPARK_MASTER="local[*]" \
  -e ZEPPELIN_PASSWORD="secret" \
  -e ZEPPELIN_NOTEBOOK_STORAGE=org.apache.zeppelin.notebook.repo.VFSNotebookRepo \
  -e ZEPPELIN_PROCESS_USER_NAME="zeppelinu" \
  -e ZEPPELIN_PROCESS_USER_ID=12345 \
  -e ZEPPELIN_PROCESS_GROUP_NAME="zeppeling" \
  -e ZEPPELIN_PROCESS_GROUP_ID=12340 \
  -e ZEPPELIN_SERVER_PORT=8080 \
  -e ZEPPELIN_SPARK_DRIVER_MEMORY="512M" \
  -e ZEPPELIN_SPARK_UI_PORT=4040 \
  -e ZEPPELIN_PYSPARK_PYTHON=/usr/bin/python \
  -e ZEPPELIN_MEM="-Xms1024m -Xmx1024m -XX:MaxPermSize=512m" \
  -v $(pwd)/notebooks:/usr/local/zeppelin/notebooks \
  -v $(pwd)/conf:/usr/local/zeppelin/conf \
  -v $(pwd)/hive:/hive \
  frosner/zeppelind:latest-zv0.6.2-s2.0.2-h2.7

Configuration

The docker image requires some environment variables to be set. They are used to configure your Zeppelin.

Variable Description
ZEPPELIN_SPARK_MASTER URL of the Spark master that Zeppelin should use.
ZEPPELIN_PASSWORD Password to use for authenticating as zeppelin user on the UI.
ZEPPELIN_NOTEBOOK_STORAGE Notebook storage to use.
ZEPPELIN_PROCESS_USER_NAME User name to execute the Zeppelin process as.
ZEPPELIN_PROCESS_USER_ID User ID to execute the Zeppelin process as.
ZEPPELIN_PROCESS_GROUP_NAME Group name to assign to the Zeppelin user.
ZEPPELIN_PROCESS_GROUP_ID Group ID to assign to the Zeppelin user.
ZEPPELIN_SERVER_PORT Port to bind the Zeppelin server to.
ZEPPELIN_SPARK_UI_PORT Port to use for the Spark UI.
ZEPPELIN_SPARK_DRIVER_MEMORY Amount of memory to allocate to the Spark driver process (e.g. 512M).
ZEPPELIN_PYSPARK_PYTHON Path to python executable for the Spark worker nodes.
ZEPPELIN_MEM Zeppelin JVM Options