Run the CDH Quick Start image in a docker container.
The latest CDH release is on master
and pinned versions can be found on
dedicated branches.
This image is somewhat large, so you might want to do a separate
docker pull aaronbedward/cdh
to install.
Spin it up with
docker run -td aaronbedward/cdh
get the container id from
docker ps
watch logs with
docker logs -f <container_id>
attach to the container with
docker exec -it <container_id> bash -l
then run any of the usual cdh commands
hadoop fs -ls /
impala-shell
hbase shell
spark-submit
etc.
- Apache Hadoop (Common, HDFS, MapReduce, YARN)
- Apache HBase
- Apache ZooKeeper
- Apache Oozie
- Apache Hive
- Hue (Apache licensed)
- Cloudera Impala (Apache licensed)
- Apache Sentry
- Apache Sqoop
- Cloudera Search (Apache licensed)
- Apache Spark
Pull the image on Docker Hub: https://registry.hub.docker.com/u/svds/cdh
Github page: https://github.com/silicon-valley-data-science/docker-cdh
This image started life as a modified fork of caioquirino/docker-cloudera-quickstart and is now curated by the Silicon Valley Data Science engineering team.