This Docker image lets you run Hadoop jobs with the MapReduce framework (MRv1, the "old" MapReduce), based on the Cloudera CDH5 distribution.
docker pull genepi/cdh5-hadoop-mrv1:latest
docker run -it -h cloudgene -p 50030:50030 -e EXEC_BASH="true" -e DOCKER_CORES="4" genepi/cdh5-hadoop-mrv1:latest start-hadoop
sh /usr/bin/execute-wordcount.sh
http://cloudgene:50030
The new version of Cloudgene can connect to a remote Hadoop cluster. To connect it to a Hadoop Docker container, add the container's hosts entry (e.g. 172.17.0.2 cloudgene) to your local hosts file, then add the hostname to your Cloudgene settings.yaml file (e.g. hostname: cloudgene).
cat /etc/hosts
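The hosts-file step can be sketched as a small shell helper. This is only an illustration: `add_hosts_entry` is a hypothetical function name (not part of Cloudgene or this image), and the IP address and hostname are the example values from the text above, so replace them with the entry your container actually reports. The demo writes to a scratch file so your real hosts file is left untouched.

```shell
#!/bin/sh
# Sketch: append "<ip> <hostname>" to a hosts file unless the hostname is already mapped.
# add_hosts_entry is a hypothetical helper, not something shipped with Cloudgene.
add_hosts_entry() {
    ip="$1"
    name="$2"
    hosts_file="${3:-/etc/hosts}"

    # Skip if a non-comment line already lists this hostname as a whole field.
    if awk -v n="$name" '
        !/^[ \t]*#/ { for (i = 2; i <= NF; i++) if ($i == n) found = 1 }
        END { exit !found }
    ' "$hosts_file"; then
        echo "entry for $name already present in $hosts_file"
        return 0
    fi

    printf '%s %s\n' "$ip" "$name" >> "$hosts_file"
}

# Demonstration on a scratch file (example values from the text above).
demo_hosts="$(mktemp)"
add_hosts_entry 172.17.0.2 cloudgene "$demo_hosts"
add_hosts_entry 172.17.0.2 cloudgene "$demo_hosts"   # second call is a no-op
cat "$demo_hosts"
```

To apply the same helper to the real file, pass /etc/hosts as the third argument and run it with sufficient privileges (typically as root).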