Run Hadoop 3.1.2 (with Hive 2.3.4) on Ubuntu 16.04 inside a Docker container in pseudo-distributed mode
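- Go to your terminal.
- Navigate to the directory containing the `Dockerfile` and build the image:
  `docker build -t <image_name> .`
- Or pull the prebuilt image instead:
  `docker pull macio232/hadoop-pseudo-distributed-mode`
- Start a container:
  `docker run -p 9870:9870 -p 8088:8088 -v <host-directory>:/home/hadoop/data -it --name=<container_name> <image_name>`
  The `ENTRYPOINT` runs the Hadoop startup script and then opens a bash shell. The `-p` flags publish the NameNode web UI (9870) and the YARN ResourceManager web UI (8088) on the host, and `-v` mounts `<host-directory>` at `/home/hadoop/data` inside the container.
- To use Hive, run `nohup hiveserver2 &` and press `ENTER`. This starts the Hive server. To connect with the Beeline client, run:
  `beeline -n root -u jdbc:hive2://localhost:10000`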
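HiveServer2 usually needs some time after launch before it accepts connections, so running `beeline` immediately can fail with a connection error. The sketch below shows one way to wait for the port first. `wait_for_port` is a hypothetical helper, not something shipped in the image, and it assumes a bash build with `/dev/tcp` support (the stock Ubuntu bash has it).

```shell
#!/usr/bin/env bash
# wait_for_port is a hypothetical helper, not something shipped in the image.
# It relies on bash's built-in /dev/tcp redirection, so it needs bash, not sh.

# Usage: wait_for_port HOST PORT [TRIES]
# Returns 0 once HOST:PORT accepts a TCP connection, 1 after TRIES failed attempts.
wait_for_port() {
  local host=$1 port=$2 tries=${3:-30} i=0
  while [ "$i" -lt "$tries" ]; do
    # Open (and immediately drop) a connection in a subshell.
    if (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null; then
      return 0
    fi
    i=$((i + 1))
    sleep 1
  done
  return 1
}

# Example, run inside the container after `nohup hiveserver2 &`:
#   wait_for_port localhost 10000 60 &&
#     beeline -n root -u jdbc:hive2://localhost:10000 -e 'SHOW DATABASES;'
```

The same helper can be pointed at ports 9870 and 8088 to check that the Hadoop web UIs are up. The retry count of 60 in the example is an arbitrary choice, not a measured startup time.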
TODO:
- Run `stop-dfs.sh` and `stop-yarn.sh` at shutdown, as described here.
- Fix the `mesg: ttyname failed: Inappropriate ioctl for device` message that appears during benchmark execution.
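One possible way to run `stop-dfs.sh` and `stop-yarn.sh` at shutdown is a signal trap in the entrypoint script. This is only a sketch and not necessarily the approach the item above refers to. `graceful_shutdown` is a hypothetical name, and both stop scripts are assumed to be on `PATH` inside the image.

```shell
#!/usr/bin/env bash
# Sketch of a shutdown hook for the entrypoint script.
# graceful_shutdown is a hypothetical name; stop-yarn.sh and stop-dfs.sh are
# the stock Hadoop scripts and are assumed to be on PATH.

graceful_shutdown() {
  # Stopping YARN before HDFS is a choice of this sketch, so that no job
  # is still writing to HDFS when it goes down.
  stop-yarn.sh
  stop-dfs.sh
}

# `docker stop` sends SIGTERM to the container's main process;
# run the hook on that signal and on SIGINT.
trap graceful_shutdown TERM INT
```

Bash runs a trap only after the current foreground command returns, so if the entrypoint is blocked in a long-running child process, the hook is delayed until that process exits. `docker stop` follows up with SIGKILL once its grace period expires (10 seconds by default, adjustable with `docker stop -t`), so the grace period may need to be raised for the stop scripts to finish.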
- Apache Hadoop 3.1.2 docs
- core-default.xml
- hdfs-default.xml
- mapred-default.xml
- yarn-default.xml
- DeprecatedProperties.html
The original repository for this image is at https://github.com/mjaglan/docker-hadoop-pseudo-distributed-mode.