Hadoop-Quick-Start

Start

If you want to configure your environment, you have to follow these steps:

Stop the docker containers:

docker stop historyserver nodemanager1 resourcemanager datanode1 namenode

Remove the docker containers:

docker rm historyserver nodemanager1 resourcemanager datanode1 namenode

Use this command:

docker exec -it namenode bash

If you are on Ubuntu or you use WSL, you can create an alias of this command adding this line to the file ~/.bashrc:

alias docker-hadoop='docker exec -it namenode bash'

To use them everywhere follow these steps:

Create hadoop-scripts directory in /usr/local:
```
sudo mkdir /usr/local/hadoop-scripts
```
Move the Scripts content in /usr/local/hadoop-scripts:
```
sudo cp ./Scripts/* /usr/local/hadoop-scripts
```

Change the owner:

sudo chown -R <your_user>:<your_group> /usr/local/hadoop-scripts

Add the execution permission:

sudo chmod a+x /usr/local/hadoop-scripts/*

Add these lines at the end of the file ~/.bashrc:

export HADOOP_SCRIPTS_HOME=/usr/local/hadoop-scripts
export PATH=$PATH:$HADOOP_SCRIPTS_HOME

Inside ./handoop directory you can find a simple example called WordCount. First of all, you have to install Maven.

Then you can generate the jar file using the following command:

mvn package

Now you can use it!