Original by acorbacho@nflabs.com. Modified by Patrik Ohlson - ZDi.it.
Contributing: Contribution Guide
License: Apache 2.0
Ansible Hadoop is a playbook that helps you deploy a new Hadoop HA / ZooKeeper cluster using Ansible.
The playbooks can:
- Deploy a fully functional Hadoop cluster with High Availability (HA) and automatic failover.
- Deploy additional nodes to scale the cluster.
- Hadoop
- ZooKeeper
- JournalNode
- HDFS
- Ansible 1.6+
- CentOS 7+ or RedHat servers
Edit the files:
- hosts: set the hosts and services.
- group_vars/all: change or add configuration parameters (e.g. HDFS path, Spark port).
site_name: mycluster # The name of your cluster
update_iptables: True/False # If True, change iptables file to add ip_range.
update_hosts: True/False # If True, set the hosts file to every host in the cluster.
install_oracle_jdk: True/False # If True, download and install the Oracle JDK from Oracle's servers.
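Before deploying, it can help to confirm which of these switches are actually set. The following is a minimal sketch, assuming group_vars/all is a flat YAML file with the four keys above at the start of a line; the function name `show_deploy_vars` is hypothetical and not part of the playbook.

```shell
# Hypothetical helper (not shipped with the playbook): print the
# deployment switches currently set in a vars file.
# Usage: show_deploy_vars [path]   (path defaults to group_vars/all)
show_deploy_vars() {
    vars_file="${1:-group_vars/all}"
    if [ ! -r "$vars_file" ]; then
        echo "cannot read $vars_file" >&2
        return 1
    fi
    # Match only the four top-level keys described above.
    grep -E '^(site_name|update_iptables|update_hosts|install_oracle_jdk):' "$vars_file"
}
```

Running `show_deploy_vars` from the repository root prints the matching lines, comments included, so a wrong value is visible before anything is changed on the hosts.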
To run with Ansible:
./deploy
To install only one component, for example ZooKeeper, pass its tag as an argument.
Available tags:
- hadoop
- ntp
- zookeeper
- slaves
./deploy zookeeper
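A mistyped tag is easy to miss, so the call above can be wrapped in a small check against the documented tag list. This is only a sketch: `deploy_tag` and the `DEPLOY_CMD` override are hypothetical names introduced here, and it assumes `./deploy` takes a single tag as its first argument, as in the example above.

```shell
# Hypothetical wrapper: validate a tag against the documented list,
# then hand it to ./deploy. Set DEPLOY_CMD to override the command
# (for example DEPLOY_CMD=echo for a dry run).
deploy_tag() {
    case "$1" in
        hadoop|ntp|zookeeper|slaves)
            ${DEPLOY_CMD:-./deploy} "$1"
            ;;
        *)
            echo "unknown tag '$1' (expected: hadoop, ntp, zookeeper, slaves)" >&2
            return 2
            ;;
    esac
}
```

For example, `deploy_tag zookeeper` runs `./deploy zookeeper`, while `deploy_tag zookeper` fails with exit status 2 without running anything.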
Don't forget to open the ports on the hosts if you want to access your cluster remotely.
- HDFS (active): master:50070
- HDFS (standby): master2:50070
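To see which NameNode is currently active, the web port can be queried from the command line. The sketch below assumes that the NameNode exposes its `NameNodeStatus` JMX bean over HTTP on port 50070 and that `curl` is installed; the function names `nn_state` and `nn_report` are hypothetical, and the host names are the ones listed above.

```shell
# Extract the HA state ("active" or "standby") from NameNode JMX JSON on stdin.
nn_state() {
    sed -n 's/.*"State"[[:space:]]*:[[:space:]]*"\([a-z]*\)".*/\1/p'
}

# Query one NameNode web port and print "<host>: <state>".
# Assumes the NameNodeStatus JMX bean is exposed on the HTTP port.
nn_report() {
    url="http://$1:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus"
    state=$(curl -s --max-time 5 "$url" | nn_state)
    echo "$1: ${state:-unreachable}"
}
```

Calling `nn_report master` and `nn_report master2` should report one host as active and the other as standby; a host shown as unreachable usually means the port is still closed or the service is down.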
To restart all services, run:
./restart
To restart only some services, run:
./restart serviceName
Services that can be restarted:
- zookeepers
- journalnodes
- namenodes
- datanodes
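When several services need a restart, the order can matter. The sketch below restarts the requested services in the order they are listed above (ZooKeeper first, DataNodes last). That ordering is an assumption, not something the playbook prescribes, and `restart_in_order` and `RESTART_CMD` are hypothetical names. It also assumes `./restart` takes one service name per call.

```shell
# Hypothetical helper: restart the requested services in a fixed order
# (ZooKeeper first, DataNodes last), whatever order they were passed in.
# Set RESTART_CMD to override the command, e.g. RESTART_CMD=echo for a dry run.
restart_in_order() {
    # Reject unknown names before touching anything.
    for svc in "$@"; do
        case "$svc" in
            zookeepers|journalnodes|namenodes|datanodes) ;;
            *) echo "unknown service: $svc" >&2; return 2 ;;
        esac
    done
    # Walk the fixed order and restart each service that was requested.
    for svc in zookeepers journalnodes namenodes datanodes; do
        for wanted in "$@"; do
            if [ "$svc" = "$wanted" ]; then
                ${RESTART_CMD:-./restart} "$svc" || return 1
                break
            fi
        done
    done
}
```

For example, `restart_in_order datanodes zookeepers` calls `./restart zookeepers` and then `./restart datanodes`, stopping at the first failure.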