Configuration of Hadoop Cluster Using Ansible

Mihir Patel
Dec 20, 2020

In this blog you will see how to configure a Hadoop cluster with Ansible.

Let’s first see what Hadoop and Ansible are.

What is Hadoop?

Hadoop is an open-source, Java-based framework that supports the storage and processing of extremely large data sets in a distributed computing environment. It is a project of the Apache Software Foundation.

What is Ansible?

Ansible is open-source software that automates software provisioning, configuration management, and application deployment.

Now let’s start the practical part:

Steps to configure Hadoop with Ansible:

Step 1: Copy the Hadoop software

Step 2: Copy the Java software

Step 3: Install the Hadoop software

Step 4: Install the Java software

Step 5: Create the directory

Step 6: Configure core-site.xml

Step 7: Configure hdfs-site.xml

Step 8: Start the NameNode or DataNode daemon (on the NameNode, format it before the first start)

First, write the inventory:
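A minimal sketch of such an inventory in Ansible's YAML format is shown below. The two addresses are the NameNode and DataNode IPs used in this post; the file name, the group names `namenode` and `datanode`, and the `root` login are assumptions made for illustration.

```yaml
# inventory.yml (hypothetical file name)
all:
  children:
    namenode:              # hypothetical group name
      hosts:
        192.168.0.113:
    datanode:              # hypothetical group name
      hosts:
        192.168.0.108:
  vars:
    ansible_user: root     # assumption: Ansible logs in as root over SSH
```

You can check connectivity before running any playbook with `ansible all -i inventory.yml -m ping`.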

Now configure the NameNode with hadoop.yml

hadoop.yml for the NameNode

The NameNode IP is 192.168.0.113

The DataNode IP is 192.168.0.108

NameNode configuration: YAML file
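As a rough sketch of what such a playbook could contain, the play below walks through the eight steps for the NameNode. Everything in `vars` is an assumption chosen for illustration: the RPM file names, the `/etc/hadoop` configuration directory, the `/nn` directory, and port 9001 are hypothetical and should be replaced with your own values. It also assumes that `hadoop` and `hadoop-daemon.sh` are on the PATH after installation, and it uses the older property names `fs.default.name` and `dfs.name.dir`.

```yaml
# hadoop.yml: NameNode sketch, not the exact playbook used for this post
- hosts: namenode                              # hypothetical inventory group
  become: true
  vars:
    hadoop_rpm: hadoop-1.2.1-1.x86_64.rpm      # hypothetical package file
    jdk_rpm: jdk-8u171-linux-x64.rpm           # hypothetical package file
    hadoop_conf_dir: /etc/hadoop               # assumption
    nn_dir: /nn                                # assumption
    nn_ip: 192.168.0.113
    nn_port: 9001                              # assumption
  tasks:
    - name: Steps 1 and 2 - copy the Hadoop and Java packages
      ansible.builtin.copy:
        src: "{{ item }}"
        dest: "/root/{{ item }}"
      loop:
        - "{{ hadoop_rpm }}"
        - "{{ jdk_rpm }}"

    - name: Step 3 - install Hadoop (skipped if /usr/bin/hadoop exists)
      ansible.builtin.command:
        cmd: "rpm -i /root/{{ hadoop_rpm }}"
        creates: /usr/bin/hadoop               # assumption about install path

    - name: Step 4 - install Java (skipped if /usr/bin/java exists)
      ansible.builtin.command:
        cmd: "rpm -i /root/{{ jdk_rpm }}"
        creates: /usr/bin/java                 # assumption about install path

    - name: Step 5 - create the NameNode metadata directory
      ansible.builtin.file:
        path: "{{ nn_dir }}"
        state: directory

    - name: Step 6 - write core-site.xml
      ansible.builtin.copy:
        dest: "{{ hadoop_conf_dir }}/core-site.xml"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>fs.default.name</name>
              <value>hdfs://{{ nn_ip }}:{{ nn_port }}</value>
            </property>
          </configuration>

    - name: Step 7 - write hdfs-site.xml
      ansible.builtin.copy:
        dest: "{{ hadoop_conf_dir }}/hdfs-site.xml"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>dfs.name.dir</name>
              <value>{{ nn_dir }}</value>
            </property>
          </configuration>

    - name: Step 8a - format the NameNode only if it was never formatted
      ansible.builtin.shell:
        # "Y" is piped in because some releases ask for confirmation
        cmd: echo Y | hadoop namenode -format
        creates: "{{ nn_dir }}/current/VERSION"

    - name: Step 8b - start the NameNode daemon
      ansible.builtin.command:
        cmd: hadoop-daemon.sh start namenode
      register: nn_start
      changed_when: nn_start.rc == 0
      # Tolerate a re-run when the daemon is already up
      failed_when: nn_start.rc != 0 and 'running as process' not in nn_start.stdout
```

The `creates` arguments keep the install and format tasks from running a second time, so the playbook can be re-run without wiping the NameNode metadata.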

Output:

So now our NameNode is started. You can confirm this with the ‘jps’ command.
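If you prefer to run this check from Ansible instead of logging in, a small play like the following could do it. This is only a sketch: the `namenode` group name is hypothetical, and it assumes `jps` is on the PATH and that the daemon was started as root.

```yaml
# check-namenode.yml (hypothetical file name)
- hosts: namenode                  # hypothetical inventory group
  become: true                     # jps only lists processes the user may see
  tasks:
    - name: List running Java processes
      ansible.builtin.command: jps
      register: jps_out
      changed_when: false

    - name: Fail unless a NameNode process is listed
      ansible.builtin.assert:
        that:
          # Match " NameNode" at end of line so SecondaryNameNode does not count
          - "jps_out.stdout_lines | select('search', ' NameNode$') | list | length > 0"
```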

Now configure the DataNode

hadoop2.yml for the DataNode

DataNode configuration: YAML file
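The DataNode play differs from the NameNode one mainly in steps 5 to 8, so the sketch below shows only those. It assumes steps 1 to 4 (copying and installing the packages) have already run on this host. The `/dn` directory, the `/etc/hadoop` configuration directory, and port 9001 are hypothetical values; the port must match whatever the NameNode's core-site.xml uses. Note that no format step is needed on a DataNode.

```yaml
# hadoop2.yml: DataNode sketch, steps 5 to 8 only
- hosts: datanode                              # hypothetical inventory group
  become: true
  vars:
    hadoop_conf_dir: /etc/hadoop               # assumption
    dn_dir: /dn                                # assumption
    nn_ip: 192.168.0.113                       # NameNode address from this post
    nn_port: 9001                              # assumption, must match the NameNode
  tasks:
    - name: Step 5 - create the DataNode storage directory
      ansible.builtin.file:
        path: "{{ dn_dir }}"
        state: directory

    - name: Step 6 - write core-site.xml pointing at the NameNode
      ansible.builtin.copy:
        dest: "{{ hadoop_conf_dir }}/core-site.xml"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>fs.default.name</name>
              <value>hdfs://{{ nn_ip }}:{{ nn_port }}</value>
            </property>
          </configuration>

    - name: Step 7 - write hdfs-site.xml
      ansible.builtin.copy:
        dest: "{{ hadoop_conf_dir }}/hdfs-site.xml"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>dfs.data.dir</name>
              <value>{{ dn_dir }}</value>
            </property>
          </configuration>

    - name: Step 8 - start the DataNode daemon
      ansible.builtin.command:
        cmd: hadoop-daemon.sh start datanode
      register: dn_start
      changed_when: dn_start.rc == 0
      # Tolerate a re-run when the daemon is already up
      failed_when: dn_start.rc != 0 and 'running as process' not in dn_start.stdout
```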

Output:

So now our DataNode has started successfully. You can confirm this with the ‘jps’ command.

You can also confirm in a browser by opening the NameNode web UI at http://192.168.0.113:50070
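The same check can be scripted. As a small sketch, the play below requests that URL from the machine running Ansible and fails if the web UI does not answer with HTTP 200; the file name is hypothetical.

```yaml
# check-webui.yml (hypothetical file name)
- hosts: localhost
  gather_facts: false
  tasks:
    - name: Request the NameNode web UI
      ansible.builtin.uri:
        url: http://192.168.0.113:50070
        status_code: 200
```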

Thank you
