Configuration of Hadoop Cluster Using Ansible

Mihir Patel
2 min readDec 20, 2020

--

In this blog you will see how the configuration the Hadoop cluster by the Ansible.

Let’s See First what is ansible and Hadoop.

What is Hadoop?

Hadoop is an open source, a Java-based programming framework that supports the storage and processing of extremely large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation.

What is Ansible?

Ansible is an open-source software that automates software provisioning, configuration management, and application deployment.

Now Let’s start practical :

Steps To configuratio Hadoop by ansible …

Step 1: Copying the hadoop software

Step 2: Copying the java software

Step 3: Installing thee hadoop software

Step 4: Installing the java Softwrae

Step 5: Create the Directory

Step 6:Configuration the core-site.xml

step 7: Cofiguration the hdfs-site.xml

Step 8: Start datanode or namenode if namenode then format

First write inventory :

Now Cofiguration in Namenode by hadoop.yml

hadoop.yml for Namenode

namenode ip is 192.168.0.113

datanode ip is 192.168.0.108

Namenode Confi.. : yml File..

Output:

So now our Namenode is started.. see by ‘jps’ command

Now Configuration in Datanode

hadoop2.yml for Datanode

Datanode Confi .. yml File :

Output:

So now Our datnode is started succesfully , see by ‘jps ’ command

Now you can confirm by Type url:192.168.0.113:50070

Thank you

--

--