Hadoop 2 Installation on Ubuntu – Setup of Hadoop CDH5

1. Hadoop 2 Installation Tutorial: Objective

This Hadoop 2 installation tutorial describes how to install and configure a single-node Hadoop cluster on Ubuntu. A single-node Hadoop cluster is also called “Hadoop Pseudo-Distributed Mode”. The steps are kept simple and to the point so that you can complete a Hadoop CDH5 installation in about 10 minutes. Once the installation is done, you can perform Hadoop Distributed File System (HDFS) and Hadoop MapReduce operations.

2. Hadoop 2 Installation: Video Tutorial

https://edurev.in/studytube/Easiest-way-to-install--setup-hadoop--Hadoop-tutor/9a1e6494-41a1-4e6a-894d-f380be774c2d_v

3. Install Hadoop 2 on Ubuntu

Follow the steps below to install and configure a Hadoop 2 cluster on Ubuntu.

3.1. Recommended Platform

OS – Linux is supported as a development and production platform. You can use Ubuntu 14.04 or later (other Linux flavors such as CentOS or Red Hat also work).
Hadoop – Cloudera Distribution for Apache Hadoop CDH5.x (you can use Apache Hadoop 2.x)

I. Setup Platform

If you are using Windows or Mac OS, you can create a virtual machine and install Ubuntu on it using VMware Player or Oracle VirtualBox.

3.2. Prerequisites

I. Install Java 8 (Oracle Java recommended)

a. Install Python Software Properties

sudo apt-get install python-software-properties

b. Add Repository

sudo add-apt-repository ppa:webupd8team/java

c. Update the source list

sudo apt-get update

d. Install Java

sudo apt-get install oracle-java8-installer
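
Before moving on, it can help to confirm which Java version is on the PATH. The following is a minimal sketch, not part of the original steps: the helper name java_major_version is hypothetical, and it assumes the first line of the `java -version` banner carries the version in double quotes (for example `java version "1.8.0_201"`).

```shell
#!/usr/bin/env bash
# Hypothetical helper: extract the major Java version from a "java -version" banner.
# It reads the banner on stdin, so it can be exercised without Java installed.
java_major_version() {
  local ver
  # Take the quoted string on the first line, e.g. 1.8.0_201 or 11.0.2
  ver=$(head -n 1 | sed -n 's/.*"\(.*\)".*/\1/p')
  case "$ver" in
    1.*) echo "$ver" | cut -d. -f2 ;;   # legacy scheme: 1.8.0_201 -> 8
    *)   echo "$ver" | cut -d. -f1 ;;   # newer scheme: 11.0.2 -> 11
  esac
}

# "java -version" writes to stderr, so redirect it into the helper.
if command -v java >/dev/null 2>&1; then
  java -version 2>&1 | java_major_version
fi
```

If this prints 8, the Java installation matches what this tutorial expects.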

II. Configure SSH

a. Install OpenSSH server and client

sudo apt-get install openssh-server openssh-client

b. Generate Key Pairs

ssh-keygen -t rsa -P ""

c. Configure password-less SSH

cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys

d. Check by SSH to localhost

ssh localhost
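
If `ssh localhost` still asks for a password, overly permissive modes on the key files are a common cause when the SSH server runs with its default strict mode checks. The sketch below tightens them; the helper name fix_ssh_perms is hypothetical, and the 700/600 modes are the conventional choice rather than something the original steps prescribe.

```shell
#!/usr/bin/env bash
# Hypothetical helper: restrict permissions on an SSH directory and its
# authorized_keys file. Defaults to $HOME/.ssh when no argument is given.
fix_ssh_perms() {
  local dir="${1:-$HOME/.ssh}"
  [ -d "$dir" ] || return 1
  chmod 700 "$dir"                       # only the owner may enter the directory
  if [ -f "$dir/authorized_keys" ]; then
    chmod 600 "$dir/authorized_keys"     # only the owner may read or write the key list
  fi
}

# Usage (not run automatically):
#   fix_ssh_perms
#   ssh localhost
```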

3.3. Install Hadoop

I. Download Hadoop 2

http://archive.cloudera.com/cdh5/cdh/5/hadoop-2.5.0-cdh5.3.2.tar.gz

II. Extract the tarball

tar xzf hadoop-2.5.0-cdh5.3.2.tar.gz

Note: All the required jars, scripts, configuration files, etc. are available in the extracted directory (hadoop-2.5.0-cdh5.3.2), referred to below as HADOOP_HOME.
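
The download and extraction can be combined in a small script. This is a sketch under assumptions: the function name fetch_and_unpack and the variable names are hypothetical, and it presumes the archive URL above is still reachable from your machine.

```shell
#!/usr/bin/env bash
# URL taken from the download step above.
HADOOP_URL="http://archive.cloudera.com/cdh5/cdh/5/hadoop-2.5.0-cdh5.3.2.tar.gz"

# Derive the tarball name and the directory it unpacks into from the URL.
tarball=$(basename "$HADOOP_URL")            # hadoop-2.5.0-cdh5.3.2.tar.gz
hadoop_dir=$(basename "$tarball" .tar.gz)    # hadoop-2.5.0-cdh5.3.2

# Hypothetical helper: download (resuming a partial file with -c) and extract.
fetch_and_unpack() {
  wget -c "$HADOOP_URL" && tar xzf "$tarball"
}

echo "Hadoop will unpack into: $PWD/$hadoop_dir"
# Usage (not run automatically):
#   fetch_and_unpack
```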

III. Hadoop 2 Setup Configuration

a. Edit .bashrc

Now, edit the .bashrc file located in the user’s home directory and add the following parameters:

export HADOOP_PREFIX="/home/hdadmin/hadoop-2.5.0-cdh5.3.2"
export PATH=$PATH:$HADOOP_PREFIX/bin
export PATH=$PATH:$HADOOP_PREFIX/sbin
export HADOOP_MAPRED_HOME=${HADOOP_PREFIX}
export HADOOP_COMMON_HOME=${HADOOP_PREFIX}
export HADOOP_HDFS_HOME=${HADOOP_PREFIX}
export YARN_HOME=${HADOOP_PREFIX}

Note: After the above step, restart the terminal so that all the environment variables take effect.
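
If you script this step, appending the same export lines on every run would clutter .bashrc. A minimal sketch of an idempotent append follows; the helper name append_once is hypothetical and not part of Hadoop.

```shell
#!/usr/bin/env bash
# Hypothetical helper: append a line to a file only if that exact line
# is not already present (-x whole line, -F fixed string, -q quiet).
append_once() {
  local line="$1" file="$2"
  touch "$file"
  grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Usage (not run automatically; single quotes keep the variables unexpanded):
#   append_once 'export HADOOP_PREFIX="/home/hdadmin/hadoop-2.5.0-cdh5.3.2"' "$HOME/.bashrc"
#   append_once 'export PATH=$PATH:$HADOOP_PREFIX/bin' "$HOME/.bashrc"
#   source "$HOME/.bashrc"
```

Running `source "$HOME/.bashrc"` in the current shell is an alternative to restarting the terminal.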

b. Edit hadoop-env.sh

Now, edit the configuration file hadoop-env.sh (located in HADOOP_HOME/etc/hadoop) and set JAVA_HOME:

export JAVA_HOME=<path-to-the-root-of-your-Java-installation> (eg: /usr/lib/jvm/java-8-oracle/)
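
If you are unsure where Java is installed, the path can usually be derived from the resolved location of the java binary. The sketch below assumes the common layout in which the binary lives at <root>/bin/java or <root>/jre/bin/java; the helper name java_home_from_binary is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: turn the full path of a java binary into a JAVA_HOME
# candidate by stripping a trailing /bin/java and, if present, /jre.
java_home_from_binary() {
  local bin="$1"
  bin="${bin%/bin/java}"
  echo "${bin%/jre}"
}

# Resolve symlinks (e.g. /usr/bin/java -> /etc/alternatives/java -> real path).
if command -v java >/dev/null 2>&1; then
  java_home_from_binary "$(readlink -f "$(command -v java)")"
fi
```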

c. Edit core-site.xml

Now, edit the configuration file core-site.xml (located in HADOOP_HOME/etc/hadoop) and add the following entries:

<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/home/hdadmin/hdata</value>
</property>
</configuration>

Note: /home/hdadmin/hdata is a sample location; specify a location where you have read and write privileges.
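
Whether the chosen hadoop.tmp.dir location is usable can be checked before starting the cluster. A minimal sketch, with a hypothetical helper name:

```shell
#!/usr/bin/env bash
# Hypothetical helper: create the directory if needed and succeed only
# if the current user can write to it.
ensure_writable_dir() {
  local dir="$1"
  mkdir -p "$dir" && [ -w "$dir" ]
}

# Usage (not run automatically; substitute your own hadoop.tmp.dir value):
#   ensure_writable_dir /home/hdadmin/hdata || echo "choose another location"
```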

d. Edit hdfs-site.xml

Now, edit the configuration file hdfs-site.xml (located in HADOOP_HOME/etc/hadoop) and add the following entries:

<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>

e. Edit mapred-site.xml

Now, edit the configuration file mapred-site.xml (located in HADOOP_HOME/etc/hadoop) and add the following entries:

<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>

f. Edit yarn-site.xml

Now, edit the configuration file yarn-site.xml (located in HADOOP_HOME/etc/hadoop) and add the following entries:

<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
</configuration>
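
After editing the configuration files, it is worth confirming that each property holds the value you intended. The sketch below reads a property from a Hadoop-style XML file with awk. It is a rough check rather than a real XML parser: it assumes each name and value element sits on its own line, as in the snippets above, and the helper name get_property is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the value of a named property from a
# Hadoop-style configuration file (one <name>/<value> element per line).
get_property() {
  local name="$1" file="$2"
  awk -v want="$name" '
    /<name>/  { gsub(/.*<name>|<\/name>.*/, ""); cur = $0; next }    # remember the latest property name
    /<value>/ { gsub(/.*<value>|<\/value>.*/, ""); if (cur == want) print }
  ' "$file"
}

# Usage (not run automatically; paths are relative to the Hadoop directory):
#   get_property fs.defaultFS etc/hadoop/core-site.xml
#   get_property mapreduce.framework.name etc/hadoop/mapred-site.xml
```

Once the environment variables are in place, `hdfs getconf -confKey fs.defaultFS` reports the value Hadoop itself resolves, which is the more reliable check for HDFS settings.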

3.4. Start the Cluster

I. Format the name node

bin/hdfs namenode -format

NOTE: Format the NameNode only once, when you first install Hadoop. Formatting it again will delete all your data from HDFS.
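
A guard can reduce the risk of formatting an existing NameNode by accident. The sketch below assumes that dfs.namenode.name.dir has been left at its default, which places the NameNode metadata under <hadoop.tmp.dir>/dfs/name; the helper name namenode_is_formatted is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed if a NameNode VERSION file already exists
# under the given hadoop.tmp.dir (default dfs.namenode.name.dir layout assumed).
namenode_is_formatted() {
  local tmp_dir="$1"
  [ -f "$tmp_dir/dfs/name/current/VERSION" ]
}

# Usage (not run automatically; HADOOP_TMP_DIR is the hadoop.tmp.dir value
# you set in core-site.xml):
#   if namenode_is_formatted "$HADOOP_TMP_DIR"; then
#     echo "NameNode already formatted, skipping"
#   else
#     bin/hdfs namenode -format
#   fi
```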

II. Start HDFS Services

sbin/start-dfs.sh

III. Start YARN Services

sbin/start-yarn.sh

IV. Check whether services have been started

jps

The output should list the following daemons, each preceded by its process ID:

NameNode
DataNode
ResourceManager
NodeManager
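
The check can be automated by scanning the jps output for the four daemon names. This is a sketch with a hypothetical helper name, missing_daemons; it relies on jps printing one "<pid> <name>" pair per line and compares the name field exactly, so SecondaryNameNode is not mistaken for NameNode.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read jps output on stdin and print the name of every
# expected daemon that is NOT running (prints nothing when all are up).
missing_daemons() {
  local out d
  out=$(cat)
  for d in NameNode DataNode ResourceManager NodeManager; do
    # Exit status 0 only if some line has the daemon name as its second field.
    echo "$out" | awk -v n="$d" '$2 == n { found = 1 } END { exit !found }' || echo "$d"
  done
}

if command -v jps >/dev/null 2>&1; then
  jps | missing_daemons
fi
```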

3.5. Run Map-Reduce Jobs

I. Run word count example

bin/hdfs dfs -mkdir /inputwords
bin/hdfs dfs -put <data-file> /inputwords
bin/yarn jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.5.0-cdh5.3.2.jar wordcount /inputwords /outputwords
bin/hdfs dfs -cat /outputwords/*
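
To sanity-check the job's result on a small input file, the same counts can be computed locally with standard tools and compared against the HDFS output. The sketch below is an approximation under stated assumptions: it splits on whitespace and sorts in byte order, which is how the bundled wordcount example is generally understood to behave, and the helper name local_wordcount is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: count whitespace-separated words read from stdin and
# print "word<TAB>count" lines sorted by word in byte order.
local_wordcount() {
  tr -s '[:space:]' '\n' \
    | grep -v '^$' \
    | LC_ALL=C sort \
    | uniq -c \
    | awk '{ print $2 "\t" $1 }'
}

# Usage (not run automatically; <data-file> is the file you uploaded):
#   local_wordcount < <data-file> > /tmp/expected.txt
#   bin/hdfs dfs -cat '/outputwords/part-r-*' | diff /tmp/expected.txt -
```

An empty diff suggests that the MapReduce job and the local pipeline agree on the counts.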

3.6. Stop The Cluster

I. Stop HDFS Services

sbin/stop-dfs.sh

II. Stop YARN Services

sbin/stop-yarn.sh

FAQs on Hadoop 2 Installation on Ubuntu – Setup of Hadoop CDH5

1. How do I install Hadoop 2 on Ubuntu?

Ans. To install Hadoop 2 on Ubuntu, follow the steps in this article: install Java 8, configure password-less SSH, download and extract the Hadoop CDH5 tarball, set the environment variables in .bashrc, edit hadoop-env.sh, core-site.xml, hdfs-site.xml, mapred-site.xml and yarn-site.xml, format the NameNode, and then start the HDFS and YARN services.

2. What is the difference between Hadoop 1 and Hadoop 2?

Ans. Hadoop 1 and Hadoop 2 are two different versions of the Hadoop framework. The main difference between them is the introduction of YARN (Yet Another Resource Negotiator) in Hadoop 2. YARN allows Hadoop to support multiple processing models, making it more flexible and efficient. In Hadoop 1, the MapReduce framework was tightly coupled with the resource management, which limited its scalability.

3. Can I install Hadoop 2 on a different Ubuntu version?

Ans. Yes, you can install Hadoop 2 on different versions of Ubuntu. However, the steps may vary slightly depending on the version you are using. It is recommended to refer to the official documentation or specific installation guides for the particular Ubuntu version you are using to ensure compatibility and accuracy.

4. What are the system requirements for installing Hadoop 2 on Ubuntu?

Ans. The system requirements for installing Hadoop 2 on Ubuntu include a 64-bit processor, a minimum of 4GB RAM, and a minimum of 10GB disk space. It is also recommended to have a dedicated machine for running Hadoop to ensure optimal performance. Additionally, a stable internet connection is required to download and install the necessary packages.

5. Can I use Hadoop 2 for production environments on Ubuntu?

Ans. Yes, Hadoop 2 can be used for production environments on Ubuntu. However, it is important to properly configure and tune Hadoop according to your specific requirements and workload. It is also recommended to have a good understanding of Hadoop administration and monitoring to ensure the smooth running of your production environment. Regular maintenance, monitoring, and updates are crucial for the stability and performance of your Hadoop cluster.

About this Document

Apr 22, 2025 Last updated
