1. Hadoop 2 Installation Tutorial: Objective
This Hadoop 2 Installation tutorial describes how to install and configure Hadoop cluster on a single-node on Ubuntu OS. Single Node Hadoop cluster is also called as “Hadoop Pseudo-Distributed Mode”. The Hadoop 2 installation is explained here very simply and to the point, so that you can learn Hadoop CDH5 Installation in 10 Min. Once the you install Hadoop 2 is done you can perform Hadoop Distributed File System (HDFS) and Hadoop Map-Reduce operations.
Looking to BOOST your career in the exciting field of Big Data, Learn Big Data and Hadoop from Experts.
2. Hadoop 2 Installation: Video Tutorial
https://edurev.in/studytube/Easiest-way-to-install--setup-hadoop--Hadoop-tutor/9a1e6494-41a1-4e6a-894d-f380be774c2d_v
3. Install Hadoop 2 on Ubuntu
Follow the steps given below to install and configure Hadoop 2 cluster on ubuntu os-
3.1. Recommended Platform
I. Setup Platform
If you are using Windows/Mac OS you can create a virtual machine and install Ubuntu using VMWare Player, alternatively, you can create a virtual machine and install Ubuntu using Oracle Virtual Box.
3.2. Prerequisites
I. Install Java 8 (Recommended Oracle Java)
a. Install Python Software Properties
sudo apt-get install python-software-properties
b. Add Repository
sudo add-apt-repository ppa:webupd8team/java
c. Update the source list
sudo apt-get update
d. Install Java
sudo apt-get install oracle-java8-installer
II. Configure SSH
a. Install Open SSH Server-Client
sudo apt-get install openssh-server openssh-client
b. Generate Key Pairs
ssh-keygen -t rsa -P ""
c. Configure password-less SSH
cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys
d. Check by SSH to localhost
ssh localhost
3.2. Install Hadoop
I. Download Hadoop 2
http://archive.cloudera.com/cdh5/cdh/5/hadoop-2.5.0-cdh5.3.2.tar.gz
II. Untar Tar ball
tar xzf hadoop-2.5.0-cdh5.3.2.tar.gz
Note: All the required jars, scripts, configuration files, etc. are available in HADOOP_HOME directory (hadoop-2.5.0-cdh5.3.2).
III. Hadoop 2 Setup Configuration
a. Edit .bashrc
Now, edit .bashrc file located in user’s home directory and add following parameters:
Note: After above step restarts the terminal so that all the environment variables will come into effect.
b. Edit hadoop-env.sh
Now, edit configuration file hadoop-env.sh (located in HADOOP_HOME/etc/hadoop) and set JAVA_HOME:
export JAVA_HOME=<path-to-the-root-of-your-Java-installation> (eg: /usr/lib/jvm/java-8-oracle/)
c. Edit core-site.xml
Now, edit configuration file core-site.xml (located in HADOOP_HOME/etc/hadoop) and add following entries:
Note: /home/hdadmin/hdata is a sample location; please specify a location where you have Read Write privileges
d. Edit hdfs-site.xml
Now, edit configuration file hdfs-site.xml (located in HADOOP_HOME/etc/hadoop) and add following entries:
e. Edit mapred-site.xml
Now, edit configuration file mapred-site.xml (located in HADOOP_HOME/etc/hadoop) and add following entries:
f. Edit yarn-site.xml
Now, edit configuration file mapred-site.xml (located in HADOOP_HOME/etc/hadoop) and add following entries:
3.4. Start the Cluster
I. Format the name node
bin/hdfs namenode -format
NOTE: This activity should be done once when you install Hadoop, else It will delete all your data from HDFS.
II. Start HDFS Services
sbin/start-dfs.sh
III. Start YARN Services
sbin/start-yarn.sh
Follow this link to learn What is YARN?
IV. Check whether services have been started
3.5. Run Map-Reduce Jobs
I. Run word count example
Follow HDFS command Guide to Play with HDFS Commands and perform various operations,
3.6. Stop The Cluster
I. Stop HDFS Services
sbin/stop-dfs.sh
II. Stop YARN Services
sbin/stop-yarn.sh
1 videos|14 docs
|
1. How do I install Hadoop 2 on Ubuntu? |
2. What is the difference between Hadoop 1 and Hadoop 2? |
3. Can I install Hadoop 2 on a different Ubuntu version? |
4. What are the system requirements for installing Hadoop 2 on Ubuntu? |
5. Can I use Hadoop 2 for production environments on Ubuntu? |
|
Explore Courses for Software Development exam
|