1. Objective
This article is all about Hadoop NameNode Automatic failover. Here we will cover Introduction to failover in Hadoop. It also covers types of Failover i.e. Graceful Failover and Automatic Failover. The components add by failover in Hadoop HDFS- ZooKeeper quorum, ZKFailoverController Process (ZKFC) are also covered on this blog in detail.
2. What is Failover?
Failover is the process in which system transfers its control to the secondary system when it detects a fault or failure.
There are two types of Failover:
3. Apache Hadoop NameNode Automatic Failover
Automatic failover adds below components to an HDFS deployment:
3.1. Zookeeper Quorum
It is a basic service for maintaining small amounts of coordination data, configuration information, and naming. And it provides distributed synchronization and group services. It informs the client of changes in the data, and track clients for failures.
Implementation of automatic HDFS failover relies on Zookeeper for:
3.2. ZKFailoverController process (ZKFC)
ZKFC is a client of Zookeeper that monitors and manages the namenode status. So, Each of the machines which run namenode service also runs a ZKFC.
ZKFC handles:
4. Conclusion
In conclusion, we can say that it starts automatically in case of NameNode failure. Automatic failover adds ZooKeeper quorum and ZKFC components to an HDFS deployment. Hence, Zookeeper is a service that provides the automatic failover capability in HDFS.
1 videos|14 docs
|
|
Explore Courses for Software Development exam
|