Open App

Computer Science Engineering (CSE) Exam > Computer Science Engineering (CSE) Notes > Algorithms > Extendible Hashing

Extendible Hashing | Algorithms - Computer Science Engineering (CSE) PDF Download

Introduction

Extendible Hashing is a dynamic hashing method wherein directories, and buckets are used to hash data. It is an aggressively flexible method in which the hash function also experiences dynamic changes.

Main features of Extendible Hashing:

The main features in this hashing technique are:

Directories: The directories store addresses of the buckets in pointers. An id is assigned to each directory which may change each time when Directory Expansion takes place.
Buckets: The buckets are used to hash the actual data.

Basic Structure of Extendible Hashing:

Frequently used terms in Extendible Hashing:

Directories: These containers store pointers to buckets. Each directory is given a unique id which may change each time when expansion takes place. The hash function returns this directory id which is used to navigate to the appropriate bucket. Number of Directories = 2^Global Depth.
Buckets: They store the hashed keys. Directories point to buckets. A bucket may contain more than one pointers to it if its local depth is less than the global depth.
Global Depth: It is associated with the Directories. They denote the number of bits which are used by the hash function to categorize the keys. Global Depth = Number of bits in directory id.
Local Depth: It is the same as that of Global Depth except for the fact that Local Depth is associated with the buckets and not the directories. Local depth in accordance with the global depth is used to decide the action that to be performed in case an overflow occurs. Local Depth is always less than or equal to the Global Depth.
Bucket Splitting: When the number of elements in a bucket exceeds a particular size, then the bucket is split into two parts.
Directory Expansion: Directory Expansion Takes place when a bucket overflows. Directory Expansion is performed when the local depth of the overflowing bucket is equal to the global depth.

Basic Working of Extendible Hashing

Step 1: Analyze Data Elements: Data elements may exist in various forms eg. Integer, String, Float, etc.. Currently, let us consider data elements of type integer. eg: 49.
Step 2: Convert into binary format: Convert the data element in Binary form. For string elements, consider the ASCII equivalent integer of the starting character and then convert the integer into binary form. Since we have 49 as our data element, its binary form is 110001.
Step 3: Check Global Depth of the directory. Suppose the global depth of the Hash-directory is 3.
Step 4: Identify the Directory: Consider the ‘Global-Depth’ number of LSBs in the binary number and match it to the directory id.
Eg. The binary obtained is: 110001 and the global-depth is 3. So, the hash function will return 3 LSBs of 110001 viz. 001.
Step 5: Navigation: Now, navigate to the bucket pointed by the directory with directory-id 001.
Step 6: Insertion and Overflow Check: Insert the element and check if the bucket overflows. If an overflow is encountered, go to step 7 followed by Step 8, otherwise, go to step 9.
Step 7: Tackling Over Flow Condition during Data Insertion: Many times, while inserting data in the buckets, it might happen that the Bucket overflows. In such cases, we need to follow an appropriate procedure to avoid mishandling of data.
First, Check if the local depth is less than or equal to the global depth. Then choose one of the cases below.
- Case1: If the local depth of the overflowing Bucket is equal to the global depth, then Directory Expansion, as well as Bucket Split, needs to be performed. Then increment the global depth and the local depth value by 1. And, assign appropriate pointers.
  Directory expansion will double the number of directories present in the hash structure.
- Case2: In case the local depth is less than the global depth, then only Bucket Split takes place. Then increment only the local depth value by 1. And, assign appropriate pointers.
- Step 8: Rehashing of Split Bucket Elements: The Elements present in the overflowing bucket that is split are rehashed w.r.t the new global depth of the directory.
- Step 9: The element is successfully hashed.

Example based on Extendible Hashing: Now, let us consider a prominent example of hashing the following elements: 16,4,6,22,24,10,31,7,9,20,26.
Bucket Size: 3 (Assume)
Hash Function: Suppose the global depth is X. Then the Hash Function returns X LSBs.

Solution: First, calculate the binary forms of each of the given numbers.
16- 10000
4- 00100
6- 00110
22- 10110
24- 11000
10- 01010
31- 11111
7- 00111
9- 01001
20- 10100
26- 11010
Initially, the global-depth and local-depth is always 1. Thus, the hashing frame looks like this:
Inserting 16:
The binary format of 16 is 10000 and global-depth is 1. The hash function returns 1 LSB of 10000 which is 0. Hence, 16 is mapped to the directory with id=0.
Inserting 4 and 6:
Both 4(100) and 6(110)have 0 in their LSB. Hence, they are hashed as follows:
Inserting 22: The binary form of 22 is 10110. Its LSB is 0. The bucket pointed by directory 0 is already full. Hence, Over Flow occurs.
As directed by Step 7-Case 1, Since Local Depth = Global Depth, the bucket splits and directory expansion takes place. Also, rehashing of numbers present in the overflowing bucket takes place after the split. And, since the global depth is incremented by 1, now,the global depth is 2. Hence, 16,4,6,22 are now rehashed w.r.t 2 LSBs.[ 16(10000),4(100),6(110),22(10110)]

*Notice that the bucket which was underflow has remained untouched. But, since the number of directories has doubled, we now have 2 directories 01 and 11 pointing to the same bucket. This is because the local-depth of the bucket has remained 1. And, any bucket having a local depth less than the global depth is pointed-to by more than one directories.

Inserting 24 and 10: 24(11000) and 10 (1010) can be hashed based on directories with id 00 and 10. Here, we encounter no overflow condition.
Inserting 31,7,9: All of these elements[ 31(11111), 7(111), 9(1001) ] have either 01 or 11 in their LSBs. Hence, they are mapped on the bucket pointed out by 01 and 11. We do not encounter any overflow condition here.
Inserting 20: Insertion of data element 20 (10100) will again cause the overflow problem.
20 is inserted in bucket pointed out by 00. As directed by Step 7-Case 1, since the local depth of the bucket = global-depth, directory expansion (doubling) takes place along with bucket splitting. Elements present in overflowing bucket are rehashed with the new global depth. Now, the new Hash table looks like this:
Inserting 26: Global depth is 3. Hence, 3 LSBs of 26(11010) are considered. Therefore 26 best fits in the bucket pointed out by directory 010.
The bucket overflows, and, as directed by Step 7-Case 2, since the local depth of bucket < Global depth (2<3), directories are not doubled but, only the bucket is split and elements are rehashed.
Finally, the output of hashing the given list of numbers is obtained.
Hashing of 11 Numbers is Thus Completed.

Key Observations

A Bucket will have more than one pointers pointing to it if its local depth is less than the global depth.
When overflow condition occurs in a bucket, all the entries in the bucket are rehashed with a new local depth.
If Local Depth of the overflowing bucket
The size of a bucket cannot be changed after the data insertion process begins.

Advantages

Data retrieval is less expensive (in terms of computing).
No problem of Data-loss since the storage capacity increases dynamically.
With dynamic changes in hashing function, associated old values are rehashed w.r.t the new hash function.

Limitations Of Extendible Hashing

The directory size may increase significantly if several records are hashed on the same directory while keeping the record distribution non-uniform.
Size of every bucket is fixed.
Memory is wasted in pointers when the global depth and local depth difference becomes drastic.
This method is complicated to code.

Data Structures used for implementation:

B+ Trees
Array
Linked List

The document Extendible Hashing | Algorithms - Computer Science Engineering (CSE) is a part of the Computer Science Engineering (CSE) Course Algorithms.

All you need of Computer Science Engineering (CSE) at this link: Computer Science Engineering (CSE)

	Algorithms 81 videos\|80 docs\|33 tests

Algorithms

81 videos|80 docs|33 tests

Join Course for Free

Top Courses for Computer Science Engineering (CSE)

View all

Related Exams

Computer Science Engineering (CSE)

About this Document

	4.62/5 Rating
	Dec 22, 2024 Last updated

Document Description: Extendible Hashing for Computer Science Engineering (CSE) 2024 is part of Algorithms preparation. The notes and questions for Extendible Hashing have been prepared according to the Computer Science Engineering (CSE) exam syllabus. Information about Extendible Hashing covers topics like Introduction and Extendible Hashing Example, for Computer Science Engineering (CSE) 2024 Exam. Find important definitions, questions, notes, meanings, examples, exercises and tests below for Extendible Hashing.

Introduction of Extendible Hashing in English is available as part of our Algorithms for Computer Science Engineering (CSE) & Extendible Hashing in Hindi for Algorithms course. Download more important topics related with notes, lectures and mock test series for Computer Science Engineering (CSE) Exam by signing up for free. Computer Science Engineering (CSE): Extendible Hashing | Algorithms - Computer Science Engineering (CSE)

Description

Full syllabus notes, lecture & questions for Extendible Hashing | Algorithms - Computer Science Engineering (CSE) - Computer Science Engineering (CSE) | Plus excerises question with solution to help you revise complete syllabus for Algorithms | Best notes, free PDF download

Information about Extendible Hashing

In this doc you can find the meaning of Extendible Hashing defined & explained in the simplest way possible. Besides explaining types of Extendible Hashing theory, EduRev gives you an ample number of questions to practice Extendible Hashing tests, examples and also practice Computer Science Engineering (CSE) tests

	Algorithms 81 videos\|80 docs\|33 tests

Algorithms

81 videos|80 docs|33 tests

Join Course for Free

Download as PDF

Explore Courses for Computer Science Engineering (CSE) exam

Top Courses for Computer Science Engineering (CSE)

Explore Courses

Signup for Free!

Signup to see your scores go up within 7 days! Learn & Practice with 1000+ FREE Notes, Videos & Tests.

Start learning for Free

10M+ students study on EduRev

Extra Questions

Sample Paper

Previous Year Questions with Solutions

ppt

Extendible Hashing | Algorithms - Computer Science Engineering (CSE)

Semester Notes

practice quizzes

Important questions

Extendible Hashing | Algorithms - Computer Science Engineering (CSE)

Free

Viva Questions

MCQs

pdf

video lectures

Extendible Hashing | Algorithms - Computer Science Engineering (CSE)

shortcuts and tricks

Exam

Summary

past year papers

mock tests for examination

Objective type Questions

study material

;

Additional Information about Extendible Hashing for Computer Science Engineering (CSE) Preparation

Extendible Hashing Free PDF Download

The Extendible Hashing is an invaluable resource that delves deep into the core of the Computer Science Engineering (CSE) exam. These study notes are curated by experts and cover all the essential topics and concepts, making your preparation more efficient and effective. With the help of these notes, you can grasp complex subjects quickly, revise important points easily, and reinforce your understanding of key concepts. The study notes are presented in a concise and easy-to-understand manner, allowing you to optimize your learning process. Whether you're looking for best-recommended books, sample papers, study material, or toppers' notes, this PDF has got you covered. Download the Extendible Hashing now and kickstart your journey towards success in the Computer Science Engineering (CSE) exam.

Importance of Extendible Hashing

The importance of Extendible Hashing cannot be overstated, especially for Computer Science Engineering (CSE) aspirants. This document holds the key to success in the Computer Science Engineering (CSE) exam. It offers a detailed understanding of the concept, providing invaluable insights into the topic. By knowing the concepts well in advance, students can plan their preparation effectively. Utilize this indispensable guide for a well-rounded preparation and achieve your desired results.

Extendible Hashing Notes

Extendible Hashing Notes offer in-depth insights into the specific topic to help you master it with ease. This comprehensive document covers all aspects related to Extendible Hashing. It includes detailed information about the exam syllabus, recommended books, and study materials for a well-rounded preparation. Practice papers and question papers enable you to assess your progress effectively. Additionally, the paper analysis provides valuable tips for tackling the exam strategically. Access to Toppers' notes gives you an edge in understanding complex concepts. Whether you're a beginner or aiming for advanced proficiency, Extendible Hashing Notes on EduRev are your ultimate resource for success.

Extendible Hashing Computer Science Engineering (CSE) Questions

The "Extendible Hashing Computer Science Engineering (CSE) Questions" guide is a valuable resource for all aspiring students preparing for the Computer Science Engineering (CSE) exam. It focuses on providing a wide range of practice questions to help students gauge their understanding of the exam topics. These questions cover the entire syllabus, ensuring comprehensive preparation. The guide includes previous years' question papers for students to familiarize themselves with the exam's format and difficulty level. Additionally, it offers subject-specific question banks, allowing students to focus on weak areas and improve their performance.

Study Extendible Hashing on the App

Students of Computer Science Engineering (CSE) can study Extendible Hashing alongwith tests & analysis from the EduRev app, which will help them while preparing for their exam. Apart from the Extendible Hashing, students can also utilize the EduRev App for other study materials such as previous year question papers, syllabus, important questions, etc. The EduRev App will make your learning easier as you can access it from anywhere you want. The content of Extendible Hashing is prepared as per the latest Computer Science Engineering (CSE) syllabus.

Education Revolution