The Memory Hierarchy

Introduction to Memory Hierarchy

The memory hierarchy organises the various storage devices in a computer system so that the processor can obtain data at the highest possible speed while the overall cost of storage is kept reasonable. Different memory technologies trade off three principal characteristics: capacity, cost per bit and access time. A single memory technology cannot simultaneously provide very large capacity, very low cost and very short access time; the hierarchy combines technologies so that the system gains the best of each.

Why use a hierarchy?

  • Not all information is needed by the CPU at the same time; most programs repeatedly access a small fraction of their data and instructions.
  • Fast memories are expensive per bit and therefore used in small amounts close to the CPU; slower memories are cheaper and used for large-capacity backup.
  • Using several levels of storage gives both high effective speed (by keeping frequently used items in fast memory) and large overall capacity (by storing less-used items in low-cost devices).
  • The memory unit that communicates directly with the CPU is called the main memory. Devices that provide large, persistent backup storage are called auxiliary memory.
  • The main memory occupies a central position: it can communicate directly with the CPU and with auxiliary memory devices (often through an I/O processor).
  • A special, very-high-speed memory called cache is used to increase processing speed by making currently used instructions and data available to the CPU at a rapid rate.
  • CPU logic is usually faster than main memory access; hence processing speed is often limited by memory access times and is improved by cache.
  • The memory hierarchy therefore spans from very fast, small storage near the CPU to very slow, very large storage as auxiliary devices; the levels cooperate to give both performance and capacity.

Structure of the hierarchy

  • Registers - the fastest, smallest storage inside the CPU used for immediate computation.
  • Level 1 (L1) cache - small and very fast; often split into instruction and data caches.
  • Level 2 (L2) cache - larger and slower than L1; may be on-chip or off-chip.
  • Main memory (RAM) - larger capacity, slower access; directly accessible by the CPU.
  • Disk cache - buffer between main memory and disk to reduce disk I/O.
  • Disk (HDD/SSD) - large, persistent storage used as primary auxiliary memory.
  • Optical (CD/DVD/BD) - removable media for distribution and backup.
  • Tape - very high capacity and very low cost per bit, used for archival backup.

Behaviour as you move down the hierarchy

  • Cost per bit decreases - lower levels are cheaper to build per unit of storage.
  • Capacity increases - lower levels provide much larger storage space.
  • Access time increases - lower levels are slower to access.
  • Frequency of access decreases - the processor accesses higher-level (faster) memory far more frequently than lower-level memory.

Cache: purpose and basic operation

Cache memory is a small, fast memory placed between the CPU and main memory. It holds copies of a subset of main memory that the CPU is likely to access soon. Cache operation relies on the principle of locality of reference:

  • Temporal locality - if a location is referenced, it is likely to be referenced again soon.
  • Spatial locality - if a location is referenced, nearby locations are likely to be referenced soon.
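Both forms of locality can be made concrete with a minimal simulation. The sketch below (illustrative only; the function name and the LRU capacity of 4 are assumptions, not part of the original text) counts how often a tiny fully associative cache satisfies a request for two access traces: one with strong temporal reuse and one with none.

```python
from collections import OrderedDict

def hit_ratio(trace, capacity=4):
    """Simulate a tiny LRU cache of `capacity` addresses; return the hit ratio."""
    cache, hits = OrderedDict(), 0
    for addr in trace:
        if addr in cache:
            hits += 1
            cache.move_to_end(addr)        # temporal locality rewarded
        else:
            cache[addr] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used
    return hits / len(trace)

looped    = [0, 1, 2, 3] * 25   # a loop re-touching the same 4 addresses
scattered = list(range(100))    # 100 distinct addresses, no reuse
print(hit_ratio(looped))        # 0.96 - only the first 4 accesses miss
print(hit_ratio(scattered))     # 0.0  - every access misses
```

The repeated trace achieves a 96% hit ratio with a cache of only four entries, which is exactly why small caches work so well in practice.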

A cache stores data in units called blocks or lines. Each cached block is identified by a tag and located by an index (for set selection) and an offset (within the block).
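The tag/index/offset split can be sketched with bit arithmetic. The parameters below (a 32-bit address, 64-byte blocks, 128 sets) are assumed for illustration; real caches vary.

```python
# Assumed geometry: 64-byte blocks -> 6 offset bits; 128 sets -> 7 index bits.
BLOCK_BITS, INDEX_BITS = 6, 7

def split_address(addr):
    """Decompose an address into (tag, index, offset) for the assumed cache."""
    offset = addr & ((1 << BLOCK_BITS) - 1)            # byte within the block
    index  = (addr >> BLOCK_BITS) & ((1 << INDEX_BITS) - 1)  # selects the set
    tag    = addr >> (BLOCK_BITS + INDEX_BITS)         # identifies the block
    return tag, index, offset

print(split_address(0x12345678))  # (37282, 89, 56)
```

On a lookup, the index selects a set, the tag is compared against the tags stored in that set, and on a hit the offset picks the byte within the line.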

Mapping techniques

  • Direct mapping - each memory block maps to exactly one cache line determined by the index; simple and fast but may cause conflicts if multiple blocks map to the same line.
  • Fully associative mapping - a memory block may be placed in any cache line; flexible but requires associative search hardware to compare tags with all lines.
  • Set-associative mapping - a compromise: cache is divided into sets, each set contains several lines (ways); a memory block maps to exactly one set but can occupy any way within that set. Common designs are 2-way or 4-way set-associative.
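The placement rule for the three schemes reduces to how many candidate lines a block has. A minimal sketch, assuming a toy cache of 8 lines organised as 4 sets of 2 ways:

```python
NUM_SETS = 4  # 8 lines, 2-way set-associative -> 4 sets (assumed toy size)

def set_for_block(block_number):
    """A block maps to exactly one set but may occupy either way within it."""
    return block_number % NUM_SETS

# Blocks 1, 5 and 9 all map to set 1. With 2 ways, two of them can coexist;
# a direct-mapped cache (1 way per set) would keep only one at a time.
print([set_for_block(b) for b in (1, 5, 9)])  # [1, 1, 1]
```

Fully associative mapping is the limiting case of a single set containing every line; direct mapping is the limiting case of one way per set.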

Replacement and write policies

  • Replacement policies determine which cache block to evict when a new block must be loaded: common policies are Least Recently Used (LRU), First-In First-Out (FIFO), and random.
  • Write-through - on a write, data is written to both cache and main memory immediately; simpler but can increase memory traffic.
  • Write-back (write-behind) - updates are made only to the cache block and the block is written back to main memory only when it is evicted; reduces memory writes but requires a dirty bit and more complex coherence control.
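The traffic difference between the two write policies is easy to quantify. The sketch below (an assumed scenario, not a full cache model) counts main-memory writes for a single cached block that is written n times and then evicted:

```python
def memory_writes(n_writes, policy):
    """Main-memory writes caused by n stores to one cached block, then eviction."""
    if policy == "write-through":
        return n_writes               # every store is propagated immediately
    if policy == "write-back":
        return 1 if n_writes else 0   # one write-back at eviction, if dirty
    raise ValueError(policy)

print(memory_writes(100, "write-through"))  # 100
print(memory_writes(100, "write-back"))     # 1
```

For a block written 100 times, write-back generates a single memory write where write-through generates 100, which is why write-back is preferred when memory bandwidth is scarce.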

Cache performance metrics and formula

  • Hit - requested data is found in cache.
  • Miss - requested data is not in cache and must be fetched from lower-level memory.
  • Hit ratio - fraction of accesses that are hits; miss rate = 1 - hit ratio.
  • Miss penalty - extra time required to fetch data from lower-level memory on a miss.
  • Effective Access Time (EAT) - average time to access memory, taking hits and misses into account. The basic formula is EAT = (hit ratio × t_cache) + (miss rate × t_memory), where t_cache is the cache access time and t_memory is the time to obtain the required data from lower-level memory, including the miss penalty.

Example calculation:

  • Given t_cache = 1 ns, t_memory = 100 ns and hit ratio = 0.98, compute EAT.

EAT = (0.98 × 1 ns) + (0.02 × 100 ns) = 0.98 ns + 2.0 ns = 2.98 ns.
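The same calculation can be expressed directly as a function, which makes it easy to explore how sensitive EAT is to the hit ratio:

```python
def effective_access_time(hit_ratio, t_cache, t_memory):
    """EAT = (hit ratio x cache time) + (miss rate x lower-level access time)."""
    return hit_ratio * t_cache + (1 - hit_ratio) * t_memory

print(effective_access_time(0.98, 1.0, 100.0))  # ~2.98 ns, as above
```

Note how steeply the result degrades: dropping the hit ratio from 0.98 to 0.90 raises EAT from about 2.98 ns to 10.9 ns, because the 100 ns miss path dominates.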

Main memory, virtual memory and the TLB

Main memory (RAM) is larger and slower than cache and is the primary workspace for programs. When main memory is insufficient for all active data, the system uses virtual memory to give the illusion of a much larger address space by storing some pages on disk and bringing them into RAM on demand. This introduces page faults when a referenced page is not in main memory and must be loaded from disk.

The Translation Lookaside Buffer (TLB) is a small, fast cache that stores recent virtual-to-physical page translations; the TLB sits between the CPU and the page table and greatly speeds up address translation. The TLB itself is part of the memory hierarchy and obeys the same hit/miss and locality principles as caches.
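The translation path can be sketched as follows. This is a deliberately simplified model (the dictionaries, the 4 KiB page size, and the mappings are all assumptions for illustration): a real TLB is a hardware structure with limited capacity, and a real miss may trigger a page fault rather than a simple table lookup.

```python
PAGE_SIZE = 4096                   # assumed 4 KiB pages
page_table = {0: 7, 1: 3, 2: 9}    # virtual page number -> physical frame
tlb = {}                           # recently used translations

def translate(vaddr):
    """Translate a virtual address, consulting the TLB before the page table."""
    vpn, offset = divmod(vaddr, PAGE_SIZE)
    if vpn not in tlb:             # TLB miss: walk the page table
        tlb[vpn] = page_table[vpn]
    return tlb[vpn] * PAGE_SIZE + offset

print(translate(4100))  # page 1, offset 4 -> frame 3 -> 12292
```

Subsequent accesses to the same page hit in the TLB and skip the page-table walk entirely, which is the same hit/miss structure as a data cache.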

Distinction between cache and virtual memory: caches operate on blocks (cache lines) and speed up access to main memory; virtual memory operates on pages and provides a large address space backed by disk. Both use similar ideas (locality, caching) but serve different purposes and operate at different granularities and levels.

Auxiliary memory and trade-offs

Auxiliary devices (disk, optical, tape) provide persistent storage at much lower cost per bit than main memory and cache. Designers decide how much to invest at each level of the hierarchy by weighing the trade-offs: increasing capacity often increases average access time but reduces cost per bit; increasing speed reduces capacity available at a given cost.

Therefore, designers usually place small, fast memories as close to the CPU as practical (registers and caches), larger and slower memories further away (main memory and disk), and very large but slow, cheap media at the bottom for archival storage (tape and optical). Multi-level caches (L1, L2, L3) are common to smooth the performance/cost curve.

Design considerations and additional topics

  • Inclusion and exclusion - multi-level cache designs may enforce that an upper-level cache's contents are included in the lower-level cache (inclusive) or kept exclusive; each choice has performance and coherence implications.
  • Cache coherence - in multiprocessor systems, caches must be kept coherent so that copies of a memory location across different caches reflect a consistent value; coherence protocols (MESI, MOESI, etc.) are used.
  • Block size - larger cache blocks exploit spatial locality but increase miss penalty and can raise conflict misses; block size is a key design parameter.
  • Associativity - higher associativity reduces conflict misses but increases access time and complexity; set-associative caches balance these considerations.
  • Cost-performance balance - overall system performance depends on hit ratios, miss penalties, and the relative speeds and costs of each level; designers use simulation and workload analysis to choose sizes and policies.
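The cost-performance balance across levels is usually evaluated with a multi-level extension of the EAT formula. A minimal sketch for two cache levels, using assumed timings and miss rates:

```python
def amat(t_l1, miss_l1, t_l2, miss_l2, t_mem):
    """Average memory access time for a two-level cache hierarchy:
    AMAT = t_L1 + miss_L1 * (t_L2 + miss_L2 * t_mem)."""
    return t_l1 + miss_l1 * (t_l2 + miss_l2 * t_mem)

# Assumed figures: 1 ns L1 with 5% misses, 10 ns L2 with 20% misses, 100 ns RAM.
print(amat(1, 0.05, 10, 0.20, 100))  # 1 + 0.05 * (10 + 20) = 2.5 ns
```

The L2 cache absorbs most L1 misses cheaply, so the average stays close to the L1 access time even though main memory is 100 times slower.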

Summary

The memory hierarchy organises storage into multiple levels so that frequently used data is available quickly and infrequently used data is stored cheaply. Key concepts are locality of reference, cache organisation (mapping, replacement, write policies), performance metrics (hit ratio, miss penalty, effective access time), and the interaction of cache, main memory and virtual memory. Understanding these topics allows designers and programmers to make choices that improve observed performance while controlling cost and capacity.

The document The Memory Hierarchy is a part of the Computer Science Engineering (CSE) Course Computer Architecture & Organisation (CAO).

FAQs on The Memory Hierarchy

1. What is the memory hierarchy in computer science engineering?
Ans. The memory hierarchy in computer science engineering refers to the organization and structure of different levels of memory within a computer system. It consists of multiple levels, including registers, cache, main memory (RAM), and secondary storage (hard drives or solid-state drives). These levels are arranged in a hierarchy based on their proximity to the processor and their speed and cost characteristics.
2. How does the memory hierarchy improve computer performance?
Ans. The memory hierarchy improves computer performance by exploiting the principle of locality. This principle states that programs tend to access a small portion of the available memory at any given time. The memory hierarchy places frequently accessed data and instructions in faster and more expensive levels of memory, such as cache, closer to the processor. This reduces the average access time and enhances the overall system performance.
3. What is the role of cache memory in the memory hierarchy?
Ans. Cache memory plays a crucial role in the memory hierarchy. It is a small, high-speed memory located between the processor and main memory. Cache memory stores frequently accessed data and instructions from the main memory to provide faster access to the processor. By keeping a copy of frequently used data closer to the processor, cache memory reduces the average memory access time and improves system performance.
4. How is data transferred between different levels of the memory hierarchy?
Ans. Data moves between levels in fixed-size units on demand: cache lines between cache and main memory, and pages between main memory and disk. When the processor requests data that is not present at a given level (a miss), the required block is fetched from the next level down and retained for future reuse. Modified data is propagated back according to the write policy - immediately under write-through, or at eviction under write-back. In multiprocessor systems, cache coherence protocols additionally keep the copies held in different caches consistent.
5. What are the trade-offs involved in designing the memory hierarchy?
Ans. Designing the memory hierarchy involves several trade-offs. One trade-off is between speed and cost. Faster and smaller memory levels, such as registers and cache, are more expensive than larger and slower memory levels, such as main memory and secondary storage. Another trade-off is between capacity and latency. Larger memory levels provide more storage capacity but may have higher access latency. Designers need to balance these trade-offs to optimize the overall system performance and cost efficiency.