# Lecture 16 Virtual Memory

CS213 – Intro to Computer Systems Branden Ghena – Winter 2023

Slides adapted from:

St-Amour, Hardavellas, Bustamente (Northwestern), Bryant, O'Hallaron (CMU), Garcia, Weaver (UC Berkeley)

#### Administrivia

- Homework 4 due tonight
  - Very similar problems will be on the exam

- SETI Lab
  - Due next week Wednesday
  - See the pinned Piazza posts
    - Getting Started
    - Testing Performance

# Today's Goals

Understand goals and application of virtual memory

Explore how virtual memory resolves memory problems

Practice translating virtual addresses to physical addresses

- Bonus: Practice problems at the end
  - Also some bonus details on multi-level page tables we won't test you on

#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

# The Illusion!



# The Reality!



# Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?



Both processes assume they start at the beginning of RAM and use as much as they need









There's enough RAM for both. Why should we have to swap?

Challenge here is that programs are compiled with specific addresses...

# Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?
  - 2. How do we move memory around?



Process A Process A Process C **RAM CPU** Process C

Process A



Process A Process A Process C Process C Process D **CPU** There we go! Process D

Process A



# Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?
  - 2. How do we move memory around?
  - 3. How do we support processes bigger than RAM?

# Processes might be bigger than RAM



### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?
  - 2. How do we move memory around?
  - 3. How do we support processes bigger than RAM?
  - 4. How do we protect processes from each other?

#### Processes can't be trusted



### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?
  - 2. How do we move memory around?
  - 3. How do we support processes bigger than RAM?
  - 4. How do we protect processes from each other?
  - 5. How do we deal with how incredibly slow disk is?

### Computing timescales

• Assuming 4 GHz processor, Instruction (with registers): 0.25 ns

#### Jeff Dean (Google AI): "Numbers Everyone Should Know"

#### Jim Gray's analogy:

- Registers are in your apartment
- Disk is on Mars

| L1 cache reference                 | 0.          | .5 ns |
|------------------------------------|-------------|-------|
| Branch mispredict                  | 5           | ns    |
| L2 cache reference                 | 7           | ns    |
| Mutex lock/unlock                  | 25          | ns    |
| Main memory reference              | 100         | ns    |
| Compress 1K bytes with Zippy       | 3,000       | ns    |
| Send 2K bytes over 1 Gbps network  | 20,000      | ns    |
| Read 1 MB sequentially from memory | 250,000     | ns    |
| Round trip within same datacenter  | 500,000     | ns    |
| Disk seek                          | 10,000,000  | ns    |
| Read 1 MB sequentially from disk   | 20,000,000  | ns    |
| Send packet CA->Netherlands->CA    | 150,000,000 | ns    |

# Caching disks



### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?
  - 2. How do we move memory around?
  - 3. How do we support processes bigger than RAM?
  - 4. How do we protect processes from each other?
  - 5. How do we deal with how incredibly slow disk is?
- Virtual memory addresses all of these problems!

#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

### Virtual memory concept

Disconnect reality of RAM from illusion of main memory

- Processes work with the illusion
  - They use virtual addresses to reference where their memory is
- Computer (and OS) work with the reality
  - They use physical addresses that are real locations in RAM
- The hardware/OS translates virtual addresses into physical addresses

### A system using physical addresses

- Main memory An array of M contiguous byte-sized cells, each with a unique physical address
- Physical addressing
  - Most natural way to access it
    - Addresses used by the CPU correspond to bytes in memory
  - Used in simple systems like early PCs and embedded microcontrollers



# A system using virtual addresses

- The CPU generates virtual address
  - Address translation is done by dedicated hardware (memory management unit) via OS-managed lookup table (a Page Table)
  - Resulting physical address is used to access memory hierarchy

#### Modern processors use virtual addresses

 All addresses your programs work with are virtual!



Main memory

# Your experiences with Virtual Memory

- In Attack Lab, what was the address of touch2?
  - 0x40000-ish, right?
  - The same each time you run it too
- But multiple of you were running separate ctarget processes at the same time on Moore
  - 0x40000-ish was a Virtual Address
- Really, each process's code was at a totally different Physical Address in Moore's actual RAM

# Virtual Memory

- From here on out, we'll be working with two different memory spaces:
  - Virtual Memory (VM): A large (~infinite) space that a process believes it, and only it, has access to
  - Physical Memory (PM): The limited RAM space your computer must share among all processors

- This idea is independent of physical caches
  - There are still multiple layers of memory caches in the CPU
  - They might use virtual or physical addresses
    - We'll usually assume caches use physical addresses for this class

# virtual memory

your computer has physical memory



physical memory has addresses, like

O-8GB
but when your program
references an address
like 0x 5c69a2a2,
that's not a physical
memory address!

It's a virtual address.



Linux keeps a mapping from virtual memory pages to physical memory pages called the page table



PID virtual addr physical addr 1971 0x 20000 0x 192000 2310 0x 20000 0x 22 8000 2310 0x 21000 0x 9788000 when your program accesses a virtual address



every time you switch which process is running, Linux needs to switch the page table



#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

#### File data is stored on disk

- Physical memory: how much RAM does a computer actually have
  - These days, a handful of gigabytes is typical
- Address space: 2<sup>64</sup> possible addresses on x86-64!
  - That's 2<sup>64</sup> possible bytes of memory, or *17,179,869,184* GB!
  - (In reality, architecture limits addresses to 48 bits, soon 57. Still huge!)
- Across all your programs, may need more data than fits in physical memory, so some of it is in disk

### VM as a Tool for Caching

- Solution: use physical memory as a cache! (called: DRAM cache)
  - Store the bulk of your data on disk (very large, very cheap, but very slow)
  - And store the currently-used data in main memory (very fast by comparison)
  - Get the best of both worlds! Large capacity and fast access!

- DRAM cache organization driven by the enormous miss penalty
  - DRAM is about 100x slower than SRAM
  - Disk is about 100,000x slower than DRAM

#### Picking Cache Design Parameters

- Block size?
  - Disks are better at transferring large chunks of data (the first byte incurs a long delay, the rest come really fast afterwards)
  - Large block size: typically 4-8 KB
- Associativity?
  - DRAM cache misses incur enormous penalties; have to go to disk. Yikes!
  - Associativity is high to minimize miss rate
    - Fully associative (one huge set): any block can go anywhere in cache
    - Requires a "large" mapping function but managed in software, so ok
- Write-back or write-through?
  - Disk cannot keep up with a firehose of small writes
  - Use write-back (only write to disk when a page is evicted)
- Replacement algorithms?
  - Not limited by hardware; hardware strongly favors simple methods
  - Highly sophisticated, expensive, open-ended replacement algorithms

#### DRAM Cache Analogy to Cache Memory



#### Locating an object in DRAM Cache: Page Tables

- A page table maps virtual pages to physical pages
  - One page table entry (PTE) per virtual page (possible page in VM)
    - Fully-associative → one big set. Use the "tag" to index into table!
  - Each PTE specifies either a physical page (in DRAM) or a disk address
    - Valid bit tells us which
    - Instead of data in the "cache block", "pointer" to where data is



#### Page Hit

• *Page hit:* reference to a VM word that is in physical memory (hit on the DRAM cache)



## Page Fault

 Page fault: reference to VM word that is not in physical memory (DRAM cache miss)



• Page miss causes page fault (a HW exception, OS code kicks in to handle)



- Page miss causes page fault (a HW exception, OS code kicks in to handle)
- Page fault handler selects a victim to be evicted (here VP 4)



- Page miss causes page fault (a HW exception, OS code kicks in to handle)
- Page fault handler selects a victim to be evicted (here VP 4)
- The victim page is swapped with the disk block of the requested address



Offending instruction is restarted: page hit this time!



## Locality saves the day (as usual)

- At any point in time, programs tend to access a small set of active virtual pages called the working set
  - Programs with higher temporal locality will have smaller working sets
- If (working set size < main memory size)</li>
  - High performance for one process after compulsory misses
  - Fully-associative cache, so no conflicts. Only capacity matters.
  - Life is good!
- If (SUM(working set sizes) > main memory size)
  - Thrashing: Performance meltdown where pages are swapped to and from disk continuously
  - When cache memory is thrashing, CPU runs at the speed of memory. Ow.
  - When virtual memory is thrashing, CPU runs at the speed of disk. Yikes!
    - · Hope you enjoy the commute to Pluto. Because that's where your data is

#### Break + Question

- Computer has:
  - 8 pages of Virtual Memory
  - 4 pages of Physical Memory
- How many entries (rows) does a page table have?
- How many entries can be valid at any time?

#### Break + Question

- Computer has:
  - 8 pages of Virtual Memory
  - 4 pages of Physical Memory
- How many entries (rows) does a page table have?
   8 entries
- How many entries can be valid at any time?
  4 valid
- Page Table translates Virtual to Physical
  - It needs an entry for each virtual page, so 8 entries
  - Rows are valid if they point at physical memory
    - So only four entries can be valid (unless they share a physical page)

#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

#### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get?
  - 2. How do we move memory around?
  - 3. How do we support processes bigger than RAM?
  - 4. How do we protect processes from each other?
  - 5. How do we deal with how incredibly slow disk is? ✓
    Use RAM as a cache for disk

## Which addresses do processes get?

- Programs can use whatever virtual addresses they want
  - Usually a fixed mapping for a given OS

- OS controls physical addresses
  - Decides which parts of RAM are used for which things



#### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get? ✓ Whatever virtual addresses they want
  - 2. How do we move memory around?
  - 3. How do we support processes bigger than RAM?
  - 4. How do we protect processes from each other?
  - 5. How do we deal with how incredibly slow disk is? ✓
    Use RAM as a cache for disk

#### How do we move memory around?

 Just change the page table entry!



#### How do we move memory around?

- Just change the page table entry!
  - Same virtual address points at a different physical address
- Usually only happens when pages are swapped to disk and then later brought back



#### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get? ✓ Whatever virtual addresses they want
  - How do we move memory around? ✓
     Update page table entries
  - 3. How do we support processes bigger than RAM?
  - 4. How do we protect processes from each other?
  - 5. How do we deal with how incredibly slow disk is? ✓
    Use RAM as a cache for disk

#### How do we support processes bigger than RAM?

 Just leave some pages for that process on disk

 Page table entry still exists for each virtual page

 Hopefully working set is smaller than program memory



#### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get? ✓ Whatever virtual addresses they want
  - How do we move memory around? ✓
     Update page table entries
  - 3. How do we support processes bigger than RAM? ✓ Leave some pages on disk
  - 4. How do we protect processes from each other?
  - 5. How do we deal with how incredibly slow disk is? ✓
    Use RAM as a cache for disk

#### How do we protect processes from each other?

- Each process has separate virtual memory spaces
  - No way to access another process's physical memory unless it is mapped to one of your virtual addresses



#### Enabling shared libraries

 We could share some physical pages across processes to enable shared libraries or shared memory



VM as a Tool for Memory Protection

- What if we want better protection?
  - Mark a page as read-only
  - Keep a page in memory, but only the OS can touch it
- Extend PTEs with permission bits!
  - Page fault handler checks these before remapping
- HW enforces this protection (trap into OS if violation occurs)



Process j

#### Memory problems

- What are the challenges to supporting this reality?
  - 1. Which addresses does each process get? ✓ Whatever virtual addresses they want
  - How do we move memory around? ✓
     Update page table entries
  - 3. How do we support processes bigger than RAM? ✓ Leave some pages on disk
  - 4. How do we protect processes from each other? ✓
    Don't overlap virtual address spaces + permission bits
  - 5. How do we deal with how incredibly slow disk is? ✓
    Use RAM as a cache for disk

#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

#### **Address Translation**

- Goal: Given virtual address, find corresponding physical address
  - (Or get a page fault if the page is not in memory)
  - Translation done by Memory Management Unit (hardware)
  - But mapping itself is maintained by OS (software)
    - Just a table in memory!
- Basic Parameters
  - $N = 2^n$ : Number of addresses in virtual address space
  - $M = 2^m$ : Number of addresses in physical address space.  $m \le n$  (usually much less)
  - $P = 2^p$ : Page size (bytes)
- Components of the virtual address (VA)
  - Virtual page number (VPN)
  - Page Offset
- Components of the physical address (PA)
  - Physical page number (PPN)
  - Page Offset (same offset as VA)

#### Address Translation With a Page Table



#### Memory Access: Page Hit



- 1) Processor sends virtual address to MMU
- 2-3) MMU fetches PTE from page table in cache/memory
- 4) MMU sends physical address to cache/memory
- 5) Cache/memory sends data word to processor

#### Memory Access: Page Fault



- 2-3) MMU fetches PTE from page table in cache/memory
- 4) Valid bit is zero, so MMU triggers page fault exception
- 5) Handler identifies victim (and, if dirty, pages it out to disk)
- 6) Handler pages in new page and updates PTE in memory
- 7) Handler returns to original process, restarting faulting instruction

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

Mapping can be anything, which is bigger doesn't really matter!

1. How do we split Virtual Addresses into VPN and Offset?

| 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|----|----|---|---|---|---|---|---|---|---|---|---|
|    |    |   |   |   |   |   |   |   |   |   |   |

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

Mapping can be anything, which is bigger doesn't really matter!

- 1. How do we split Virtual Addresses into VPN and Offset?
  - Offset is based on page size: 64-bytes ⇒ 6 bits. All the rest are VPN

| 11 | 10   | 9       | 8      | 7   | 6 | 5           | 4 | 3 | 2 | 1 | 0 |
|----|------|---------|--------|-----|---|-------------|---|---|---|---|---|
|    | Virt | ual Pag | ge Num | ber |   | Page Offset |   |   |   |   |   |

2. How big are Physical Page Numbers?

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

Mapping can be anything, which is bigger doesn't really matter!

- 1. How do we split Virtual Addresses into VPN and Offset?
  - Offset is based on page size: 64-bytes ⇒ 6 bits. All the rest are VPN

| 11 | 10   | 9       | 8      | 7   | 6 | 5           | 4 | 3 | 2 | 1 | 0 |
|----|------|---------|--------|-----|---|-------------|---|---|---|---|---|
|    | Virt | ual Pag | ge Num | ber |   | Page Offset |   |   |   |   |   |

2. How big are Physical Page Numbers? 16-6 = 10 bits

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                      |    |    |    | 11 | 10 | 9 | 8 | 7 | 6 | 5   | 4    | 3     | 2  | 1 | 0 |
|----------------------|----|----|----|----|----|---|---|---|---|-----|------|-------|----|---|---|
| Virtual Page Number  |    |    |    |    |    |   |   | • |   | Pa  | age  | Offse | et |   |   |
| 15                   | 14 | 13 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5   | 4    | 3     | 2  | 1 | 0 |
| Physical Page Number |    |    |    |    |    |   | • |   | P | age | Offs | et    |    |   |   |

- Translate:
- Virtual address: 0x3F0
  - VPN:
  - Offset:

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                          |    |    |    | 11 | 10 | 9 | 8 | 7   | 6     | 5   | 4    | 3  | 2 | 1 | 0 |
|--------------------------|----|----|----|----|----|---|---|-----|-------|-----|------|----|---|---|---|
| Virtual Page Number Page |    |    |    |    |    |   |   | age | Offse | et  |      |    |   |   |   |
| 15                       | 14 | 13 | 12 | 11 | 10 | 9 | 8 | 7   | 6     | 5   | 4    | 3  | 2 | 1 | 0 |
| Physical Page Number     |    |    |    |    |    |   |   |     | P     | age | Offs | et |   |   |   |

- Translate:
- Virtual address: 0x3F0
  - VPN: 0b001111
  - Offset: 0b110000

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                           |  |  |  | 11 | 10 | 9 | 8 | 7 | 6 | 5   | 4     | 3  | 2 | 1 | 0 |
|---------------------------|--|--|--|----|----|---|---|---|---|-----|-------|----|---|---|---|
| Virtual Page Number       |  |  |  |    |    |   | • |   | P | age | Offse | et |   |   |   |
| 15 14 13 12 11 10 9 8 7 6 |  |  |  |    |    |   |   | 6 | 5 | 4   | 3     | 2  | 1 | 0 |   |
| Physical Page Number      |  |  |  |    |    |   |   |   | P | age | Offs  | et |   |   |   |

- Translate:
- Virtual address: 0x3F0
  - VPN: 0b001111
  - Offset: 0b110000

| VPN  | PPN   | Valid |
|------|-------|-------|
| 0x00 | 0x123 | 1     |
| 0x01 | 0x156 | 1     |
| 0x02 | 0x143 | 1     |
| 0x03 | 0x16F | 1     |
| 0x04 | 0x1FF | 0     |
| 0x05 | 0x107 | 0     |
| 0x06 | 0x100 | 0     |
| 0x07 | 0x1C0 | 0     |
| 0x08 | 0x1D8 | 0     |
| 0x09 | 0x1BF | 0     |
| 0x0A | 0x000 | 1     |
| 0x0B | 0x3FF | 1     |
| 0x0C | 0x308 | 0     |
| 0x0D | 0x3FD | 0     |
| 0x0E | 0x111 | 1     |
| 0x0F | 0x1F0 | 1     |

| VPN          | PPN   | Valid |  |  |  |
|--------------|-------|-------|--|--|--|
| 0x10         | 0x237 | 1     |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |
| Continues on |       |       |  |  |  |

- PPN:
- Offset:

### Virtual memory example

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                                  |                           |  |  | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|----------------------------------|---------------------------|--|--|----|----|---|---|---|---|---|---|---|---|---|---|
| Virtual Page Number Page Offset  |                           |  |  |    |    |   |   |   |   |   |   |   |   |   |   |
| 15                               | 15 14 13 12 11 10 9 8 7 6 |  |  |    |    |   |   | 6 | 5 | 4 | 3 | 2 | 1 | 0 |   |
| Physical Page Number Page Offset |                           |  |  |    |    |   |   |   |   |   |   |   |   |   |   |

- Translate:
- Virtual address: 0x3F0
  - VPN: 0b001111
  - Offset: 0b110000

| VPN  | PPN   | Valid |
|------|-------|-------|
| 0x00 | 0x123 | 1     |
| 0x01 | 0x156 | 1     |
| 0x02 | 0x143 | 1     |
| 0x03 | 0x16F | 1     |
| 0x04 | 0x1FF | 0     |
| 0x05 | 0x107 | 0     |
| 0x06 | 0x100 | 0     |
| 0x07 | 0x1C0 | 0     |
| 0x08 | 0x1D8 | 0     |
| 0x09 | 0x1BF | 0     |
| 0x0A | 0x000 | 1     |
| 0x0B | 0x3FF | 1     |
| 0x0C | 0x308 | 0     |
| 0x0D | 0x3FD | 0     |
| 0x0E | 0x111 | 1     |
| 0x0F | 0x1F0 | 1     |

| VPN          | PPN   | Valid |  |  |  |  |  |
|--------------|-------|-------|--|--|--|--|--|
| 0x10         | 0x237 | 1     |  |  |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |  |  |
| Continues on |       |       |  |  |  |  |  |

• PPN: 0b01 1111 0000

• Offset: 0b110000

Physical address:

### Virtual memory example

#### Parameters

- Virtual addresses are 12-bits
- Physical addresses are 16-bits
- Page size is 64 bytes

|                                 |  |  |  | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4   | 3    | 2  | 1 | 0 |
|---------------------------------|--|--|--|----|----|---|---|---|---|---|-----|------|----|---|---|
| Virtual Page Number Page Offset |  |  |  |    |    |   |   |   |   |   |     |      |    |   |   |
| 15 14 13 12 11 10 9 8 7 6       |  |  |  |    |    |   | 6 | 5 | 4 | 3 | 2   | 1    | 0  |   |   |
| Physical Page Number            |  |  |  |    |    |   |   | • |   | P | age | Offs | et |   |   |

#### • Translate:

Virtual address: 0x3F0

• VPN: 0b001111

• Offset: 0b110000

| PPN   | Valid                                                                                           |
|-------|-------------------------------------------------------------------------------------------------|
| 0x123 | 1                                                                                               |
| 0x156 | 1                                                                                               |
| 0x143 | 1                                                                                               |
| 0x16F | 1                                                                                               |
| 0x1FF | 0                                                                                               |
| 0x107 | 0                                                                                               |
| 0x100 | 0                                                                                               |
| 0x1C0 | 0                                                                                               |
| 0x1D8 | 0                                                                                               |
| 0x1BF | 0                                                                                               |
| 0x000 | 1                                                                                               |
| 0x3FF | 1                                                                                               |
| 0x308 | 0                                                                                               |
| 0x3FD | 0                                                                                               |
| 0x111 | 1                                                                                               |
| 0x1F0 | 1                                                                                               |
|       | 0x123 0x156 0x143 0x16F 0x16F 0x107 0x100 0x100 0x1C0 0x1D8 0x1BF 0x000 0x3FF 0x308 0x3FD 0x111 |

| VPN          | PPN   | Valid |  |  |  |  |  |
|--------------|-------|-------|--|--|--|--|--|
| 0x10         | 0x237 | 1     |  |  |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |  |  |
| Continues on |       |       |  |  |  |  |  |

• PPN: 0b01 1111 0000

• Offset: 0b110000

Physical address:

• 0b0111110000110000

0x7C30

### Break + Question

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                                 |    |                        |  | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4   | 3    | 2  | 1 | 0 |
|---------------------------------|----|------------------------|--|----|----|---|---|---|---|---|-----|------|----|---|---|
| Virtual Page Number Page Offset |    |                        |  |    |    |   |   |   |   |   |     |      |    |   |   |
| 15                              | 14 | 14 13 12 11 10 9 8 7 6 |  |    |    |   |   | 6 | 5 | 4 | 3   | 2    | 1  | 0 |   |
| Physical Page Number            |    |                        |  |    |    |   |   |   |   | P | age | Offs | et |   |   |

- Translate:
- Virtual address: 0x500
  - VPN:
  - Offset:

| VPN  | PPN   | Valid |
|------|-------|-------|
| 0x00 | 0x123 | 1     |
| 0x01 | 0x156 | 1     |
| 0x02 | 0x143 | 1     |
| 0x03 | 0x16F | 1     |
| 0x04 | 0x1FF | 0     |
| 0x05 | 0x107 | 0     |
| 0x06 | 0x100 | 0     |
| 0x07 | 0x1C0 | 0     |
| 0x08 | 0x1D8 | 0     |
| 0x09 | 0x1BF | 0     |
| 0x0A | 0x000 | 1     |
| 0x0B | 0x3FF | 1     |
| 0x0C | 0x308 | 0     |
| 0x0D | 0x3FD | 0     |
| 0x0E | 0x111 | 1     |
| 0x0F | 0x1F0 | 1     |

| VPN          | PPN   | Valid |  |  |  |  |  |
|--------------|-------|-------|--|--|--|--|--|
| 0x10         | 0x237 | 1     |  |  |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |  |  |
| Continues on |       |       |  |  |  |  |  |

- PPN:
- Offset:
- Physical address:

### Break + Question

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                                 |    |    |    | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4   | 3    | 2  | 1 | 0 |
|---------------------------------|----|----|----|----|----|---|---|---|---|---|-----|------|----|---|---|
| Virtual Page Number Page Offset |    |    |    |    |    |   |   |   |   |   |     |      |    |   |   |
| 15                              | 14 | 13 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4   | 3    | 2  | 1 | 0 |
| Physical Page Number            |    |    |    |    |    |   |   |   |   | P | age | Offs | et |   |   |

- Translate:
- Virtual address: 0x500
  - VPN: 0b010100
  - Offset: 0b000000

| VPN  | PPN   | Valid |
|------|-------|-------|
| 0x00 | 0x123 | 1     |
| 0x01 | 0x156 | 1     |
| 0x02 | 0x143 | 1     |
| 0x03 | 0x16F | 1     |
| 0x04 | 0x1FF | 0     |
| 0x05 | 0x107 | 0     |
| 0x06 | 0x100 | 0     |
| 0x07 | 0x1C0 | 0     |
| 0x08 | 0x1D8 | 0     |
| 0x09 | 0x1BF | 0     |
| 0x0A | 0x000 | 1     |
| 0x0B | 0x3FF | 1     |
| 0x0C | 0x308 | 0     |
| 0x0D | 0x3FD | 0     |
| 0x0E | 0x111 | 1     |
| 0x0F | 0x1F0 | 1     |

| VPN          | PPN   | Valid |  |  |  |  |  |
|--------------|-------|-------|--|--|--|--|--|
| 0x10         | 0x237 | 1     |  |  |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |  |  |
| Continues on |       |       |  |  |  |  |  |

• PPN: INVALID

Offset:

- Physical address:
  - Page Fault

### Break + Practice again

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                                             |  |  |  | 11 | 10      | 9    | 8   | 7    | 6 | 5   | 4    | 3   | 2     | 1  | 0 |
|---------------------------------------------|--|--|--|----|---------|------|-----|------|---|-----|------|-----|-------|----|---|
|                                             |  |  |  | Vi | rtual F | Page | Nur | nber | • |     | Pa   | age | Offse | et |   |
| 15   14   13   12   11   10   9   8   7   6 |  |  |  |    |         |      | 6   | 5    | 4 | 3   | 2    | 1   | 0     |    |   |
| Physical Page Number                        |  |  |  |    |         |      |     |      | P | age | Offs | et  |       |    |   |

- Translate:
- Virtual address: 0x0D6
  - VPN:
  - Offset:

| VPN  | PPN   | Valid |
|------|-------|-------|
| VEIN | FFIN  | valiu |
| 0x00 | 0x123 | 1     |
| 0x01 | 0x156 | 1     |
| 0x02 | 0x143 | 1     |
| 0x03 | 0x16F | 1     |
| 0x04 | 0x1FF | 0     |
| 0x05 | 0x107 | 0     |
| 0x06 | 0x100 | 0     |
| 0x07 | 0x1C0 | 0     |
| 0x08 | 0x1D8 | 0     |
| 0x09 | 0x1BF | 0     |
| 0x0A | 0x000 | 1     |
| 0x0B | 0x3FF | 1     |
| 0x0C | 0x308 | 0     |
| 0x0D | 0x3FD | 0     |
| 0x0E | 0x111 | 1     |
| 0x0F | 0x1F0 | 1     |

| VPN          | PPN   | Valid |  |  |  |  |
|--------------|-------|-------|--|--|--|--|
| VEIN         | FFIN  | vallu |  |  |  |  |
| 0x10         | 0x237 | 1     |  |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |  |
| Continues on |       |       |  |  |  |  |

- PPN:
- Offset:
- Physical address:

### Break + Practice again

- Parameters
  - Virtual addresses are 12-bits
  - Physical addresses are 16-bits
  - Page size is 64 bytes

|                      |    |    |    | 11 | 10      | 9   | 8   | 7    | 6   | 5    | 4  | 3   | 2     | 1  | 0 |
|----------------------|----|----|----|----|---------|-----|-----|------|-----|------|----|-----|-------|----|---|
|                      |    |    |    | Vi | rtual F | age | Nur | nber | •   |      | P  | age | Offse | et |   |
| 15                   | 14 | 13 | 12 | 11 | 10      | 9   | 8   | 7    | 6   | 5    | 4  | 3   | 2     | 1  | 0 |
| Physical Page Number |    |    |    |    |         |     |     | P    | age | Offs | et |     |       |    |   |

- Translate:
- Virtual address: 0x0D6
  - VPN: 0b000011
  - Offset: 0b010110

| VPN  | PPN   | Valid |
|------|-------|-------|
| 0x00 | 0x123 | 1     |
| 0x01 | 0x156 | 1     |
| 0x02 | 0x143 | 1     |
| 0x03 | 0x16F | 1     |
| 0x04 | 0x1FF | 0     |
| 0x05 | 0x107 | 0     |
| 0x06 | 0x100 | 0     |
| 0x07 | 0x1C0 | 0     |
| 0x08 | 0x1D8 | 0     |
| 0x09 | 0x1BF | 0     |
| 0x0A | 0x000 | 1     |
| 0x0B | 0x3FF | 1     |
| 0x0C | 0x308 | 0     |
| 0x0D | 0x3FD | 0     |
| 0x0E | 0x111 | 1     |
| 0x0F | 0x1F0 | 1     |

| VPN          | PPN   | Valid |  |  |  |
|--------------|-------|-------|--|--|--|
| 0x10         | 0x237 | 1     |  |  |  |
| 0x11         | 0x236 | 1     |  |  |  |
| 0x12         | 0x2B0 | 1     |  |  |  |
| 0x13         | 0x280 | 0     |  |  |  |
| 0x14         | 0x120 | 0     |  |  |  |
| Continues on |       |       |  |  |  |

PPN: 0b010 110 1111

• Offset: 0b010110

- Physical address:
  - 0b0101101111010110
  - 0x5BD6

#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

#### Accessing page tables is slow

- Problem: page tables are in memory
  - And we need to access them to find our address to access memory
  - Two memory accesses per access!!!



- Page table entries (PTEs) are cached in L1, L2, etc, like any other data in memory
  - PTEs may be evicted by other data references. Oops.
  - PTE access still requires average effective memory access delay

#### Speeding up Translation with a TLB

- Solution: *Translation Lookaside Buffer* (TLB)
  - Small hardware cache memory inside MMU
  - Contains page table entries for a small number of pages
  - Maps virtual page numbers to physical page numbers
  - Reduces issues with data kicking PTEs out of caches!
- Like cache memories, uses set indices, tags, and valid bits
  - VPN split into: TLB tag and TLB index (just like caches, because it is one!)
  - No need for a block offset equivalent (PTEs have a single value)

#### TLB Hit



A TLB hit eliminates a memory access

#### TLB Miss



#### A TLB miss incurs an additional memory access (the PTE)

Fortunately, TLB misses are rare. Why? Localit

Locality. It's always locality.

#### Address translation process



#### **Outline**

- Memory Problems
- Virtual Memory Concept
- Main Memory as a Cache
- Memory Problems Solved
- Address Translation
- Caching Page Table Entries

#### **Outline**

• Bonus: Memory System Practice Problems

#### Simple Memory System Example

- Addressing
  - 14-bit virtual addresses
  - 12-bit physical address
  - Page size = 64 bytes



## Simple Memory System: Page Table

We only show a few entries (out of 256)

| VPN | PPN | Valid |  |  |
|-----|-----|-------|--|--|
| 00  | 28  | 1     |  |  |
| 01  | l   | 0     |  |  |
| 02  | 33  | 1     |  |  |
| 03  | 02  | 1     |  |  |
| 04  | _   | 0     |  |  |
| 05  | 16  | 1     |  |  |
| 06  | _   | 0     |  |  |
| 07  | _   | 0     |  |  |

| VPN | PPN | Valid |  |  |
|-----|-----|-------|--|--|
| 08  | 13  | 1     |  |  |
| 09  | 17  | 1     |  |  |
| 0A  | 09  | 1     |  |  |
| OB  | _   | 0     |  |  |
| 0C  | _   | 0     |  |  |
| 0D  | 2D  | 1     |  |  |
| 0E  | 11  | 1     |  |  |
| OF  | 0D  | 1     |  |  |

| VPN | PPN | Valid |
|-----|-----|-------|
| 2E  | -   | 0     |

• • •

•

### Simple Memory System: TLB

- 16 entries
- 4-way associative



| Set | Tag | PPN | Valid |
|-----|-----|-----|-------|-----|-----|-------|-----|-----|-------|-----|-----|-------|
| 0   | 03  | _   | 0     | 09  | 0D  | 1     | 00  | _   | 0     | 07  | 02  | 1     |
| 1   | 03  | 2D  | 1     | 02  | _   | 0     | 04  | _   | 0     | 0A  | _   | 0     |
| 2   | 02  | _   | 0     | 08  | _   | 0     | 06  | _   | 0     | 03  | _   | 0     |
| 3   | 07  | _   | 0     | 03  | 0D  | 1     | 0A  | 34  | 1     | 02  | _   | 0     |

#### Simple Memory System: L1 Cache

- 16 lines, 4-byte block size
- Physically addressed



| Idx | Tag | Valid | B3 | B2 | B1 | B0 |
|-----|-----|-------|----|----|----|----|
| 0   | 19  | 1     | 99 | 11 | 23 | 11 |
| 1   | 15  | 0     | 1  | 1  | 1  | _  |
| 2   | 1B  | 1     | 00 | 02 | 04 | 08 |
| 3   | 36  | 0     | -  | _  | _  | _  |
| 4   | 32  | 1     | 43 | 6D | 8F | 09 |
| 5   | 0D  | 1     | 1D | 72 | F0 | 36 |
| 6   | 31  | 0     | _  | _  | _  | _  |
| 7   | 16  | 1     | 11 | C2 | DF | 03 |

| Idx | Tag | Valid | В3 | B2 | B1 | В0 |
|-----|-----|-------|----|----|----|----|
| 8   | 24  | 1     | 3A | 00 | 51 | 89 |
| 9   | 2D  | 0     | 1  | 1  | 1  | _  |
| Α   | 2D  | 1     | 93 | 15 | DA | 3B |
| В   | OB  | 0     | ı  | -  | 1  | _  |
| С   | 12  | 0     | -  | _  | -  | _  |
| D   | 16  | 1     | 04 | 96 | 34 | 15 |
| Е   | 13  | 1     | 83 | 77 | 1B | D3 |
| F   | 14  | 0     | _  | _  | _  | _  |

#### Address Translation Example #1

(using the Page Table, TLB, and L1 cache shown in the preceding slides)

movb (%rcx), %al

Virtual Address: 0x03D4

Address space: 14-bit VAddr, 12-bit PAddr, 64-byte page

TLB: 16 entries, 4-way

L1 Cache: 16 lines, 4-byte block, direct mapped,

Physically addressed





#### Address Translation Example #2

(using the Page Table, TLB, and L1 cache shown in the preceding slides)

Hit? \_\_\_

Virtual Address: 0x0B8F



#### Physical Address

VPN <u>0x2</u>E



Byte: \_

Likely invalid page. Maybe needs to read from disk. Either way we don't know the PPN.

#### Address Translation Example #3

(using the Page Table, TLB, and L1 cache shown in the preceding slides)

Virtual Address: 0x0020



**VPN 0x00** 

TLBI 0 TLBT 0x00

TLB Hit? N Page Fault? N

PPN: **0x28** 

#### Physical Address



BO 0

CI **0**x8

CT **0**x**28** 

Hit? N

Byte: Mem

Cache miss, so needs to read byte values from main memory

### **Outline**

• Bonus: Multi-level Page Tables

#### Multi-Level Page Tables

- Suppose:
  - 4KB (2<sup>12</sup>) page size, 48-bit address space, 8-byte PTE
- How big is the page table?
  - Would need a 512 GB page table!
    2<sup>48</sup> \* 2<sup>-12</sup> \* 2<sup>3</sup> = 2<sup>39</sup> bytes
  - That's just meta-data!
     Where does the data go?
- Common solution:
  - Multi-level page tables
  - Split the VPN into multiple pieces, 1 per level
  - Example: 2-level page table
    - Level 1 table: each PTE points to a level 2 page table (always memory resident)
    - Level 2 table: each PTE points to a page (paged in and out like any other data, maybe not even allocated!)



# A Two-Level Page Table Hierarchy 32 bit addresses, 4KB pages, 4-byte PTEs



#### Multi-level page table: Core i7



#### End-to-end Core i7 Data Address Translation

