License: https://creativecommons.org/licenses/by-nc-nd/4.0/ Computer Architecture

### Purpose:

# 8 Memory Management, Virtual Memory

Providing a <u>continuous</u> (linear) address space to users/processes that is <u>larger</u> than physical memory.

This gives each process the illusion that the system has a separate, continuous, large memory, dedicated only to itself and no other processes are executing or consuming resources.

- Providing security/protection on memory blocks.
- · Sharing program binaries and application data in multiuser/multiprogram systems.

The memory hierarchy Speed (+) Cost (+) CPU' Cache Memory Main Size (+ Memory DISK

- The concept of virtual memory is in principle similar to that of cache memory.
- The relationship between virtual memory and main memory is very similar to the relationship between main memory and cache (Locality!)
- Using virtual memory, a computer can address more memory than the amount physically installed on the system, and it uses the hard disk to hold the excess.
- Programs and data are stored on disk; program instructions and data are brought to main memory as and when required.

http://akademi.itu.edu.tr/en/buzluca http://www.buzluca.info

@ ⊕ ⊕

2013 - 2020 Feza BUZLUCA

Computer Architecture

# Addresses and memories:

- · Programs are written and addresses are generated according to virtual memory (linear and large).
- The address generated by the CPU during program execution is called a logical address or virtual address.

In systems with virtual memory, the CPU generates virtual addresses.

- The set of virtual addresses (the space of the virtual memory) is called a virtual address space (or simply, address space).
- An address in main memory is called a physical address.
- The set of addresses used by main memory is called the memory space.

Virtual Memory Main Memory Physical Address Virtual Address Memory Space Address Space http://akademi.itu.edu.tr/en/buzluca

http://www.buzluca.info

@090

2013 - 2020 Feza BUZLUCA



• Each running process generates addresses as if it has the entire machine to itself (as if the computer offers an extremely large and linear memory).

### Main advantages:

- The job of the programmer and compiler is simplified, because no details of the hardware or memory organization are necessary to build a program.
- The program can be compiled independently from the properties of the physical memory.





Due to locality of reference, the process is expected to spend a certain amount

<u>@0</u>99

2013 - 2020 Feza BUZLUCA

time on the same page. http://akademi.itu.edu.tr/en/buzluca

http://www.buzluca.info







Computer Architecture

### Shared Memory:

- Normally, address spaces of two processes are protected from each other.
- The shared memory mechanism allows two address spaces to intersect at some points to provide interprocess communication.
- · Shared pages map to the same physical page.
- The same page frame number is placed for the shared page in the page tables
  of the two processes sharing that page. Two different virtual addresses map to
  the same physical address.



Computer Architecture

# Speeding Up Address Translation (Translation Lookaside Buffer - TLB) :

- Because of the lookup in the page table, each "virtual address physical address" translation requires an extra memory access.
  - 1. Access to page table to get the PF number
  - 2. Access to physical memory
- To make the translation faster, the most frequently used pages (locality!) and their descriptors (main memory block number and access rights) are stored in an associative memory (cached), called the Translation Lookaside Buffer (TLB). (Slide 8.11)
- When a virtual address is generated, the hardware searches the TLB for the virtual address's mapping.
  - If the mapping exists in the TLB, the hardware can translate the address without using the page table (faster).
  - Otherwise (TLB miss), the necessary mapping information is obtained from the page table and loaded into the TLB.
- When the TLB is full, a replacement algorithm must be used (such as FIFO or LRU).

http://akademi.itu.edu.tr/en/buzluca http://www.buzluca.info



2013 - 2020 Feza BUZLUCA



### Computer Architecture

### Page Replacement

If the referenced page is not currently in main memory and all blocks of the memory are occupied, then a block in main memory must be replaced with a page from virtual memory according to a page replacement algorithm.

 $\textbf{LRU (Least Recently Used)} \ associates "aging counters" \ with each page (block) \\ residing in main memory. \ Algorithm:$ 

- 1. When a page is referenced, its counter is cleared.
- 2. Only the counters of the pages that have lower values than that of the referenced page are incremented.
- 3. When a replacement is necessary, the page with the highest counter value is replaced; the new page is brought into main memory, its counter is cleared, and counters of other pages are incremented.

**Example:** Referenced page numbers: 0, 1, 2, 3, 0, 3, 4, 5 Main memory has 4 blocks. We need 2-bit counters.



License: https://creativecommons.org/licenses/by-nc-nd/4.0/

#### Computer Architecture

### Fragmentation

- Reminder: To benefit from spatial locality and to use the DMAC, the data are transferred in the form of pages.
- The virtual address space of a process is divided into fixed-size pages, resulting in potential internal fragmentation when the last page is copied into memory.
- · The process may not use the entire page.
- For example, in a system with a page size of 1K words, a process with a size of (5K words + 1 word) will occupy 6 pages.
- Actually, the last page contains only 1 word of data, but it will be copied as an
  entire page to main memory, and it will occupy an entire block there.
- · No other process may use this block.
- It might also happen that the process itself requires less than one page in its entirety, but it must occupy an entire block when copied to memory.
- In paged systems, pages are not partitioned into smaller chunks; they are always transferred as a whole.
- The problem that some space in a page cannot be used is called internal fragmentation.
- Besides, some processes may not need the entire address space. Therefore, they do not use all 2<sup>p</sup> lines of the page table.

http://akademi.itu.edu.tr/en/buzluca http://www.buzluca.info



2013 - 2020 Feza BUZLUCA

3 13

### Computer Architecture

# 8.2 Segmented and Paged Mapping (Paged Segmentation):

- The virtual address space is divided into logical, variable-length units, called segments.
- Each segment may have a different number of pages.
- A segment corresponds to a logical part of a program, such as an entire program, a subroutine, or an array.
- Segmentation supports the sharing and protection on logical parts of programs (segments), both of which are very difficult to do with paging.
- The logical address is divided into three fields:

n bits

Logical address: Segment Num. Page Num. word (offset)
s bits p bits r bits

Properties of the system:

Logical address space: 2<sup>n</sup> words

Number of Segments: 2<sup>S</sup>

Number of pages in a segment: Between 1 and 2<sup>p</sup>

Size of a page: 2<sup>r</sup> words

http://akademi.itu.edu.tr/en/buzluca http://www.buzluca.info **@ ⊕ ⊕** 

2013 - 2020 Feza BUZLUCA

8.14





#### Computer Architecture

### 8.3 Putting It All Together: Paged Segmentation, TLB, Cache Memory

- 1. The CPU generates the logical address: s, p, w (segment, page, word).
- 2. The TLB is searched for the (s, p) pair of the logical address.
  - a) If the (s, p) pair matches a tag in the TLB and access rights allow access
    - Retrieve the physical block number from the TLB, and form the physical address.
    - Search for the data in cache memory (see step 3).
    - If LRU is used, update the aging counters in the TLB.
  - b) If the searched (s, p) pair is **not** in the TLB
    - Retrieve the starting address of page table from the segment table using s.
    - Search in the page table for the referenced page number p.
    - i. If the page is in main memory
      - Retrieve the physical block number from the PT, and form the physical address.
      - Search for the data in cache memory (see step 3).
      - Update the TLB: Place new s, p, and Physical Block Num. info into the TLB; perform replacement if necessary.
      - If LRU is used, update the aging counters in the PT.

http://akademi.itu.edu.tr/en/buzluca http://www.buzluca.info



2013 - 2020 Feza BUZLUCA

8.17

### Computer Architecture

- b) If the searched (s, p) pair is **not** in the TLB (cont'd)
  - ii. If the page is **not** in main memory
    - A page fault occurs.
    - Bring the referenced page is to physical memory.
    - Update the page table: Address, counters.
    - Form the physical address.
    - Search for the data in cache memory (see step 3).
    - Update the TLB: Place new s and p info into the TLB; perform replacement if necessary.
- 3. Using the physical address, the referenced data is searched for in cache memory according to the mapping technique used (set-associative or direct).
  - a) If the data is in cache memory
    - Read the data from cache memory.
    - If LRU is used, update the counters in the cache.
  - b) If the data is **not** in cache memory
    - Read the data from main memory.
    - Transfer the block containing the data from main memory to cache.
    - Update counters in the cache.

http://akademi.itu.edu.tr/en/buzluca http://www.buzluca.info **@ ⊕ ⊕** 

2013 - 2020 Feza BUZLUCA

8.18