CS5460: Operating Systems Lecture 14: Memory Management (Chapter 8)


SLIDE 1

CS5460: Operating Systems

Lecture 14: Memory Management (Chapter 8)

SLIDE 2

Important from last time

• We’re trying to build efficient virtual address spaces – Why??
• Virtual / physical translation is done by HW and SW cooperating
– We want to make the common case fast in HW
• Key insight: Treat a virtual address as a collection of separate elements (see the sketch below)
– High bits identify a segment or page
– Low bits identify offset into the segment or page
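A minimal C sketch of that split, assuming 4 KB pages (the macro and function names are illustrative, not from the lecture):

#include <stdint.h>

#define PAGE_SHIFT 12                        /* assume 4 KB pages */
#define PAGE_MASK  ((1u << PAGE_SHIFT) - 1)

/* High bits name the page; low bits locate the byte within it. */
uint32_t vpn(uint32_t va)      { return va >> PAGE_SHIFT; }
uint32_t page_off(uint32_t va) { return va & PAGE_MASK; }

For a segmented machine the same idea applies, just with a segment number in the high bits instead of a page number.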


SLIDE 3

Quick Review

• Base and bounds (check sketch below)
– Maps a single contiguous chunk of virtual address space to a single contiguous chunk of physical memory
– Fast, small HW, and safe, but inflexible and lots of external fragmentation
• Segmentation
– Small number of segment registers (base-bounds per segment)
– Each segment maps contiguous VM to contiguous physical mem
– More flexible and less fragmentation than B+B, but same issues
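A hedged sketch of the base-and-bounds check described above; the register pair and function names are assumptions for illustration:

#include <stdbool.h>
#include <stdint.h>

/* Hypothetical per-process registers, loaded on context switch. */
typedef struct { uint32_t base, bounds; } bb_regs;

/* Translate va, refusing (i.e., trapping) if it is out of range. */
bool bb_translate(bb_regs r, uint32_t va, uint32_t *pa)
{
    if (va >= r.bounds)
        return false;      /* protection fault: past the bounds     */
    *pa = r.base + va;     /* one contiguous chunk: just add a base */
    return true;
}

Segmentation repeats this per segment: the high bits of va pick which (base, bounds) pair to use.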

SLIDE 4

Paging

• Modern hardware and OSes use paging
• Pages are like segments, but fixed size
– So the bounds register goes away
– External fragmentation goes away
• Since pages are small (4 or 8 KB, often) there are a lot of them
– So page table has to go into RAM
– Page table might be huge
– Accessing the page table requires a memory reference by the hardware to perform the translation (sketch below)
» First, load the page table entry
» Second, access the data the user asked for

• Today we look at how these problems are fixed
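To see where the extra memory reference comes from, here is an illustrative software model of the flat-table walk the hardware performs (the PTE layout and page_fault stub are assumptions):

#include <stdint.h>

#define PAGE_SHIFT 12
#define PAGE_MASK  ((1u << PAGE_SHIFT) - 1)
#define PTE_VALID  0x1u                  /* hypothetical PTE layout */

extern uint32_t *page_table;             /* flat table in RAM, indexed by VPN */
extern uint32_t page_fault(uint32_t va); /* stands in for a trap to the OS */

uint32_t translate(uint32_t va)
{
    uint32_t pte = page_table[va >> PAGE_SHIFT];   /* memory reference #1 */
    if (!(pte & PTE_VALID))
        return page_fault(va);
    return (pte & ~PAGE_MASK) | (va & PAGE_MASK);  /* the user's own load/store
                                                      is memory reference #2 */
}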


SLIDE 5

Translation Lookaside Buffer (TLB)

• Problems
– Page table too large to store on processor die
– Two memory references per load or store issued by the user program is unacceptable
• Solution: Cache recently-used page table entries
– TLB == “translation lookaside buffer”
– TLB is a fixed-size cache of recently used page table entries
– If TLB hit rate sufficiently large → translation as fast as segments
– If TLB hit rate poor → load/store performance suffers badly
– Key issue: Access locality
– What is effective access time with/without TLB? (worked example below)
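A worked effective-access-time (EAT) example; the latencies are illustrative, not from the slides. With hit ratio h, TLB latency t_TLB, and memory latency t_mem, a hit costs one memory reference and a miss costs two (one-level table assumed):

EAT = h (t_TLB + t_mem) + (1 - h)(t_TLB + 2 t_mem)

For t_TLB = 20 ns and t_mem = 100 ns:
h = 0.98:  EAT = 0.98 × 120 + 0.02 × 220 = 122 ns
h = 0.80:  EAT = 0.80 × 120 + 0.20 × 220 = 140 ns
No TLB:    every access costs 2 × 100 = 200 ns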

SLIDE 6

Paging Unit

• CPU issues load/store; the memory management unit (MMU) then:
• 1. Compares VPN to all TLB tags
• 2. If no match, needs TLB refill
» SW → trap to OS
» HW → HW table walker
• 3. Checks if VA is valid
» If not, generate trap
• 4. Concatenates PPN to Offset to form physical address
• This all needs to be very fast
– Why?

[Diagram: the CPU core presents a virtual address (Virtual Pg# | Offset) to the TLB, a table of (Tag, PPN, State) entries; a tag match (1) yields the Phys. Pg#, which is concatenated with Offset (4) to form the physical address; no match (2) causes a trap or HW refill, and an invalid VA (3) generates a trap.]
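A sequential C model of steps 1-4 (a real TLB compares all tags in parallel in hardware; the sizes and field names here are illustrative):

#include <stdint.h>

#define TLB_ENTRIES 64                     /* illustrative size */
#define STATE_VALID 0x1u

typedef struct { uint32_t tag, ppn, state; } tlb_entry;
static tlb_entry tlb[TLB_ENTRIES];

extern uint32_t tlb_refill(uint32_t vpn);  /* SW trap to OS or HW table walker */

uint32_t tlb_translate(uint32_t va)
{
    uint32_t vpn = va >> 12, off = va & 0xFFFu;
    for (int i = 0; i < TLB_ENTRIES; i++)           /* 1. compare VPN to tags  */
        if ((tlb[i].state & STATE_VALID) && tlb[i].tag == vpn)
            return (tlb[i].ppn << 12) | off;        /* 4. concatenate PPN|off  */
    return (tlb_refill(vpn) << 12) | off;           /* 2./3. miss: refill or trap */
}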
SLIDE 7

TLB Issues

• How large should the TLB be?
• What limits TLB size?
• How does a TLB interact with context switches?


SLIDE 8

TLB Discussion

• What are typical TLB sizes and organization?
– 16-1024 entries, fully associative, sub-cycle access latency
– TLB misses take 10-100 cycles
– Modern chips have multiple levels of TLB
– Perform TLB lookup in parallel to L1 cache tag load
» Requires virtually tagged L1 cache
– Why is TLB often fully associative?
• TLB “reach”
– Amount of address space that can be mapped by TLB entries
– What are architectural trends in terms of:
» TLB size and associativity
» Main memory size
– Example: 64-entry TLB w/ 4KB pages → 256 KB reach (arithmetic below)
– Good news: 90-10 rule (90% of accesses to 10% of address space)
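The reach arithmetic behind that example, with an illustrative projection of the trend:

reach = TLB entries × page size = 64 × 4 KB = 256 KB

Even a 1024-entry TLB with 4 KB pages reaches only 4 MB, a shrinking fraction of multi-GB main memories; this is the motivation for the superpages discussed later.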

SLIDE 9

More Paging Discussion

• What are typical state bits?
– Valid bit
– Protection bits (read-only, execute-only, read-write)
– Referenced bit
– Modified bit
» Referenced and modified bits are used to support “demand paging”
– … (real PTEs tend to have more; an illustrative layout follows)
• What happens during a context switch?
– PCB needs to be extended to contain (or point to) page table
– Save/restore page table base register (for hardware-filled TLB)
– Flush TLB (if PID not in tag field)
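One illustrative way to pack these state bits into a C struct; real PTE formats (x86, for example) differ in field order and width:

#include <stdint.h>

typedef struct {
    uint32_t valid      : 1;   /* is the mapping present?               */
    uint32_t prot       : 2;   /* read-only / execute-only / read-write */
    uint32_t referenced : 1;   /* set by HW on access (demand paging)   */
    uint32_t modified   : 1;   /* set by HW on write ("dirty")          */
    uint32_t ppn        : 20;  /* physical page number                  */
} pte_t;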

SLIDE 10

Page Table Organization

• Simple flat full table
– Simple, fast lookups
– Huge!!!
• Hierarchical
– Two or more levels of tables
» Leaf tables have PTEs
– Fairly simple, fairly fast
– Size roughly proportional to allocated memory size
• Inverted
– One entry per physical page
– Entry contains VPN and ASID
– Small table size
– Lookups more expensive
» Hashing helps (lookup sketch below)
» Poor memory locality

[Diagrams: hierarchical: a Root Page Table in main memory points to leaf tables whose PTEs carry Valid, R/W, … bits. Inverted: hash(PID | Page#) starts a search of the Inverted Page Table; the matching entry n yields PPN(n), which is concatenated with Offset to form the physical address.]
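A hedged sketch of the inverted-table lookup; the hash function and chaining scheme are assumptions:

#include <stdint.h>

#define IPT_SIZE 4096                    /* one entry per physical frame */

typedef struct { uint32_t pid, vpn; int next; } ipt_entry;  /* next: chain, -1 ends it */
static ipt_entry ipt[IPT_SIZE];
static int bucket[IPT_SIZE];             /* hash value -> first frame in chain */

static unsigned ipt_hash(uint32_t pid, uint32_t vpn)
{
    return (pid ^ (vpn * 2654435761u)) % IPT_SIZE;   /* any decent hash */
}

/* Returns the frame number (the entry's own index), or -1 to fault. */
int ipt_lookup(uint32_t pid, uint32_t vpn)
{
    for (int f = bucket[ipt_hash(pid, vpn)]; f >= 0; f = ipt[f].next)
        if (ipt[f].pid == pid && ipt[f].vpn == vpn)
            return f;                    /* the PPN is the entry's position */
    return -1;                           /* miss: page fault */
}

Chasing the chain is what makes lookups more expensive, and each hop may touch a different cache line, hence the poor memory locality.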

SLIDE 11

[Diagram: two-level paging. The virtual address splits into outer page number p1, inner page number p2, and page offset d; p1 indexes the outer page table, whose entry locates an inner page table; p2 indexes that table to find the desired page, and d selects the byte within it.]
SLIDE 12

Multi-level page tables

• Why does this save space? (see the walk sketch below)
• Early virtual memory systems used a single-level page table
• VAX, 386 used 2-level page table
• SPARC uses 3-level
• Alpha, IA-64, x86-64 have 4 levels
– In many OSes, page tables other than top-level can themselves be paged
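An illustrative two-level walk for a 32-bit address split 10/10/12; treating the outer entry's frame field as a directly usable pointer is a simplification:

#include <stdint.h>

#define P1(va)  ((va) >> 22)              /* 10-bit outer index */
#define P2(va)  (((va) >> 12) & 0x3FFu)   /* 10-bit inner index */
#define OFF(va) ((va) & 0xFFFu)           /* 12-bit page offset */
#define VALID   0x1u

extern uint32_t *outer_table;             /* root of the hierarchy */
extern uint32_t page_fault(uint32_t va);

uint32_t walk(uint32_t va)
{
    uint32_t e = outer_table[P1(va)];
    if (!(e & VALID))                     /* whole inner table absent: this is  */
        return page_fault(va);            /* where unallocated regions cost nothing */
    uint32_t *inner = (uint32_t *)(uintptr_t)(e & ~0xFFFu);
    uint32_t pte = inner[P2(va)];
    if (!(pte & VALID))
        return page_fault(va);
    return (pte & ~0xFFFu) | OFF(va);
}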


SLIDE 13

Sharing

• How do you share memory?
– Entries in different process page tables map to same PPN
• Questions:
– What does this imply about granularity of sharing?
– Can we share only a 4-byte int?
– Can we set it up so that only one of the sharers can write the region?
– How can we use this idea to implement copy on write? (sketch below)

[Diagram: the page tables for P1 and P2 each map virtual pages VP0-VP5; several entries in the two tables point to the same physical pages, so sharing happens at page granularity.]
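A hedged sketch of the copy-on-write fault path; the flag names and helper functions are hypothetical:

#include <stdint.h>
#include <string.h>

#define PROT_RW 0x2u
#define PTE_COW 0x4u                       /* hypothetical "copy on write" flag */

extern uint32_t alloc_frame(void);         /* returns a free PPN (stub)    */
extern void    *frame_addr(uint32_t ppn);  /* kernel mapping of that frame */

/* Both sharers map the frame read-only with PTE_COW set; the first
 * write traps here, and the writer gets a private, writable copy. */
void cow_fault(uint32_t *pte)
{
    if (!(*pte & PTE_COW))
        return;                            /* genuine protection violation */
    uint32_t old_ppn = *pte >> 12;
    uint32_t new_ppn = alloc_frame();
    memcpy(frame_addr(new_ppn), frame_addr(old_ppn), 4096);
    *pte = (new_ppn << 12) | PROT_RW;      /* writable now, COW flag cleared */
    /* a real OS would also drop a reference count on the old frame */
}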

SLIDE 14

Initializing an Address Space

• Determine number of pages needed
– Examine header information in executable file
• Determine virtual address layout
– Header determines size of various segments
– OS specifies positioning (e.g., base of code, location of stack)
• Allocate necessary pages and initialize contents
– Initialize page table entry to contain VPN→PPN, set valid, …
• Mark current TLB entries as invalid (i.e., TLB flush)
• Start process
– PC should point at valid address in code segment
– Allocate stack space as process touches new pages

SLIDE 15

Superpages

• Problem: TLB reach shrinking as % of memory size
• Solution: Superpages
– Permit pages to have variable size (back to segments?)
– For simplicity, restrict generality:
» Power of two region sizes
» Aligned to superpage size (e.g., 1MB superpage aligned on 1MB boundary)
» Contiguous
• Problem: Restrictions limit applicability. How?

[Diagram: TLB entry extended with a Size field alongside Tag, PPN, and State; the VPN/Offset split now depends on the entry's size. A matching sketch follows.]
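An illustrative match against a variable-size TLB entry; the trick is masking off the low size bits before comparing (the entry layout, with tag and ppn held as full aligned addresses, is an assumption):

#include <stdbool.h>
#include <stdint.h>

typedef struct { uint32_t tag, ppn, size; } sp_tlb_entry;  /* size = log2(bytes) */

bool sp_match(sp_tlb_entry e, uint32_t va, uint32_t *pa)
{
    uint32_t mask = ~((1u << e.size) - 1);   /* works because regions are
                                                power-of-two sized and aligned */
    if ((va & mask) != (e.tag & mask))
        return false;                        /* a 1 MB page compares fewer bits */
    *pa = (e.ppn & mask) | (va & ~mask);     /* bigger page, bigger offset */
    return true;
}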

SLIDE 16

Example: Superpage Usage

[Diagram: virtual pages 0x00004000-0x00007000 map to physical pages 0x80240000-0x80243000 through a single page table entry: virtual = 00004, physical = 80240, size = 002, valid = Y, dirty = Y, with read-only and supervisor access bits.]

Size: denotes number of base pages in superpage (2^size)
Virtual and physical ranges aligned on superpage boundary
Both virtual and physical ranges are contiguous

SLIDE 17

Superpage Discussion

• What are good candidates for superpages?
– Kernel – or at least the portions of kernel that are not “paged”
– Frame buffer
– Large “wired” data structures
» Scientific applications being run in “batch” mode
» In-core databases
• How might OS exploit superpages?
– Simple: Few hardwired regions (e.g., kernel and frame buffer)
– Improved: Provide system calls so applications can request it
– Holy grail: OS watches page access behavior and determines which pages are “hot” enough to warrant superpaging
• Why might you not want to use superpages?

SLIDE 18

Paged Segmentation

• Problem with paging:
– Large VA space → large page table
• Idea: Combine segmentation and paging
– Divide address space into large contiguous segments
– Divide segments into smaller fixed-size pages
– Divide main memory into pages
– Two-stage address translation (sketch below)
• Benefits:
– Reduced page table size
– Can share segments easily
• Problems:
– Extra complexity

[Diagram: virtual address = Seg# | Page# | Offset. Seg# selects (base, limit) from the segment table; a limit check (>?) traps on overflow; the segment's page table maps Page# to (frame#, prot), and frame# is concatenated with Offset to form the physical address.]
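An illustrative two-stage translation with a 4-bit Seg#, 16-bit Page#, and 12-bit Offset; the split and table layout are assumptions:

#include <stdint.h>

typedef struct { uint32_t *page_table; uint32_t limit; } seg_entry;
extern seg_entry seg_table[16];          /* small enough for MMU registers */
extern uint32_t  trap(uint32_t va);

uint32_t ps_translate(uint32_t va)
{
    uint32_t seg  = va >> 28;            /* stage 1: pick the segment */
    uint32_t page = (va >> 12) & 0xFFFFu;
    if (page >= seg_table[seg].limit)
        return trap(va);                 /* bounds check on the segment */
    uint32_t frame = seg_table[seg].page_table[page];  /* stage 2: page lookup */
    return (frame << 12) | (va & 0xFFFu);
}

Note that the page table indexed in stage 2 only needs entries up to the segment's limit, which is how this scheme shrinks page tables.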

SLIDE 19

Paged Segmentation

• Sharing:
– Share individual pages → copy page table entries
– Share entire segments → copy segment table entries
• Protection:
– Can be associated with either segment or page tables
• Implementation
– Segment tables in MMU
– Page tables in main memory
• Practice
– x86 supports pages & segments
– RISCs support only paging

[Diagram: sharing example with two processes. Each virtual memory contains Seg0-Seg3; the two common segments (0 & 3) map to the same regions of physical memory.]

SLIDE 20

Bringing It All Together

• Paging is major improvement over segmentation
– Eliminates external fragmentation problem (no compaction)
– Permits flexible sharing between processes
– Enables processes to run when only partially loaded (more soon!)
• Significant issues remain
– Page tables are much larger than segment tables
– TLB required → effectiveness seriously impacts performance
– More complex OS routines for managing page table
» But management of physical address space much easier
– Translation overheads tend to be higher
• Question: How can we let processes run when only some of their pages are present in main memory?