SLIDE 1
Variations of Virtual Memory
CSE 240A Student Presentation
Paul Loriaux
Thursday, January 21, 2010
SLIDE 2
SLIDE 3
VM: Real and Imagined
Every user process is assigned its own linear address space.
Each address space is a single protection domain shared by all of its threads. Sharing is only possible at page granularity.
Disadvantage 1: A pointer is meaningless outside its address context.
Disadvantage 2: Transfer of control across protection domains requires an expensive context switch.
In other words, sharing is hard and slow. Compare this to the "ideal" VM as imagined years ago: every allocated region a "segment" with its own protection information. However, this has so far proved to be slow and cumbersome. So far...
SLIDE 4
Enter Mondrian memory protection (MMP)!
Offers fine-grained memory protection with the simplicity and efficiency of today's linear addressing, and with acceptably small run-time overheads.
How? By (A) allowing different PDs to have different permissions on the same memory region; (B) supporting sharing at granularity smaller than a page; and (C) allowing PDs to own regions of memory and grant or revoke privileges on them.
Conventional linear VM systems fail on (A) and (B). Page-group systems fail on (A) and (B). Capability-based systems fail mainly on (C), and arguably on (A).
SLIDE 5
MMP Design
1. A Permissions Table, one per PD and stored in privileged memory, specifies the permissions that PD has for every address in the address space.
2. A control register holds the address of the active PD's permissions table.
3. A PLB (protection lookaside buffer) caches entries from (1) to reduce memory accesses.
4. A sidecar register, one per address register, caches the last segment accessed by its associated register.
A compressed permissions table reduces the space needed for permissions.
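The lookup path above can be sketched in software. This is a minimal, illustrative model (all names are invented; real MMP does this in hardware): a per-PD permissions table consulted through a small PLB-like cache.

```python
# Hypothetical sketch of the MMP lookup path: a per-PD permissions table
# consulted through a PLB-like cache. Names are invented for illustration.

NONE, READ_ONLY, READ_WRITE, EXECUTE_READ = range(4)  # 2-bit permission values

class ProtectionDomain:
    def __init__(self, table):
        self.table = table   # maps word-aligned address -> 2-bit permission
        self.plb = {}        # caches recent table lookups (models the PLB)

    def check(self, addr):
        # 1. Consult the PLB first to avoid a memory access.
        if addr in self.plb:
            return self.plb[addr]
        # 2. On a PLB miss, walk the permissions table in privileged memory.
        perm = self.table.get(addr, NONE)
        self.plb[addr] = perm  # refill the PLB
        return perm

pd = ProtectionDomain({0x1000: READ_WRITE, 0x1004: READ_ONLY})
assert pd.check(0x1000) == READ_WRITE   # miss, then cached
assert 0x1000 in pd.plb                 # PLB now holds the entry
assert pd.check(0x2000) == NONE         # unmapped word: no access
```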
SLIDE 6
How to store permissions, take 1: SST
Sorted Segment Table: a linear, sorted array of segments, permitting a binary search on PLB miss. Segments can be any number of words in length, but cannot overlap. Each entry in the SST includes a 30-bit start address and a 2-bit permissions field.
Goal: balance (a) space overhead, (b) access time overhead, (c) PLB utilization, and (d) time to modify the tables when permissions change.
Problem: it can still take many steps to locate a segment when the number of segments is large.
Problem: the table can only be shared between PDs in its entirety.
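The binary search on a PLB miss might look like the following sketch (illustrative, not the paper's exact encoding): each entry is a (start address, permission) pair sorted by start, and a segment runs from its start to the next entry's start.

```python
import bisect

# Sketch of a Sorted Segment Table lookup. Each entry is
# (start_address, permission), sorted by start; a segment extends
# from its start up to the next entry's start.

NONE, READ_ONLY, READ_WRITE = 0, 1, 2

sst = [(0x0000, NONE), (0x1000, READ_ONLY), (0x1400, READ_WRITE), (0x2000, NONE)]
starts = [s for s, _ in sst]

def sst_lookup(addr):
    # Binary search for the last segment starting at or below addr,
    # as would happen on a PLB miss.
    i = bisect.bisect_right(starts, addr) - 1
    return sst[i][1]

assert sst_lookup(0x1200) == READ_ONLY
assert sst_lookup(0x1400) == READ_WRITE
assert sst_lookup(0x3000) == NONE
```

With many segments the O(log n) search still costs several memory references per miss, which is the first "Problem" noted above.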
SLIDE 7
How to store permissions, take 2: MLPT
Multi-level Permissions Table: a multi-level table, sort of like an inode. 1024 root entries, each of which maps a 4 MB block; within that, each entry maps a 4 KB block; within that, each of 64 entries provides individual permissions for 16 x 4 B words.
How are permissions stored in those 4-byte words?
Option 1: Permission vector entries.
Option 2: Mini-SST entries.
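A 32-bit address decomposes into the three table indices plus a word offset. This sketch just follows the arithmetic implied by the slide's numbers (1024 root entries x 4 MB, 1024 mid entries x 4 KB, 64 leaf entries x 16 four-byte words); the function name is invented.

```python
# Decompose a 32-bit address for the three-level MLPT walk described above.

def mlpt_indices(addr):
    root = (addr >> 22) & 0x3FF   # which 4 MB block (1024 root entries)
    mid  = (addr >> 12) & 0x3FF   # which 4 KB block within it
    leaf = (addr >> 6)  & 0x3F    # which 16-word group (64 leaf entries)
    word = (addr >> 2)  & 0xF     # which word within the group
    return root, mid, leaf, word

# 4 MB = 1 << 22, so an address of exactly 4 MB lands in root entry 1.
assert mlpt_indices(4 * 1024 * 1024) == (1, 0, 0, 0)
assert mlpt_indices(0x1040)[1:] == (1, 1, 0)  # 4 KB + 64 B into the space
```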
SLIDE 8
Permission Vector Entries
Well, you've got 32 bits and 2-bit permissions, so just chop the entry up into 16 two-bit values indicating the permissions for each of 16 words.
Problem: does not take advantage of the fact that most user segments are longer than a single word; i.e., it is not compact.
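The packing itself is simple bit arithmetic; a sketch (helper names invented):

```python
# Sketch of a permission-vector entry: 16 two-bit fields packed into a
# 32-bit word, one field per memory word covered.

NONE, READ_ONLY, READ_WRITE, EXECUTE_READ = range(4)

def set_perm(entry, word_index, perm):
    shift = 2 * word_index
    return (entry & ~(0b11 << shift)) | (perm << shift)

def get_perm(entry, word_index):
    return (entry >> (2 * word_index)) & 0b11

entry = 0  # all 16 words start with no access
entry = set_perm(entry, 0, READ_WRITE)
entry = set_perm(entry, 15, READ_ONLY)
assert get_perm(entry, 0) == READ_WRITE
assert get_perm(entry, 15) == READ_ONLY
assert get_perm(entry, 7) == NONE
assert 0 <= entry < (1 << 32)  # still fits in 32 bits
```

Note that a 1000-word segment with uniform permissions still burns 63 of these entries, which is the compactness problem the slide points out.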
SLIDE 9
Mini-SST Entries
Each entry carries a 2-bit entry type: either a pointer to the next level, a pointer to a permission vector, or a mini-SST entry.
Two segments (mid0, mid1) encode two different permissions for 16 words. One segment (first) encodes permissions for a (maximally) 31-word segment upstream. One segment (last) encodes permissions for a (maximally) 32-word segment downstream. Total address range: 79 words.
Advantage: much more compact.
Advantage: overlap between segments may alleviate proximal loads from the table.
Disadvantage: overlapping address ranges complicate table updates.
SLIDE 10
Boosting Performance via 2-Level Permissions Caching
The PLB caches Permissions Table entries, analogous to the TLB.
Low-order "don't care" bits in the PLB tag increase the number of addresses a PLB entry matches, thus decreasing the PLB miss rate.
Changes in permissions require a PLB flush. As above, "don't care" bits in the search key allow all PLB entries within the modified region to be invalidated in a single cycle.
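The "don't care" trick is just masked comparison. A sketch of both uses, matching and bulk invalidation (the class and its fields are invented for illustration; the hardware does this associatively):

```python
# Sketch of "don't care" bit matching: a PLB tag plus a mask of ignored
# low-order bits lets one entry cover a power-of-two range of addresses.

class PLBEntry:
    def __init__(self, tag, ignore_bits, perm):
        self.mask = ~((1 << ignore_bits) - 1)  # low bits are "don't care"
        self.tag = tag & self.mask
        self.perm = perm
        self.valid = True

    def matches(self, addr):
        return self.valid and (addr & self.mask) == self.tag

# One entry covering a 64-byte (2^6) region:
e = PLBEntry(0x1000, 6, perm=2)
assert e.matches(0x1000) and e.matches(0x103F)
assert not e.matches(0x1040)

# Invalidate every entry within a modified 4 KB region using a masked key,
# the software analogue of the single-cycle flush described above:
region_mask = ~((1 << 12) - 1)
if (e.tag & region_mask) == (0x1000 & region_mask):
    e.valid = False
assert not e.matches(0x1000)
```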
SLIDE 11
Boosting Performance via 2-Level Permissions Caching
Each address register in the machine has an associated sidecar register. On a PLB miss, the entry returned by the Permissions Table is also loaded into the appropriate sidecar register. The base and bound of the user segment represented by the table entry are expanded to facilitate boundary checks.
Idea: an address register will frequently load/store from/to the same address or one within the same user segment, so hardwire the permissions next to the register. This reduces traffic to the PLB.
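A software caricature of the sidecar check (names and structure invented): a reference inside the cached [base, bound) range needs no PLB access at all.

```python
# Sketch of the sidecar idea: each address register remembers the base,
# bound, and permission of the last user segment it touched, so repeated
# references through the same register skip the PLB entirely.

class Sidecar:
    def __init__(self):
        self.base = self.bound = -1  # empty: matches nothing
        self.perm = 0

    def hit(self, addr):
        # A reference within [base, bound) needs no PLB access.
        return self.base <= addr < self.bound

    def refill(self, base, bound, perm):
        # On a sidecar miss, the PLB/table result is cached here.
        self.base, self.bound, self.perm = base, bound, perm

sc = Sidecar()
assert not sc.hit(0x1008)            # cold sidecar: must go to the PLB
sc.refill(0x1000, 0x1100, perm=2)    # segment returned by the table walk
assert sc.hit(0x1008) and sc.hit(0x10FF)  # later refs stay register-local
assert not sc.hit(0x1100)            # one past the bound misses
```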
SLIDE 12
Evaluating Performance Overhead
Evaluated both C and Java programs (why?) that were a mix of memory-reference-intensive and memory-allocation-intensive workloads. All benchmark programs were run on a MIPS simulator modified to trace memory references.
One confounding parameter: the degree of granularity. Evaluated the extrema: (a) coarse-grained, as provided by today's VM, and (b) super-fine-grained, where every object is its own user segment.
Refs: total no. of loads and stores x 10^6. Segs: no. of segments written to the PT. R/U: avg. references per PT update. Cs: no. of coarse-grained segments.
SLIDE 13
Metrics
Runtime overhead = number of permissions table references (reads and writes) ÷ number of memory references made by the application.
Space overhead = space occupied by protection tables ÷ space being used by the application (data + instructions) at the end of the run. Space being used by the application is determined by querying every word in memory and seeing if it has valid permissions.
Caveat: space between malloc'd regions is not included in this quantity.
Caveat: peak overhead is not measured.
Caveat: this overhead may or may not manifest as performance loss, depending on the CPU implementation.
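Written out as code, the two metrics are simple ratios (the numbers below are illustrative, not the paper's results):

```python
# The two overhead metrics above, written out with illustrative numbers.

def runtime_overhead(pt_refs, app_refs):
    # permissions-table references / application memory references
    return pt_refs / app_refs

def space_overhead(table_bytes, app_bytes):
    # protection-table space / application space (data + instructions)
    return table_bytes / app_bytes

assert runtime_overhead(5_000, 1_000_000) == 0.005   # 0.5%
assert space_overhead(90_000, 1_000_000) == 0.09     # 9%
```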
SLIDE 14
Coarse-Grained Protection Results
MLPT with mini-SST entries and a 60-entry PLB versus a conventional page table plus TLB.
Expectation: slight space overhead from MLPT leaf tables.
Expectation: slight speed improvement from the additional hardware.
Claim: the overhead of MMP word-level protection is very low when it is not used.
The expectations generally hold.
SLIDE 15
Fine-Grained Protection Results
Removed permissions on the malloc header and only allowed the program access to the allocated block.
Claim 1: MLPT outperforms SST as the number of segments increases. Why?
Claim 2: MLPT space overhead is always < 9%.
Claim 3: The mini-SST table entry outperforms permission vectors.
SLIDE 16
Memory Hierarchy Performance
The sidecar miss rate is about 10-20%; the PLB miss rate is just 0.5%.
The impact of permissions table accesses on L1/L2 cache efficiency is slight, with less than an additional 0.25% added to the miss rate in the worst case.
SLIDE 17
Conclusions
1. Fine-grained, segment-based memory protection that is compatible with current linearly addressed ISAs is feasible.
2. The space and runtime overhead of providing this protection is small and scales with the degree of granularity.
3. The MMP facilities can be used to implement efficient applications.
SLIDE 18
SLIDE 19
Context
64-bit virtual address spaces are coming. That's more address space than a program could ever want or need. This alleviates the existing evolutionary pressure on OSes to treat virtual addresses as a scarce resource that must be multiply allocated. All programs can now live in one big, happy address space. These are single address space (SAS) operating systems.
Pro: addresses are unique and context-independent.
Con: no more private address spaces means no intrinsic protection.
This paper focuses on how to represent protection information in the cache structures of SAS systems.
SLIDE 20
The Promises of SAS OSes
Support for sharing: VAs are globally unique, so they can be passed between domains without translation. This alleviates the need for costly RPCs when communicating across protection domains.
Virtually indexed caches: virtually indexed caches are faster than physically indexed caches because no address translation is required. However, multiple-address-space OSes must use physical indexing because: (1) two or more VAs from different PDs may reference the same physical address (synonyms), causing coherency problems; and (2) one VA used in different PDs may reference different physical addresses (homonyms). Both may be circumvented, but at a cost in performance. In SAS systems, synonyms and homonyms don't exist: the virtual-to-physical mapping is (or can be) 1-to-1.
SLIDE 21
Motivation
We would like to take advantage of the benefits of SAS OSes. To do so, we need to restore the protection that was lost when protection domains stopped having separate address spaces. This paper evaluates two models of hardware support for protection in SAS systems.
SLIDE 22
What's wrong with conventional architectures and SAS?
1. Protection domains in a SAS system would typically reference small and widely scattered pieces of the address space. Linear page tables cannot represent such sparse mappings compactly.
2. Translation mappings for shared pages must be duplicated in the page tables of each domain. This is wasteful and invites coherency issues.
SLIDE 23
Two models for supporting protection in SAS systems
Page-group model: defines logical groupings of pages called page-groups. A PD is defined by the set of page-groups it can access. Each page within a group has access rights that apply to all domains with access to the group.
Domain-page model: specifies permissions explicitly for each domain-page pair. Can be implemented by moving PD tags from the TLB to a protection lookaside buffer (PLB).
SLIDE 24
The PLB
Each PLB entry contains the protection information granted to one PD for one specific virtual page. On each memory reference, the PLB is accessed by the VPN and the PD-ID, the latter provided by a processor control register.
Note that the VA is used for both the cache and the PLB, so the lookups can occur in parallel.
Note that separating translation from protection in this manner allows the PLB to be used in conjunction with a virtually indexed and tagged cache.
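The (PD-ID, VPN)-keyed lookup can be sketched as follows (a software model with invented names; the hardware performs this as an associative lookup, and a 4 KB page size is assumed here):

```python
# Sketch of the domain-page PLB described above: entries are keyed by
# (PD-ID, virtual page number), holding protection only, no translation.

PAGE_SHIFT = 12  # assume 4 KB pages

class PLB:
    def __init__(self):
        self.entries = {}  # (pd_id, vpn) -> rights

    def grant(self, pd_id, vpn, rights):
        self.entries[(pd_id, vpn)] = rights

    def lookup(self, pd_id, vaddr):
        # pd_id comes from a processor control register; the VPN from the VA.
        vpn = vaddr >> PAGE_SHIFT
        return self.entries.get((pd_id, vpn))  # None -> PLB miss / fault

plb = PLB()
plb.grant(pd_id=7, vpn=0x40, rights="rw")
assert plb.lookup(7, 0x40123) == "rw"    # same page, any offset
assert plb.lookup(3, 0x40123) is None    # another domain: no entry
```

Because no physical address is produced, this lookup can run in parallel with a virtually indexed, virtually tagged cache access, which is the point made above.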
SLIDE 25
The PLB
Note this is different from what we've seen before: address translation is outside the critical path of the CPU. Here the TLB can be moved off-chip, allowing a potentially much larger TLB. Note the TLB only requires one entry for each virtual-to-physical mapping. A purge is required only when a virtual-to-physical translation changes, not during a protection domain switch.
SLIDE 26
The Page‐Group Model
This TLB takes a VPN and returns (a) a physical address, (b) rights, and (c) an access identifier (AID) that contains a page-group number. Four page-group registers (PIDs) store the set of page-groups accessible to the current PD. The processor must determine whether the current PD has access to the page-group identified by the AID: if AID == 0 (global) or AID matches one of PID1-4, then access is granted, with the effective rights determined by (a) the TLB, (b) the current CPU privilege level, and (c) a write bit.
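The access decision reduces to a small comparison, sketched here (an illustrative reading of the slide; the function name is invented and the real check is four parallel comparators):

```python
# Sketch of the page-group check: the TLB's AID is compared against the
# four page-group registers; AID 0 is the global page-group.

GLOBAL_AID = 0

def access_granted(aid, pid_registers):
    # pid_registers models PID1-PID4 for the current protection domain.
    return aid == GLOBAL_AID or aid in pid_registers

pids = (12, 7, 99, 3)                # page-groups the current PD may access
assert access_granted(0, pids)       # global page-group: always accessible
assert access_granted(7, pids)       # matches PID2
assert not access_granted(42, pids)  # no register matches -> violation
```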
SLIDE 27
The Page‐Group Model
Note 1: if access is not granted, an access violation is signaled and the kernel is invoked.
Note 2: the four page-group registers obviously limit the number of groups a PD can access. For the evaluation, the authors assume an LRU cache of page-groups.
Note 3: translation and protection are combined in this TLB, so the TLB must be on-chip. But a virtually indexed TLB and an on-chip PLB could have been used as well, making page-grouping a somewhat orthogonal issue.
SLIDE 28
Evaluation A
SLIDE 29