Schism: Fragmentation-Tolerant Real-Time Garbage Collection

Replication-based GC
• See: [Nettles-O’Toole ’93], [Cheng-Blelloch ’01]
• Allows concurrent defragmentation
• Two spaces: one space for reads; writes “replicated” to both spaces
• Works best for immutable objects.
• Problem: writes are not atomic! Loss of coherence!
[Figure labels: Application, Read, Write, Object, Original, Replica, Copying]
Friday, June 11, 2010
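The coherence problem above can be sketched in a few lines. This is only an illustration, not the collectors cited on the slide: the class `ReplicatedCell` and its methods are hypothetical, and the two writer threads are interleaved by hand so the outcome is deterministic.

```java
// Illustrative sketch only: class and method names are hypothetical.
final class ReplicatedCell {
    int original; // copy in the space the application reads from
    int replica;  // copy in the other space, kept in sync by writes

    int read() { return original; }

    // A replicated write is really two separate stores.
    void storeOriginal(int v) { original = v; }
    void storeReplica(int v)  { replica = v; }

    void write(int v) {
        storeOriginal(v);
        storeReplica(v); // not atomic with the store above
    }

    boolean coherent() { return original == replica; }

    public static void main(String[] args) {
        ReplicatedCell c = new ReplicatedCell();
        // Hand-interleaved writers: T1 writes 1, T2 writes 2.
        c.storeOriginal(1); // T1
        c.storeOriginal(2); // T2
        c.storeReplica(2);  // T2
        c.storeReplica(1);  // T1
        System.out.println("original=" + c.original
                + " replica=" + c.replica
                + " coherent=" + c.coherent());
    }
}
```

With this interleaving the two copies end up holding different values, which is the loss of coherence the slide warns about. An object that is never written after creation cannot hit this case, which is why replication suits immutable data.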

Allocate in fragments [Siebert ’99]
• All objects split into small fragments.
• Fragment size is typically fixed at 32 bytes.
• Fragments are linked; the application must follow links on object access.
• Plain object: access cost is known statically and does not vary. Most objects require only two fragments.

Allocate in fragments [Siebert ’99]
• All objects split into small fragments.
• Fragment size is typically fixed at 32 bytes.
• Fragments are linked; the application must follow links on object access.
• Array: access cost is logarithmic. Array accesses will see significant slowdown!
• Bad idea for large arrays.
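A minimal sketch of fragmented allocation, under stated assumptions: a 32-byte fragment is modeled as seven 4-byte payload words plus one link, and arrays are assumed to be stored as a tree of fragments with a fan-out of 8 (which would be consistent with the logarithmic cost noted above). The names `FragmentedObject`, `hops`, and `treeDepth` are hypothetical and not taken from [Siebert ’99].

```java
// Illustrative sketch only: layout constants and names are assumptions.
final class FragmentedObject {
    // Assumption: 8 four-byte words per 32-byte fragment, one used as the link.
    static final int PAYLOAD_WORDS = 7;

    static final class Fragment {
        final int[] payload = new int[PAYLOAD_WORDS];
        Fragment next; // link to the following fragment, or null
    }

    private final Fragment head;
    private final int fieldCount;

    FragmentedObject(int fieldCount) {
        if (fieldCount <= 0) throw new IllegalArgumentException("fieldCount");
        this.fieldCount = fieldCount;
        int fragments = (fieldCount + PAYLOAD_WORDS - 1) / PAYLOAD_WORDS;
        head = new Fragment();
        Fragment f = head;
        for (int i = 1; i < fragments; i++) {
            f.next = new Fragment();
            f = f.next;
        }
    }

    // Links to follow for a field: depends only on the field index,
    // so it is known statically for a plain object.
    static int hops(int fieldIndex) { return fieldIndex / PAYLOAD_WORDS; }

    private Fragment locate(int fieldIndex) {
        if (fieldIndex < 0 || fieldIndex >= fieldCount) {
            throw new IndexOutOfBoundsException("field " + fieldIndex);
        }
        Fragment f = head;
        for (int i = hops(fieldIndex); i > 0; i--) f = f.next;
        return f;
    }

    int get(int i) { return locate(i).payload[i % PAYLOAD_WORDS]; }
    void set(int i, int v) { locate(i).payload[i % PAYLOAD_WORDS] = v; }

    // Levels of a tree with the given fan-out needed to reach `leaves`
    // leaf fragments: grows logarithmically with array size.
    static int treeDepth(long leaves, int fanout) {
        int depth = 0;
        long capacity = 1;
        while (capacity < leaves) {
            capacity *= fanout;
            depth++;
        }
        return depth;
    }
}
```

In this model a 10-field object needs two fragments and at most one link hop, while an array with 512 leaf fragments already needs three levels of indirection on every access.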

Synopsis
• Replication-copying Collection:
  • great, but only for immutable objects
• Fragmented Allocation:
  • great, unless you have large arrays
Can we combine the two?

Idea: combine Fragmented Allocation with Replication-Copying using Arraylets

A new way of exploiting Arraylets
[Figure: an arraylet spine pointing to fixed-size fragments]
• Fragments have fixed size: no external fragmentation.
• The arraylet spine has variable size, which can lead to fragmentation!

A new way of exploiting Arraylets
[Figure: an arraylet spine pointing to fixed-size fragments]
• Fragments have fixed size: no external fragmentation.
• But the spine is immutable ...
• ... and replication is ideal for immutable objects.

Schism = arraylets + replication + fragments
• Combination:
  • Concurrent mark-sweep GC for fixed-size fragments
  • Replication copying for variable-size arraylet spines
• No external fragmentation for either fragments or spines
• Heap access is O(1), wait-free, and coherent.
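The combination can be sketched as follows. This is a simplified model rather than the Schism implementation: the class `Arraylet`, the method `replicateSpine`, and the choice of eight 4-byte elements per fragment are assumptions made for illustration. The point it tries to show is that the spine only holds references to fragments, so copying the spine never copies mutable element data.

```java
// Illustrative sketch only: names and fragment size are assumptions.
final class Arraylet {
    // Assumption: eight 4-byte ints fit in one 32-byte fragment.
    static final int FRAGMENT_ELEMS = 8;

    private final int[][] spine; // immutable after construction
    private final int length;

    Arraylet(int length) {
        if (length < 0) throw new IllegalArgumentException("length");
        this.length = length;
        int n = (length + FRAGMENT_ELEMS - 1) / FRAGMENT_ELEMS;
        spine = new int[n][];
        for (int i = 0; i < n; i++) {
            spine[i] = new int[FRAGMENT_ELEMS]; // fixed-size fragment
        }
    }

    private Arraylet(int[][] spine, int length) {
        this.spine = spine;
        this.length = length;
    }

    // Replicate only the spine: a shallow copy that shares the fragments.
    Arraylet replicateSpine() {
        return new Arraylet(spine.clone(), length);
    }

    private void check(int i) {
        if (i < 0 || i >= length) {
            throw new IndexOutOfBoundsException("index " + i);
        }
    }

    // One spine lookup plus one fragment lookup: constant cost per access.
    int get(int i) {
        check(i);
        return spine[i / FRAGMENT_ELEMS][i % FRAGMENT_ELEMS];
    }

    void set(int i, int v) {
        check(i);
        spine[i / FRAGMENT_ELEMS][i % FRAGMENT_ELEMS] = v;
    }
}
```

Because a write goes to a fragment and both spine copies reference the same fragments, a value written through the original spine is visible through the replica and vice versa. No write ever has to be duplicated, so the coherence problem of replication does not arise for element data.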

[Figure: heap layout. A Concurrent Replication Heap for Spines, divided into a From-space for Array Spines and a To-space for Array Spines, alongside a Concurrent Mark-Sweep Heap for Fragments. Labels: Small Object, Large Array?]

related work - or - how to make a complete RTGC
[Figure: diagram placing Schism among related work. Labeled properties: good throughput; on-the-fly, concurrent; time/space bounds.]
• Cheng & Blelloch ’01
• Siebert ’99
• Henriksson ’98
• Kalibera et al ’09
• Blackburn & McKinley ’08
• Doligez, Leroy, Gonthier ’93, ’94
• Puffitsch & Schoeberl ’08
• Fiji CMR* (* concurrent mark-region)
• Schism
• Schism / CMR

Tunable throughput-predictability trade-off.
• Schism A: completely deterministic:
  • arrays allocated fragmented
• Schism C: optimize throughput:
  • allocate contiguously if possible
• Schism CW: simulate worst-case execution of Schism C:
  • poison all fast-paths (array accesses, write barriers, allocations)
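One way to picture the three configurations is a single array type whose allocation policy depends on a mode. This is a rough sketch under assumptions, not the Schism code: `TunableArray`, `Mode`, and the `contiguousSpaceAvailable` flag are hypothetical, and "poisoning" a fast path is assumed here to mean forcing it to fail so that the slow path always runs. Write barriers are not modeled.

```java
// Illustrative sketch only: names and the meaning of "poison" are assumptions.
final class TunableArray {
    enum Mode { A, C, CW }

    static final int FRAGMENT_ELEMS = 8; // assumed elements per fragment

    private final int[] contiguous; // fast-path storage, or null
    private final int[][] spine;    // fragmented storage, or null
    int slowPathAccesses = 0;       // counts accesses that went via the spine

    TunableArray(Mode mode, int length, boolean contiguousSpaceAvailable) {
        // Only mode C may take the contiguous fast path; A never tries,
        // and CW behaves as if the attempt always failed.
        boolean fast = (mode == Mode.C) && contiguousSpaceAvailable;
        if (fast) {
            contiguous = new int[length];
            spine = null;
        } else {
            contiguous = null;
            int n = (length + FRAGMENT_ELEMS - 1) / FRAGMENT_ELEMS;
            spine = new int[n][FRAGMENT_ELEMS];
        }
    }

    boolean isContiguous() { return contiguous != null; }

    int get(int i) {
        if (contiguous != null) return contiguous[i]; // fast path
        slowPathAccesses++;
        return spine[i / FRAGMENT_ELEMS][i % FRAGMENT_ELEMS];
    }

    void set(int i, int v) {
        if (contiguous != null) { contiguous[i] = v; return; } // fast path
        slowPathAccesses++;
        spine[i / FRAGMENT_ELEMS][i % FRAGMENT_ELEMS] = v;
    }
}
```

In this model a mode C array is contiguous only when space allows, so its cost varies with heap state, while modes A and CW always pay the fragmented cost. That is the sense in which CW can stand in for the worst case of C.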

(very short) Summary of Results
• Goal: as fast as Metronome
• Goal: fragmentation tolerant like Java RTS
• Goal: deterministic

SPECjvm98 throughput summary
[Figure: bar chart of throughput relative to HotSpot (100% = HotSpot) for Java RTS, Metronome, and Schism; vertical axis from 0% to 70%.]

(very short) Summary of Results
• Goal: as fast as Metronome ✓
• Goal: fragmentation tolerant like Java RTS
• Goal: deterministic

Fragger Results
• Amount of free memory successfully allocated under fragmentation:
  • HotSpot: ~100%
  • Java RTS: ~80%
  • Metronome: ~1%, unless using >10KB objects
  • Schism: ~100% (all objects)

(very short) Summary of Results
• Goal: as fast as Metronome ✓
• Goal: fragmentation tolerant like Java RTS ✓
• Goal: deterministic

Schism predictability: RTEMS* on 40MHz LEON3
• The OS/hardware platform used for NASA & ESA space missions.
* Real Time Executive for Missile Systems

Performance baseline: C code.
• Using both C and Java implementations of the CDx real-time air traffic collision detection benchmark [Kalibera et al ’09].

Java (CMR, Schism) versus C on CDx real-time benchmark
[Figure: chart of CDx execution times in milliseconds (axis from 40 to 120, with Min and Max marked) for C code and for Java on Fiji CMR, Schism C, Schism CW, and Schism A. Values annotated on the chart: 70.5, 96.6, 97.2, 112.5.]
• CDx performance varies between events due to the varying number of predicted collisions.
• Schism CW refines the worst-case of Schism C by accounting for GC

Schism: Fragmentation-Tolerant Real-Time Garbage Collection
Fil Pizlo, Luke Ziarek, Peta Maj, Tony Hosking, Ethan Blanton, Jan Vitek

Why another Real Time Garbage Collector?
