Register Allocation II 15-745 Optimizing Compilers Spring 2006 - PowerPoint PPT Presentation

Register Allocation II 15-745 Optimizing Compilers Spring 2006 David Koes School of Computer Science

Outline • Review • Old Subtleties • New Subtleties School of Computer Science

Review Build Simplify Coalesce Potential Spill Select Actual Spill School of Computer Science

Build First compute live ranges v <- 1 - use both reaching definitions w <- v + 3 and liveness information x <- w + v - live range defined by v definition point u <- v x - ends when variable dies t <- u + x w <- w t <- t u <- u School of Computer Science

Build Construct interference graph: - each node represents a live range v - edges represent live ranges that overlap - put in move edges between move operands m o v x w e e d v <- 1 g e w <- v + 3 v x <- w + v x u u <- v w t <- u + x t <- w u <- t t <- u School of Computer Science

Simplify k = 4 Reduce the graph: - remove non-move related, easy to color, nodes v - easy to color: degree < k - place on stack x w w u x t t School of Computer Science

Coalesce Coalesce moves: k = 4 - conservatively combine operands of a move v - subtlety: how? Repeat Simplify uv uv -Detail: If both Simplify and Coalesce get stuck, start simplifying move related nodes w u Simplify x Coalesce t School of Computer Science

What if we can’t simplify? Now what? k = 3 Be optimistic: v - Put a node with degree ≥ k on stack - Lose guarantee that anything we put on stack is colorable x w - If we’re lucky this node will still be colorable when popped from stack Be realistic: t u - If unlucky, this node will have to be spilled (allocated to memory) w - Mark as potential spill to avoid t recomputation later v School of Computer Science

Select Pop a node from the stack k = 3 Assign it a color that does not v conflict with neighbors in interference graph x This will always be possible, x w u unless the node is a potential spill t u If it is not possible spill variable and rebuild graph w t v School of Computer Science

Spilling v <- 1 Allocate w to memory w 1 <- v + 3 location M w w 1 M w []<- w 1 Spilled variables are allocated to w 2 <- M w [] w 2 the stack in an area completely controlled by the compiler. x <- w 2 + v These memory locations are v special in that they can be u <- v optimized without concern for x t <- u + x memory aliasing issues. w 3 <- M w [] w 3 <- w 3 t <- t Now Start Over... u <- u ...compute live ranges... School of Computer Science

Spilling to Memory • RISC Architectures – Only load and store can access memory • every use requires load • every def requires store • create new temporary for each location • CISC Architectures – can operate on data in memory directly • makes writing compiler easier(?), but isn’t necessarily faster – pseudo-registers inside memory operands still have to be handled School of Computer Science

Rematerialization An alternative to spilling – Recompute value of variable instead of store/load to memory – Example: v <- 1 v <- 1 w <- v + 3 w <- v + 3 x <- w + v x <- w + v u <- v u <- v t <- u + x t <- u + x <- w w <- 4 <- t <- w <- u <- t <- u School of Computer Science

Build Take Two v <- 1 k = 3 w 1 <- v + 3 v M w []<- w 1 w 2 <- M w [] x <- w 2 + v x w 1 w 2 w 3 u <- v t <- u + x u w 3 <- M w [] <- w 3 <- t t <- u Recalculate interference graph School of Computer Science

Simplify->Coalesce->Select k = 3 v x w 1 w 2 w 3 uv u t School of Computer Science

Review Build Simplify Coalesce Potential Spill Select Actual Spill School of Computer Science

Old Subtleties 1. MOVE instructions 2. Merging live ranges 3. Splitting live ranges 4. Choosing potential spills 5. Allocating spill slots 6. Coalescing is bad School of Computer Science

#1 MOVE Instructions • During liveness analysis, MOVE instructions should be treated specially t := s … Note that s and t don’t x <- ⊗ (…s…) really interfere … y <- ⊗ (…t…) Under what conditions can we remove the interference edge between s and t ? School of Computer Science

#2 Live Ranges & Merged Live Ranges A live range consists of a definition and all the points in a program (e.g. end of an instruction) in which that definition is live. – How to compute a live range? a = … a = … … = a Two overlapping live ranges for the same variable must be merged School of Computer Science

Example Live Variables {} {} Reaching Definitions A = ... (A 1 ) {A} {A 1 } IF A goto L1 {A} {A 1 } {A} {A 1 } B = ... (B 1 ) L1: {A} {A 1 } {A,B} {A 1 ,B 1 } = A C = ... (C 1 ) {A,C} {A 1 ,C 1 } {B} {A 1 ,B 1 } D = B (D 2 ) = A {C} {A 1 ,C 1 } {D} {A 1 ,B 1 ,D 2 } D = ... (D 1 ) {D} {A 1 ,C 1 ,D 1 } {D} {A 1 ,B 1 ,C 1 ,D 1 ,D 2 } A = 2 (A 2 ) Merge {A,D} {A 2 ,B 1 ,C 1 ,D 1 ,D 2 } {A,D} {A 2 ,B 1 ,C 1 ,D 1 ,D 2 } = A {D} {A 2 ,B 1 ,C 1 ,D 1 ,D 2 } ret D A live range consists of a definition and all the points in a program in which that definition is live. School of Computer Science

Merging Live Ranges • Merging definitions into equivalence classes: – Start by putting each definition in a different equivalence class – For each point in a program • if variable is live, and there are multiple reaching definitions for the variable • merge the equivalence classes of all such definitions into a one equivalence class Merged live ranges are also known as “webs” School of Computer Science

Example: Merged Live Ranges {} A = ... (A 1 ) {A 1 } IF A goto L1 {A 1 } {A 1 } B = ... (B 1 ) L1: {A 1 } {A 1 ,B 1 } = A C = ... (C 1 ) {A 1 ,C 1 } {B 1 } D = B (D 2 ) = A {C 1 } {D 1,2 } D = ... (D 1 ) {D 1,2 } {D 1,2 } A = 2 (A 2 ) {A 2 ,D 1,2 } {A 2 ,D 1,2 } = A {D 1,2 } ret D A has two “webs” makes register allocation easier School of Computer Science

Edges of Interference Graph Intuitively: Two live ranges (necessarily of different variables) may interfere if they overlap at some point in the program. Algorithm: At each point in program, enter an edge for every pair of live ranges at that point An optimized definition & algorithm for edges: For each defining inst i Let x be live range of definition at inst i For each live range y present at end of inst i insert an edge between x and y Faster? {D 1,2 } Edge between A 2 and D 1,2 A = 2 (A 2 ) Better quality? {A 2 ,D 1,2 } School of Computer Science

Reducing Register Pressure Splitting variables into live ranges creates an interference graph that is easier to color Eliminate interference in a variable’s “dead” zones. Increase flexibility in allocation: can allocate same variable to different registers A = … B = … C = … … = A … = A D = B D = C A = D ret A School of Computer Science

#3 Live Range Splitting Split a live range into smaller regions (by paying a small cost) to create an interference graph that is easier to color Eliminate interference in a variable’s nearly dead zones. Cost: Memory loads and stores Load and store at boundaries of regions with no activity # active live ranges at a program point can be > # registers Can allocate same variable to different registers Cost: Register operations a register copy between regions of different assignments # active live ranges cannot be > # registers School of Computer Science

Examples Example 1: Example 2: A = …; B = …; A = … FOR i = 0 TO 10 FOR j = 0 TO 10000 A = A + ... (does not use B) B = … C = … FOR j = 0 TO 10000 … = A+B … = A+C B = B + ... C = … B = … (does not use A) … = B+C School of Computer Science

One Algorithm • Observation: Spilling is absolutely necessary if not degree in graph – number of live ranges active at a program point > n • Apply live-range splitting before coloring – Identify a point where number of live ranges > n – For each live range active around that point • find the outermost “block construct” that does not access the variable – Choose a live range with the largest inactive region – Split the inactive region from the live range School of Computer Science

#4 What to Spill? When choosing potential spill node want: – A node that makes graph easier to color • Fewer spills later – A node that isn’t “expensive” to spill • An expensive node would slow down the program if spilled – We can apply heuristics both when choosing potential spill nodes and when choosing actual spill nodes • not required to spill node that we popped off stack and can’t color School of Computer Science

A Spill Heuristic Pick node (live range) n that minimizes: � � 10 depth ( def ) 10 depth ( use ) + def � n use � n degree( n ) This heuristic prefers nodes that: – Are used infrequently – Aren’t used inside of loops – Have a large degree Could use any one of several other heuristics as well… School of Computer Science

#5 Spill coalescing • On machines with few registers (like the Pentium), a lot of temps can get spilled – activation records get big – memory-to-memory moves get generated • For these reasons, it is good to do spill coalescing School of Computer Science

Register Allocation II 15-745 Optimizing Compilers Spring 2006 - PowerPoint PPT Presentation

Register Allocation II 15-745 Optimizing Compilers Spring 2006 David Koes School of Computer Science Outline Review Old Subtleties New Subtleties School of Computer Science Review Build Simplify Coalesce Potential Spill

More Register Allocation Last time Register allocation Global allocation via graph

1 Coalescing Logistics Computing the Interference Graph (in MiniJava compiler) Rule Use results

Register Allocation (via graph coloring and spilling) Register allocation LLVM IR uses an

Register allocation Michel Schinz Advanced Compiler Construction 2008-05-16 Register

Global Register Allocation Memory Hierarchy Management Register Allocation via Graph

Outline Fine-Grain Register Allocation Based on a Global Spill Costs Analysis Graph coloring

Global Register Allocation Lecture Outline Memory Hierarchy Management Register

Register allocation Michel Schinz based on Erik Stenmans slides Register allocation

Register Allocation cs5363 1 Register Allocation And Assignment Values in registers are easier

Register allocation Michel Schinz based on Erik Stenmans slides Register allocation

Compilers Register Allocation Alex Aiken Register Allocation Intermediate code uses

Low-Level Issues Last lecture Interprocedural analysis Today Start low-level issues

Register Allocation Based on slides by E. Ernst Register Allocation Recall: Interference graph

Control Unit Datapath Elements & Single Cycle Datapath Unit Register Files Register Layout

CS453 INTRODUCTION TO DATAFLOW ANALYSIS CS453 Lecture Register allocation using liveness

Outline What is register allocation P3 / 2003 Webs Interference Graphs Register

BASIC FACTS CONCERNING ACTIONS OF AMENABLE GROUPS ON COMPACT SPACES TOMASZ DOWNAROWICZ based on

UC Updatable Databases and Applications AFRICACRYPT 2020 Aditya Damodaran and Alfredo Rial SnT,

Aligning Mission, Culture & Internal Branding Aligning Mission, Culture & Internal

society is the history of all class struggles GS address to Political School 26 January 2014

Retroconversion Of A Complex Etymological Dictionary European Master in Lexicography 2009-2010

Mixed Reality Service Tony Parisi (as proxy gateway for) Mark Pesce www.mixedrealitysystem.org

Understanding (or not) Deep Convolutional Networks Stphane Mallat cole Normale Suprieure

PAST STA NOO NOODLE (Bari rilla lla P Penne Past sta) a) Feb ebruary 26, 26, 2016 2016

Register Allocation II 15-745 Optimizing Compilers Spring 2006 - PowerPoint PPT Presentation

Register Allocation II 15-745 Optimizing Compilers Spring 2006 David Koes School of Computer Science Outline Review Old Subtleties New Subtleties School of Computer Science Review Build Simplify Coalesce Potential Spill

More Register Allocation Last time Register allocation Global allocation via graph

1 Coalescing Logistics Computing the Interference Graph (in MiniJava compiler) Rule Use results

Register Allocation (via graph coloring and spilling) Register allocation LLVM IR uses an

Register allocation Michel Schinz Advanced Compiler Construction 2008-05-16 Register

Global Register Allocation Memory Hierarchy Management Register Allocation via Graph

Outline Fine-Grain Register Allocation Based on a Global Spill Costs Analysis Graph coloring

Global Register Allocation Lecture Outline Memory Hierarchy Management Register

Register allocation Michel Schinz based on Erik Stenmans slides Register allocation

Register Allocation cs5363 1 Register Allocation And Assignment Values in registers are easier

Register allocation Michel Schinz based on Erik Stenmans slides Register allocation

Compilers Register Allocation Alex Aiken Register Allocation Intermediate code uses

Low-Level Issues Last lecture Interprocedural analysis Today Start low-level issues

Register Allocation Based on slides by E. Ernst Register Allocation Recall: Interference graph

Control Unit Datapath Elements &amp; Single Cycle Datapath Unit Register Files Register Layout

CS453 INTRODUCTION TO DATAFLOW ANALYSIS CS453 Lecture Register allocation using liveness

Outline What is register allocation P3 / 2003 Webs Interference Graphs Register

BASIC FACTS CONCERNING ACTIONS OF AMENABLE GROUPS ON COMPACT SPACES TOMASZ DOWNAROWICZ based on

UC Updatable Databases and Applications AFRICACRYPT 2020 Aditya Damodaran and Alfredo Rial SnT,

Aligning Mission, Culture &amp; Internal Branding Aligning Mission, Culture &amp; Internal

society is the history of all class struggles GS address to Political School 26 January 2014

Retroconversion Of A Complex Etymological Dictionary European Master in Lexicography 2009-2010

Mixed Reality Service Tony Parisi (as proxy gateway for) Mark Pesce www.mixedrealitysystem.org

Understanding (or not) Deep Convolutional Networks Stphane Mallat cole Normale Suprieure

PAST STA NOO NOODLE (Bari rilla lla P Penne Past sta) a) Feb ebruary 26, 26, 2016 2016

Control Unit Datapath Elements & Single Cycle Datapath Unit Register Files Register Layout

Aligning Mission, Culture & Internal Branding Aligning Mission, Culture & Internal