EECS 583 Class 7 Dataflow Analysis Static Single Assignment Form - PowerPoint PPT Presentation

EECS 583 – Class 7 Dataflow Analysis Static Single Assignment Form University of Michigan September 24, 2014

Announcements & Reading Material ❖ HW2 is out if you missed it, due Oct 9th ❖ Today’s class » Ron Cytron, Jeanne Ferrante, Barry K. Rosen, Mark N. Wegman, F. Kenneth Zadeck: An Efficient Method of Computing Static Single Assignment Form. POPL 1989 » “Practical Improvements to the Construction and Destruction of Static Single Assignment Form,” P. Briggs, K. Cooper, T. Harvey, and L. Simpson, Software--Practice and Experience , 28(8), July 1998, pp. 859-891. ❖ Next class – Optimization » Muchnick: Chapters 11-13 » Dragon book: Chapter 9 - 1 -

❖ Wrap up dataflow analysis ❖ We have talked about » Liveness Analysis » Reaching Definition ❖ We will talk about » General Framework for dataflow analysis » 2 more examples: Available Definitions and Available Expression - 2 -

Generalizing Dataflow Analysis ❖ Transfer function » How information is changed by “something” (BB) » OUT = GEN + (IN – KILL) /* forward analysis */ » IN = GEN + (OUT – KILL) /* backward analysis */ ❖ Meet function » How information from multiple paths is combined » IN = Union(OUT(predecessors)) /* forward analysis */ » OUT = Union(IN(successors)) /* backward analysis */ ❖ Generalized dataflow algorithm » while (change) Ÿ change = false Ÿ for each BB ◆ apply meet function ◆ apply transfer functions ◆ if any changes à change = true - 3 -

What About All Path Problems? ❖ Up to this point » Any path problems (maybe relations) Ÿ Definition reaches along some path Ÿ Some sequence of branches in which def reaches Ÿ Lots of defs of the same variable may reach a point » Use of Union operator in meet function ❖ All-path: Definition guaranteed to reach » Regardless of sequence of branches taken, def reaches » Can always count on this » Only 1 def can be guaranteed to reach » Availability (as opposed to reaching) Ÿ Available definitions Ÿ Available expressions (could also have reaching expressions, but not that useful) - 4 -

Reaching vs Available Definitions 1:r1 = r2 + r3 2:r6 = r4 – r5 1,2 reach 1,2 available 3:r4 = 4 1,2 reach 4:r6 = 8 1,2 available 1,3,4 reach 1,3,4 available 5:r6 = r2 + r3 6:r7 = r4 – r5 1,2,3,4 reach 1 available - 5 -

Available Definition Analysis (Adefs) ❖ A definition d is available at a point p if along all paths from d to p, d is not killed ❖ Remember, a definition of a variable is killed between 2 points when there is another definition of that variable along the path » r1 = r2 + r3 kills previous definitions of r1 ❖ Algorithm » Direction? » Meet functions (IN/OUT)? » GEN/KILL? - 6 -

Available Definition Analysis (Adefs) ❖ A definition d is available at a point p if along all paths from d to p, d is not killed ❖ Remember, a definition of a variable is killed between 2 points when there is another definition of that variable along the path » r1 = r2 + r3 kills previous definitions of r1 ❖ Algorithm » Forward dataflow analysis as propagation occurs from defs downwards » Use the Intersect function as the meet operator to guarantee the all-path requirement » GEN/KILL/IN/OUT similar to reaching defs Ÿ Initialization of IN/OUT is the tricky part - 7 -

Compute GEN/KILL Sets for each BB (Adefs) Exactly the same as reaching defs !!! for each basic block in the procedure, X, do GEN(X) = 0 KILL(X) = 0 for each operation in sequential order in X, op, do for each destination operand of op, dest, do G = op K = {all ops which define dest – op} GEN(X) = G + (GEN(X) – K) KILL(X) = K + (KILL(X) – G) endfor endfor endfor - 8 -

Compute IN/OUT Sets for all BBs (Adefs) U = universal set of all operations in the Procedure IN(0) = 0 OUT(0) = GEN(0) for each basic block in procedure, W, (W != 0), do IN(W) = 0 OUT(W) = U – KILL(W) change = 1 while (change) do change = 0 for each basic block in procedure, X, do old_OUT = OUT(X) IN(X) = Intersect (OUT(Y)) for all predecessors Y of X OUT(X) = GEN(X) + (IN(X) – KILL(X)) if (old_OUT != OUT(X)) then change = 1 endif endfor endfor - 9 -

Available Expression Analysis (Aexprs) ❖ An expression is a RHS of an operation » r2 = r3 + r4, r3+r4 is an expression ❖ An expression e is available at a point p if along all paths from e to p, e is not killed ❖ An expression is killed between 2 points when one of its source operands are redefined » r1 = r2 + r3 kills all expressions involving r1 ❖ Algorithm » Forward dataflow analysis as propagation occurs from defs downwards » Use the Intersect function as the meet operator to guarantee the all-path requirement » Looks exactly like adefs, except GEN/KILL/IN/OUT are the RHS’s of operations rather than the LHS’s - 10 -

Computation of Aexpr GEN/KILL Sets We can also formulate the GEN/KILL slightly differently so you do not need to break up instructions like “r2 = r2 + 1”. for each basic block in the procedure, X, do GEN(X) = 0 KILL(X) = 0 for each operation in sequential order in X, op, do K = 0 for each destination operand of op, dest, do K += {all ops which use dest} endfor if (op not in K) G = op else G = 0 GEN(X) = G + (GEN(X) – K) KILL(X) = K + (KILL(X) – G) endfor endfor - 11 -

Class Problem - Aexprs Calculation 1: r1 = r6 * r9 2: r2 = r2 + 1 3: r5 = r3 * r4 4: r1 = r2 + 1 7: r7 = r3 * r4 5: r3 = r3 * r4 8: r1 = r1 + 5 6: r8 = r3 * 2 9: r7 = r1 - 6 10: r8 = r2 + 1 11: r1 = r3 * r4 12: r3 = r6 * r9 - 12 -

Dataflow Analyses in 1 Slide Liveness Reaching Definitions/DU/UD OUT = Union(IN(succs)) IN = Union(OUT(preds)) IN = GEN + (OUT – KILL ) OUT = GEN + (IN – KILL ) Bottom-up dataflow Top-down dataflow Any path Any path Keep track of variables/registers Keep track of instruction IDs Uses of variables à GEN Defs of variables à GEN Defs of variables à KILL Defs of variables à KILL Available Expressions Available Definitions IN = Intersect(OUT(preds)) IN = Intersect(OUT(preds)) OUT = GEN + (IN – KILL ) OUT = GEN + (IN – KILL ) Top-down dataflow Top-down dataflow All path All path Keep track of instruction IDs Keep track of instruction IDs Expressions of variables à GEN Defs of variables à GEN Defs of variables à KILL Defs of variables à KILL - 13 -

Some Things to Think About ❖ All dataflow is basically the same with a few parameters Ÿ Meaning of gen/kill – src vs dest, variable vs operation Ÿ Backward / Forward Ÿ All paths / some paths (must/may) Ÿ What other dataflow analysis problems can be formulated? ❖ Dataflow can be slow » How to implement it efficiently? Ÿ Forward analysis – DFS order Ÿ Backward analysis – PostDFS order » How to represent the info? ❖ Predicates » Throw a monkey wrench into this stuff » So, how are predicates handled? - 14 -

Static Single Assignment (SSA) Form ❖ Difficulty with optimization r1 = r2 + r3 » Multiple definitions of the r6 = r4 – r5 same register » Which definition reaches r4 = 4 » Is expression available? r6 = 8 r6 = r2 + r3 r7 = r4 – r5 ❖ Static single assignment » Each assignment to a variable is given a unique name » All of the uses reached by that assignment are renamed » DU chains become obvious based on the register name! - 15 -

Converting to SSA Form ❖ Trivial for straight line code x = -1 x0 = -1 y = x y = x0 x = 5 x1 = 5 z = x z = x1 ❖ More complex with control flow – Must use Phi nodes if ( ... ) if ( ... ) x0 = -1 x = -1 else else x1 = 5 x = 5 x2 = Phi(x0,x1) y = x y = x2 - 16 -

Converting to SSA Form (2) ❖ What about loops? » No problem!, use Phi nodes again i0 = 0 i = 0 do { do { i1 = Phi(i0, i2) i = i + 1 i2 = i1 + 1 } } while (i < 50) while (i2 < 50) - 17 -

SSA Plusses and Minuses ❖ Advantages of SSA » Explicit DU chains – Trivial to figure out what defs reach a use Ÿ Each use has exactly 1 definition!!! » Explicit merging of values » Makes optimizations easier ❖ Disadvantages » When transform the code, must either recompute (slow) or incrementally update (tedious) - 18 -

Phi Nodes (aka Phi Functions) ❖ Special kind of copy that selects one of its inputs ❖ Choice of input is governed by the CFG edge along which control flow reached the Phi node x0 = x1 = x2 = Phi(x0,x1) ❖ Phi nodes are required when 2 non-null paths X à Z and Y à Z converge at node Z, and nodes X and Y contain assignments to V - 19 -

SSA Construction High-level algorithm ❖ 1. Insert Phi nodes 2. Rename variables A dumb algorithm ❖ » Insert Phi functions at every join for every variable » Solve reaching definitions » Rename each use to the def that reaches it (will be unique) Problems with the dumb algorithm ❖ » Too many Phi functions (precision) » Too many Phi functions (space) » Too many Phi functions (time) - 20 -

Need Better Phi Node Insertion Algorithm A definition at n forces a Phi node at m iff n not in DOM(m), but n in DOM(p) ❖ for some predecessors p of m BB0 def in BB4 forces Phi in BB6 def in BB6 forces Phi in BB7 def in BB7 forces Phi in BB1 BB1 BB2 BB3 Phi is placed in the block that is just outside the dominated region BB4 BB5 of the definition BB Dominance frontier BB6 The dominance frontier of node X is the set of nodes Y such that BB7 * X dominates a predecessor of Y, but * X does not strictly dominate Y - 21 -

EECS 583 Class 7 Dataflow Analysis Static Single Assignment Form - PowerPoint PPT Presentation

EECS 583 Class 7 Dataflow Analysis Static Single Assignment Form University of Michigan September 24, 2014 Announcements & Reading Material HW2 is out if you missed it, due Oct 9th Todays class Ron Cytron, Jeanne Ferrante,

EECS 583 Class 5 Dataflow Analysis Intro University of Michigan September 17, 2014 Reading

EECS 583 Class 6 Dataflow Analysis University of Michigan September 22, 2014 Announcements

Naiad (Timely Dataflow) & Streaming Systems CS 848: Models and Applications of Distributed

Quantifying Dataflow Analysis with Gradients in LLVM Gabriel Ryan 1 , Abhishek Shah 1 , Dongdong

Static and Method Overloading static One per class, not per object static variables

Google Cloud Dataflow Cosmin Arad , Senior Software Engineer carad@google.com August 7, 2015

CO444H Dataflow Dataflow frameworks Ben Livshits Masters Projects Available 1. Crashes to

Dataflow Analysis Iterative Data-flow Analysis and Static-Single-Assignment cs5363 1

Static Single Assignment Form Last Time Static single assignment (SSA) form Today

EECS 583 Class 2 Control Flow Analysis LLVM Introduction University of Michigan September

EECS 583 Class 10 Code Generation University of Michigan October 6, 2014 Announcements

EECS 583 Class 4 Predicated Execution If-conversion University of Michigan September 15,

EECS 583 Advanced Compilers Course Overview, Introduction to Control Flow Analysis Fall

Wrap Up Static, Packages, Exceptions Static methods // Example: // Java's built in Math class

Static and dynamic verification Static and dynamic V&V Software inspections Concerned

Static analysis of OpenAFS code base Cheyenne Wills OpenAFS 2019 Workshop Overview What is

CS 423 Operating System Design: Systems Programming Review Tianyin Xu * Thanks for Prof. Adam

1 CSE Example CSE Approach 1 Before CSE After CSE Notation c := a + b c := a + b

Surviving the Out of Memory Killer Dave Hansen & Balbir Singh OOF Condition Airlines

Lecture 07: Masking Signals and Deferring Handlers Synchronization, multi-processing,

For Tuesday No reading (review chapter 9) Homework: Chapter 9, exercises 4 and 6

CS 10: Problem solving via Object Oriented Programming Winter

sts Prr rtt

Bourne (Again) Shell CIS 218 Bourne Again Shell Current GNU LINUX BASH Shell documentation:

EECS 583 Class 7 Dataflow Analysis Static Single Assignment Form - PowerPoint PPT Presentation

EECS 583 Class 7 Dataflow Analysis Static Single Assignment Form University of Michigan September 24, 2014 Announcements & Reading Material HW2 is out if you missed it, due Oct 9th Todays class Ron Cytron, Jeanne Ferrante,

EECS 583 Class 5 Dataflow Analysis Intro University of Michigan September 17, 2014 Reading

EECS 583 Class 6 Dataflow Analysis University of Michigan September 22, 2014 Announcements

Naiad (Timely Dataflow) &amp; Streaming Systems CS 848: Models and Applications of Distributed

Quantifying Dataflow Analysis with Gradients in LLVM Gabriel Ryan 1 , Abhishek Shah 1 , Dongdong

Static and Method Overloading static One per class, not per object static variables

Google Cloud Dataflow Cosmin Arad , Senior Software Engineer carad@google.com August 7, 2015

CO444H Dataflow Dataflow frameworks Ben Livshits Masters Projects Available 1. Crashes to

Dataflow Analysis Iterative Data-flow Analysis and Static-Single-Assignment cs5363 1

Static Single Assignment Form Last Time Static single assignment (SSA) form Today

EECS 583 Class 2 Control Flow Analysis LLVM Introduction University of Michigan September

EECS 583 Class 10 Code Generation University of Michigan October 6, 2014 Announcements

EECS 583 Class 4 Predicated Execution If-conversion University of Michigan September 15,

EECS 583 Advanced Compilers Course Overview, Introduction to Control Flow Analysis Fall

Wrap Up Static, Packages, Exceptions Static methods // Example: // Java's built in Math class

Static and dynamic verification Static and dynamic V&amp;V Software inspections Concerned

Static analysis of OpenAFS code base Cheyenne Wills OpenAFS 2019 Workshop Overview What is

CS 423 Operating System Design: Systems Programming Review Tianyin Xu * Thanks for Prof. Adam

1 CSE Example CSE Approach 1 Before CSE After CSE Notation c := a + b c := a + b

Surviving the Out of Memory Killer Dave Hansen &amp; Balbir Singh OOF Condition Airlines

Lecture 07: Masking Signals and Deferring Handlers Synchronization, multi-processing,

For Tuesday No reading (review chapter 9) Homework: Chapter 9, exercises 4 and 6

CS 10: Problem solving via Object Oriented Programming Winter

sts Prr rtt

Bourne (Again) Shell CIS 218 Bourne Again Shell Current GNU LINUX BASH Shell documentation:

Naiad (Timely Dataflow) & Streaming Systems CS 848: Models and Applications of Distributed

Static and dynamic verification Static and dynamic V&V Software inspections Concerned

Surviving the Out of Memory Killer Dave Hansen & Balbir Singh OOF Condition Airlines