263-2810: Advanced Compiler Design 6.0 Partial redundancy - PowerPoint PPT Presentation

263-2810: Advanced Compiler Design 6.0 Partial redundancy elimination Thomas R. Gross Computer Science Department ETH Zurich, Switzerland

Outline PRE with SSA in five (easy) steps § Assumption: Program in SSA form for scalar variables 1. Insert Φ functions for expressions Introduce “temporary” to isolate expression § x = a 1 + b 1 ➞ E = a 1 + b 1 x = E Identify places where different versions merge § 2. Identify/set version numbers for expressions Expressions on LHS: set version number § Operands of functions: find correct version § 2

(5 steps continued) 3. Identify places where is is legal to insert a copy of an expression § “legal” is defined later 4. Find places where the insertion on a copy is profitable Only places that are legal can be considered § Expect removal of operations § Expect reduction in cycles § Metrics for profitability to be discussed § 5. Exploit full redundancy of expressions Transform program to reuse computed values § 3

6.3 Insertion of Φ functions § Given the CFG of a program. § SSA format for scalars needs dominator tree, dominance frontier. § Keep for insertion of Φ functions § Consider an expression E = a + b in some block B. § Must insert a Φ function somewhere if § E is computed explicitly or § One of the operands of E (i.e., a or b ) is changed in block B. 4

Part 1: E computed in B B E = a 1 + b 1 B1 B2 § Find dominance frontier DF(B) § Here {B1, B2} § Insert a Φ function (unless already present) 5

Part 1: E computed in B B E = a 1 + b 1 B1 B2 E = Φ ( , ) E = Φ ( , ) § Find dominance frontier DF(B) § Here {B1, B2} § Insert a Φ function (unless already present) § This step deals with all explicit computations of E in some block 6

Part 2: Redefinition of operand(s) § Consider E = a + b and a (or b ) is defined in block B’ 7

Part 2: Redefinition of operand(s) § Consider E = a + b and a (or b ) is defined in block B’ § There must be a φ function in nodes in DF(B’) § Consider B ∈ DF(B’) a 1 = … B’’ b 1 = … E = a 1 + b 1 B’ a 2 = … B a 3 = φ (a 1 ,a 2 ) = a 3 + b 1 8

Part 2: Redefinition of operand(s) § Insert into B ( ∈ DF(B’)) a Φ function for all expressions E that contain a as an operand a 1 = … B’’ b 1 = … E = a 1 + b 1 B’ a 2 = … E = Φ ( , ) B a 3 = φ (a 1 ,a 2 ) = a 3 + b 1 9

6.4 Version numbers for expressions § If E appears on the LHS: get a new version § For the operands of a Φ function: § Identify correct version § Recall: for functions (for scalars) we used a stack of versions § Array [variable] of stacks § Use stack for operands of expression E to figure out which version is used 10

§ Consider E E , the set of all expressions of interest in the program § stack[ E ]: one stack for each expression § E ∈ E E stack[E] : stack of versions (integers) § Given an expression E = a + b , after turning program into SSA for scalars, versions of a and b are known § E = a i + b j § If we have processed E = a i + b j before (with these versions of a , b ) then we use the version k of E on top of stack[E] E k = a i + b j § § Must be the current version of E 11

§ So use version k of E if § stack(a).top = i § stack(b).top = j § and stack(E) = k with E k = a i + b j § If E = a i + b j has not been processed before, i.e., § stack(a).top = i § stack(b).top = j § and stack(E) = k with E k ≠ a i + b j (i.e., operand versions changed but we did not find a new version for E) then § Get new version (say m) § stack(E).push(m) § Use version m for E 12

Example a 1 = … B’’ b 1 = … E 2 = a 1 + b 1 E = a 1 + b 1 B’ a 2 = … E = a 2 + b 1 15

6.4 Version numbers for expressions § If E appears on the LHS: decide if a new version is needed or if the current version should be used § “current”: Version computed in a dominating node and no operand has been redefined § For the operands of a Φ function: § Identify correct version 17

§ Recall: for functions (for scalars) we used a stack of versions § Array [ V ] of stacks, with V the set of all variables § For E E , the set of all expressions of interest in the program § stack[ E ]: one stack for each expression § E ∈ E: stack[E] is a stack of versions (integers) § Given an expression E = a + b , after turning program into SSA for scalars, versions of a and b are known § E = a i + b j § If we have processed E = a i + b j before (with these versions of a , b ) then we use the version k of E on top of stack[E] E k = a i + b j § § a i , b j must be current versions of a, b § Must be the current version of E 18

§ So use version k of E if § stack(a).top = i § stack(b).top = j § and stack(E) = k with E k = a i + b j 19

§ If E = a i + b j has not been processed before, i.e., § stack(a).top = i § stack(b).top = j § and stack(E) = k with E k ≠ a i + b j (i.e., operand versions changed but we do not find a new version for E) then § Get new version (say m) § stack(E).push(m) § Use version m of E: E m 20

Example a 1 = … B’’ b 1 = … E 2 = a 1 + b 1 E = a 1 + b 1 B’ a 2 = … E = a 2 + b 1 21

Operands of Φ function § Visit nodes in depth-first order § Use dominator tree § Like when handling φ functions § After processing a block (node), record expression version for Φ function(s) in successor blocks that are not dominated § Make sure argument to Φ function reflects expression set along corresponding path 23

START Example a 1 = … b 1 = … a 2 = φ (a 4 ,a 1 ) E = a 2 + b 1 a 3 = … a 4 = φ (a 2 ,a 3 ) E = a 4 + b 1 EXIT 24

6.5 Are copies legal at point P? § Given a program in SSA format with φ and Φ functions § Versions of scalars and expressions have been determined § Given a basic block, P point at the start of block. A Φ function for E at point P with (one or more) ⊥ operands indicates that § Value of expression E is undefined if control reaches P along path that corresponds to ⊥ operand § Expression E is defined if control reaches P along a path that corresponds to version E k of E § Predecessor basic block that corresponds to ⊥ operand is a candidate to place a copy of E 26

a 1 = … b 1 = … Example B E 1 = a 1 + b 1 P E 2 = Φ ( ⊥ ,E 1 ) = E 2 EXIT 27

§ A copy of E can be inserted into B § Or any of its predecessors § (As long as operands of E are available) § Is inserting a copy acceptable? 28

§ A copy of E can be inserted into B § Or any of its predecessors § (As long as operands of E are available) § Is inserting a copy (into B) legal? § “Gold standard”: insertion of a copy must not change the program (results) 29

Detour: “must not change the program” § Given program P , transformed program T( P ) = P ’ § Transformation is legal if § P computes the same result as P ’ § Same output § Returns no new errors § Throws no new exceptions § Termination behavior unchanged § Allow that P encounters an error earlier or later § Allow that P throws an exception earlier or later § Note: values stored in memory by P may differ from values stored by P‘ § May accept that values after error/exception differ 30

a 1 = … b 1 = … Example B E 1 = a 1 + b 1 P E 2 = Φ ( ⊥ ,E 1 ) = E 2 EXIT 31

a 1 = … b 1 = … Example B E 1 = a 1 + b 1 P E 2 = Φ ( ⊥ ,E 1 ) a 2 = … EXIT E 3 = a 2 + b 1 32

Legal copies § Could we insert a copy of E into B? 33

Legal copies § Could we insert a copy of E into B? § We do not know anything about the effect of E. § Might throw an exception § Might raise an error (overflow, memory protection error, …) § A copy of E can be inserted into B if E is evaluated along all paths from B to EXIT 34

a 1 = … b 1 = … Example B E 1 = a 1 + b 1 P E 2 = Φ ( ⊥ ,E 1 ) B’ a 2 = … EXIT E 3 = a 2 + b 1 35

§ Our model of legality: insert into B only if there is a copy of E on all paths from B to EXIT § Blocks with this property are called “downsafe” § Earlier papers use the phrase “E anticipated in B” § Insertion legal: iff B is downsafe 36

Downsafety § Check if there is a path from B to EXIT that does not include E § If E occurs only as the operand to a Φ function in block B’. Check that E is used in B’ or B’ is downsafe. § We say a Φ function in block B’ is downsafe if E appears on the RHS of a statement in B’ or B’ is downsafe. § Can insert copies for Φ functions that are downsafe 37

Downsafety § Need to check that along all paths from B to EXIT there is a real occurrence of E or a downsafe Φ function § Simple algorithm: § Start at EXIT § Visit recursively all predecessor nodes, until all nodes have been visited § Mark a Φ function as downsafe iff a real occurrence or a downsafe Φ function appears on all paths to EXIT § After visiting all nodes: downsafe Φ functions are marked § Downsafety is a necessary condition 38

Example = E 1 E 2 = Φ (E 1 , ⊥ ) = E 2 E 3 = Φ (E 2 , ⊥ ) = E 3 EXIT 39

6.6 Profitable copy insertion § Goal: insert a copy of E into basic block if 1. Basic block is downsafe (for E) 2. Optimization reduces execution time § Problem: (2) depends on later phases of compiler § Loose coupling between IR and machine code 41

263-2810: Advanced Compiler Design 6.0 Partial redundancy - PowerPoint PPT Presentation

263-2810: Advanced Compiler Design 6.0 Partial redundancy elimination Thomas R. Gross Computer Science Department ETH Zurich, Switzerland Outline PRE with SSA in five (easy) steps Assumption: Program in SSA form for scalar variables 1.

Partner Partner Partner Partner +1 202 263 3241 +1 202 263 3241 +1 202 263 3241 + 1 202 263

Partial Redundancy Elimination CS243 Review Session Full Redundancy x = b + c y = b + c z = b

263-2810: Advanced Compiler Design 3.0 SSA form for arbitrary control flow graphs Thomas R. Gross

263-2810: Advanced Compiler Design 8.2 Scheduling for ILP processors Thomas R. Gross Computer

263-2810: Advanced Compiler Design 2.0 Sta>c Single Assignment Form Thomas R. Gross Computer

263-2810: Advanced Compiler Design Compilation with dynamic information Thomas R. Gross Computer

ASD TUG 2810 HYBRID TUGS TUGS TUGS & WORKBOATS ASD TUG 2810 HYBRID TUGS POLLUTION TUGS

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

Reuse Optimization Last time Common subexpression elimination (CSE) Today Partial

Overview Partial Constituent Fronting in German The phenomenon: Partial constituent fronting

H 375 O 132 C 88 N 6 Ca 1 P 1 H 263 O 110 C 106 N 16 P 1 http://www.smithsonianmag.com/ H 263 O

Enhancement of Water Quality and Quality of Life in an Ultra-Urban Stormshed No. 263 Begun in

Wireless Communication Systems @CS.NCTU Lecture 10: H.263 and H.263+ Instructor: Kate Ching-Ju

Principles of Compiler Design - The Brainf*ck Compiler - Clifford Wolf - www.clifford.at

Chosen-Ciphertext Security Chosen-Ciphertext Security without Redundancy without Redundancy

Kinematic Redundancy Robert Platt Northeastern University Kinematic Redundancy A

Software specification in CASL - The Common Algebraic Specification Language Till Mossakowski,

M. Beaudouin-Lafon - Groupware 1 Master Recherche - Universit Paris-Sud WYSIWIS / WYSIAWIS

Topic 3 Object Oriented Programming Encapsulation - Implementing Classes Creating large programs

Tolerance Volume Manish Mandad David Cohen-Steiner Pierre Alliez Inria Sophia Antipolis - 1

-Samples [AB98] Hyp: domain S is a smooth curve or surface. S 1 -Samples [AB98] Hyp:

CRYSTAL COLLI MATI ON EXPERI MENT AT THE TEVATRON (T- 980): I NSTALLATI ON REVI EW Nikolai

902-282: SPIRAL SLIDE P-1 Rev. 07/22/03 PARTS LIST KKJJJ SPECIFICATIONS Ref Description QTY

CSE 115 Introduction to Computer Science I Announcements Finish module pre-lab before lab exam

263-2810: Advanced Compiler Design 6.0 Partial redundancy - PowerPoint PPT Presentation

263-2810: Advanced Compiler Design 6.0 Partial redundancy elimination Thomas R. Gross Computer Science Department ETH Zurich, Switzerland Outline PRE with SSA in five (easy) steps Assumption: Program in SSA form for scalar variables 1.

Partner Partner Partner Partner +1 202 263 3241 +1 202 263 3241 +1 202 263 3241 + 1 202 263

Partial Redundancy Elimination CS243 Review Session Full Redundancy x = b + c y = b + c z = b

263-2810: Advanced Compiler Design 3.0 SSA form for arbitrary control flow graphs Thomas R. Gross

263-2810: Advanced Compiler Design 8.2 Scheduling for ILP processors Thomas R. Gross Computer

263-2810: Advanced Compiler Design 2.0 Sta&gt;c Single Assignment Form Thomas R. Gross Computer

263-2810: Advanced Compiler Design Compilation with dynamic information Thomas R. Gross Computer

ASD TUG 2810 HYBRID TUGS TUGS TUGS &amp; WORKBOATS ASD TUG 2810 HYBRID TUGS POLLUTION TUGS

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

Reuse Optimization Last time Common subexpression elimination (CSE) Today Partial

Overview Partial Constituent Fronting in German The phenomenon: Partial constituent fronting

H 375 O 132 C 88 N 6 Ca 1 P 1 H 263 O 110 C 106 N 16 P 1 http://www.smithsonianmag.com/ H 263 O

Enhancement of Water Quality and Quality of Life in an Ultra-Urban Stormshed No. 263 Begun in

Wireless Communication Systems @CS.NCTU Lecture 10: H.263 and H.263+ Instructor: Kate Ching-Ju

Principles of Compiler Design - The Brainf*ck Compiler - Clifford Wolf - www.clifford.at

Chosen-Ciphertext Security Chosen-Ciphertext Security without Redundancy without Redundancy

Kinematic Redundancy Robert Platt Northeastern University Kinematic Redundancy A

Software specification in CASL - The Common Algebraic Specification Language Till Mossakowski,

M. Beaudouin-Lafon - Groupware 1 Master Recherche - Universit Paris-Sud WYSIWIS / WYSIAWIS

Topic 3 Object Oriented Programming Encapsulation - Implementing Classes Creating large programs

Tolerance Volume Manish Mandad David Cohen-Steiner Pierre Alliez Inria Sophia Antipolis - 1

-Samples [AB98] Hyp: domain S is a smooth curve or surface. S 1 -Samples [AB98] Hyp:

CRYSTAL COLLI MATI ON EXPERI MENT AT THE TEVATRON (T- 980): I NSTALLATI ON REVI EW Nikolai

902-282: SPIRAL SLIDE P-1 Rev. 07/22/03 PARTS LIST KKJJJ SPECIFICATIONS Ref Description QTY

CSE 115 Introduction to Computer Science I Announcements Finish module pre-lab *before* lab exam

263-2810: Advanced Compiler Design 2.0 Sta>c Single Assignment Form Thomas R. Gross Computer

ASD TUG 2810 HYBRID TUGS TUGS TUGS & WORKBOATS ASD TUG 2810 HYBRID TUGS POLLUTION TUGS

CSE 115 Introduction to Computer Science I Announcements Finish module pre-lab before lab exam