GCC Compiler Overview From GCC - An Architectural Overview , D. - PDF document

GCC Compiler Overview From GCC - An Architectural Overview , D. Novillo, September 2006. CS553 Lecture Static Single Assignment Form 1 GCC Overview cont... � SSA Optimizers – � vectorization – � loop optimizations – � scalar optimizations: CCP, DCE, DSE, FRE, PRE, VRP, SRA – � field-sensitive, points-to alias analysis � RTL Optimizers – � RTL has infinite registers – � register allocation – � scheduling, SW pipelining, CSE, ... From GCC - An Architectural Overview , D. Novillo, September 2006. CS553 Lecture Static Single Assignment Form 2

LLVM Compiler Infrastructure Source: “ LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation” by Lattner and Adve Goals - lifelong analysis and optimization - modular compiler components IR is low level, control-flow graph and SSA - language-independent types: any-bit ints, float, double, pointers, arrays, structures, functions -interprocedural analysis and transformation Status -Apple is paying main developer (Chris Lattner) to continue development -built OpenGL JIT compiler with it in two weeks CS553 Lecture Value Numbering 3 Static Single Assignment Form � Last Time – � Induction variable detection and elimination and strength reduction � Today – � Program representations – � Static single assignment (SSA) form – � Program representation for sparse data-flow – � Conversion to and from SSA � Next Time – � Applications of SSA CS553 Lecture Static Single Assignment Form 4

Data Dependence � Definition – � Data dependences are constraints on the order in which statements may be executed � We say statement s 2 depends on s 1 – � Flow (true) dependence : s 1 writes memory that s 2 later reads (RAW) – � Anti-dependence : s 1 reads memory that s 2 later writes (WAR) – � Output dependences : s 1 writes memory that s 2 later writes (WAW) – � Input dependences : s 1 reads memory that s 2 later reads (RAR) � True dependences – � Flow dependences represent actual flow of data � False dependences – � Anti- and output dependences reflect reuse of memory, not actual data flow; can often be eliminated CS553 Lecture Static Single Assignment Form 5 Example s 1 a = b; � flow s 2 b = c + d; � anti s 3 e = a + d; output � input s 4 b = 3; � s 5 f = b * 2; � CS553 Lecture Static Single Assignment Form 6

Representing Data Dependences � Implicitly – � Using variable defs and uses – � Pros: simple – � Cons: hides data dependence (analyses must find this info) � Def-use chains (du chains) – � Link each def to its uses – � Pros: explicit; therefore fast – � Cons: must be computed and updated, space consuming � Alternate representations – � e.g., Static single assignment form (SSA), Program Dependence Graph (PDG), dependence flow graphs (DFG), value dependence graphs (VDG), CS553 Lecture Static Single Assignment Form 7 DU Chains � Definition – � du chains link each def to its uses � Example s 1 a = b; � s 2 b = c + d; du chain � s 3 e = a + d; � s 4 b = 3; � s 5 f = b * 2; � CS553 Lecture Static Single Assignment Form 8

UD Chains � Definition – � ud chains link each use to its defs � Example s 1 a = b; � ud chain s 2 b = c + d; � s 3 e = a + d; � s 4 b = 3; � s 5 f = b * 2; � CS553 Lecture Static Single Assignment Form 9 Role of Alternate Program Representations � Advantage – � Allow analyses and transformations to be simpler & more efficient/effective � Disadvantage – � May not be “executable” (requires extra translations to and from) – � May be expensive (in terms of time or space) � Process Optimized Code (RTL) Original Code (RTL) T2 T1 SSA Code1 SSA Code2 SSA Code3 CS553 Lecture Static Single Assignment Form 10

Static Single Assignment (SSA) Form � Idea – � Each variable has only one static definition – � Makes it easier to reason about values instead of variables – � Similar to the notion of functional programming � Transformation to SSA – � Rename each definition – � Rename all uses reached by that assignment � Example � v := ... � v 0 := ... � ... := ... v ... � ... := ... v 0 ... � v := ... � v 1 := ... � ... := ... v ... � ... := ... v 1 ... � What do we do when there’s control flow? CS553 Lecture Static Single Assignment Form 11 SSA and Control Flow � Problem – � A use may be reached by several definitions 1 2 v := ... 3 v := ... 1 ...v... 4 v 0 :=... v 1 :=... 2 3 4 ...v?... CS553 Lecture Static Single Assignment Form 12

SSA and Control Flow (cont) � Merging Definitions – � � -functions merge multiple reaching definitions � Example 1 v 0 :=... v 1 :=... 2 3 v 2 := � (v 0 ,v 1 ) 4 ...v 2 ... CS553 Lecture Static Single Assignment Form 13 Another Example v := 1 v 0 := 1 1 1 v := v+1 v 1 := � (v 0 ,v 2 ) 2 2 v 2 := v 1 +1 CS553 Lecture Static Single Assignment Form 14

SSA vs. ud/du Chains � SSA form is more constrained � Advantages of SSA – � More compact – � Some analyses become simpler when each use has only one def – � Value merging is explicit – � Easier to update and manipulate? � Furthermore – � Eliminates false dependences (simplifying context) � for (i=0; i<n; i++) A[i] = i; Unrelated uses of i are given for (i=0; i<n; i++) different variable names print(foo(i)); CS553 Lecture Static Single Assignment Form 15 SSA vs. ud/du Chains (cont) � Worst case du-chains? � switch (c1) { � case 1: x = 1; break; � case 2: x = 2; break; � case 3: x = 3; break; � } x 4 = � (x 1 , x 2 , x 3 ) � switch (c2) { � case 1: y1 = x; break; � case 2: y2 = x; break; � case 3: y3 = x; break; � case 4: y4 = x; break; � } m defs and n uses leads to m � n du chains CS553 Lecture Static Single Assignment Form 16

Transformation to SSA Form � Two steps – � Insert � -functions – � Rename variables CS553 Lecture Static Single Assignment Form 17 Where Do We Place � -Functions? � Basic Rule – � If two distinct (non-null) paths x � z and y � z converge at node z, and nodes x and y contain definitions of variable v, then a � -function for v is inserted at z x v 1 :=... y v 2 :=... v 3 := � (v 1 ,v 2 ) z ...v 3 ... CS553 Lecture Static Single Assignment Form 18

Approaches to Placing � -Functions � Minimal – � As few as possible subject to the basic rule � Briggs-Minimal – � Same as minimal, except v must be live across some edge of the CFG Pruned – � Same as minimal, except dead � -functions are not inserted � What’s the difference between Briggs Minimal and Pruned SSA? CS553 Lecture Static Single Assignment Form 19 Briggs Minimal vs. Pruned v = v = � Briggs Minimal will add a = v = v � function because v is live across the blue edge, but Pruned SSA will not because the � = v function is dead = � (v 0 ,v 1 ) v = v = � Neither Briggs Minimal nor = v = v Pruned SSA will place a � function in this case because v is not live across any CFG edge � Why would we ever use Briggs Minimal instead of Pruned SSA? CS553 Lecture Static Single Assignment Form 20

Machinery for Placing � -Functions entry � Recall Dominators – � d dom i if all paths from entry to node i include d d dom i d – � d sdom i if d dom i and d � i i � Dominance Frontiers – � The dominance frontier of a node d is the set of nodes that are “just barely” not dominated by d; i.e., the set of nodes n, such that – � d dominates a predecessor p of n, and – � d does not strictly dominate n – � DF(d) = {n | � p � pred(n), d dom p and d !sdom n} � Notational Convenience – � DF(S) = � n � S DF(n) CS553 Lecture Static Single Assignment Form 21 Dominance Frontier Example DF(d) = {n | � p � pred(n), d dom p and d !sdom n} Nodes in Dom(5) Dom(5) = {5, 6, 7, 8} 1 DF(5) = {4, 5, 12, 13} 5 5 2 9 3 6 7 10 11 4 8 12 13 What’s significant about the Dominance Frontier? In SSA form, definitions must dominate uses CS553 Lecture Static Single Assignment Form 22

Dominance Frontier Example II DF(d) = {n | � p � pred(n), d dom p and d !sdom n} Nodes in Dom(5) Dom(5) = {5, 6, 7, 8} 1 {4, 5, 13} DF(5) = 2 5 5 3 6 7 4 8 13 In this graph, node 4 is the first point of convergence between the entry and node 5, so do we need a � - function at node 13? CS553 Lecture Static Single Assignment Form 23 SSA Exercise 1 2 v :=... 7 3 v := ... v := ... 3 4 8 9 1 2 5 10 v 4 := � (v 1 ,v 2 ) v 5 := � (v 3 ,v 4 ) 6 DF(8) = {10} {10} DF(9) = {6} DF(d) = {n | � p � pred(n), d dom p and d !sdom n} DF(2) = {10} DF({8,9}) = {6} DF(10) = {6,10} DF({2,8,9,6,10}) = See http://www.hipersoft.rice.edu/grads/publications/dom14.pdf for a more thorough description of DF. CS553 Lecture Static Single Assignment Form 24

Dominance Frontiers Revisited Suppose that node 3 defines variable x x � Def(3) DF(3) = {5} 1 4 2 3 5 6 Do we need to insert a � - function for x anywhere else? Yes. At node 6. Why? CS553 Lecture Static Single Assignment Form 25 Dominance Frontiers and SSA � Let – � DF 1 (S) = DF(S) – � DF i+1 (S) = DF(S � DF i (S)) � Iterated Dominance Frontier – � DF � (S) � Theorem – � If S is the set of CFG nodes that define variable v, then DF � (S) is the set of nodes that require � -functions for v CS553 Lecture Static Single Assignment Form 26

GCC Compiler Overview From GCC - An Architectural Overview , D. - PDF document

GCC Compiler Overview From GCC - An Architectural Overview , D. Novillo, September 2006. CS553 Lecture Static Single Assignment Form 1 GCC Overview cont... SSA Optimizers vectorization loop optimizations scalar

C Programming Basics GCC 8.2.0 & GDB 8.1.1 nike.cs.uga.edu brew macOS gcc-8

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

IEEE- -GCC, Bahrain GCC, Bahrain IEEE IEEE-GCC, Bahrain 25 October 2007 25 October 2007 25

The Retargetability Model of GCC Uday Khedker (www.cse.iitb.ac.in/grc) GCC Resource Center,

GCC Configuration and Building Uday Khedker (www.cse.iitb.ac.in/grc) GCC Resource Center,

Code optimization in GCC S ebastian Pop Universit e Louis Pasteur Strasbourg FRANCE Code

Motivation It may involve Minor changes in the main source Requires static linking We

Write gcc in C++ Ian Lance Taylor Google June 17, 2008 C++ Write gcc in C++ Ian Lance Taylor

Link Time Optimization Mechanism in GCC-4.7.2 Uday Khedker (www.cse.iitb.ac.in/uday) GCC

GCC Code Organization Emulation libraries (eg. libgcc to emulate operations not supported on

Optimierende Compiler Aufbau GIMPLE IR Uberblick uber die GNU Compiler Collection gcc

Compiling for HSA accelerators with GCC Martin Jambor SUSE Labs 8th August 2015 Outline HSA

The Economies of New Energy Technologies in Europe and the Gulf Region GCC Regional Perspective

GCC VAT Framework 6 member state, GCC Framework expected to be released shortly Minimum turnover

GULF COOPERATION COUNCIL (GCC) VAT TAXATION 1 DISCUSSION Background of the GCC VAT reforms

FDI in the GCC Countries SAMA Quarterly Workshop 7 March 2016 Salah Alsayaary 1 Agenda:

Implementing Dodd-Frank: The CFTCs Moderating Actions, as of December 16, 2012 ISDAs

IE1206 Embedded Electronics PIC-block Documentation, Seriecom Pulse sensors Le1 Le2 I , U , R ,

Exclusive Neutrino Cross Sections From Mi MiniBooNE Martin Tzanov University of Colorado PANIC

Market Access and Competitiveness of the Swiss Financial Center October 26, 2018 Content A

A Propagation Engine for GCC Diego Novillo Red Hat Canada GCC Developers Summit Jun 23,

Cyber Resiliency Office for Weapon Systems (CROWS) Mr. Dennis Miller, SES Mr. Danny Holtzman,

On the structural theory of II 1 factors of negatively curved groups IONUT CHIFAN (joint with

Honeypot technologies 2006 First Conference / tutorial Junes 2006

GCC Compiler Overview From GCC - An Architectural Overview , D. - PDF document

GCC Compiler Overview From GCC - An Architectural Overview , D. Novillo, September 2006. CS553 Lecture Static Single Assignment Form 1 GCC Overview cont... SSA Optimizers vectorization loop optimizations scalar

C Programming Basics GCC 8.2.0 &amp; GDB 8.1.1 nike.cs.uga.edu brew macOS gcc-8

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

IEEE- -GCC, Bahrain GCC, Bahrain IEEE IEEE-GCC, Bahrain 25 October 2007 25 October 2007 25

The Retargetability Model of GCC Uday Khedker (www.cse.iitb.ac.in/grc) GCC Resource Center,

GCC Configuration and Building Uday Khedker (www.cse.iitb.ac.in/grc) GCC Resource Center,

Code optimization in GCC S ebastian Pop Universit e Louis Pasteur Strasbourg FRANCE Code

Motivation It may involve Minor changes in the main source Requires static linking We

Write gcc in C++ Ian Lance Taylor Google June 17, 2008 C++ Write gcc in C++ Ian Lance Taylor

Link Time Optimization Mechanism in GCC-4.7.2 Uday Khedker (www.cse.iitb.ac.in/uday) GCC

GCC Code Organization Emulation libraries (eg. libgcc to emulate operations not supported on

Optimierende Compiler Aufbau GIMPLE IR Uberblick uber die GNU Compiler Collection gcc

Compiling for HSA accelerators with GCC Martin Jambor SUSE Labs 8th August 2015 Outline HSA

The Economies of New Energy Technologies in Europe and the Gulf Region GCC Regional Perspective

GCC VAT Framework 6 member state, GCC Framework expected to be released shortly Minimum turnover

GULF COOPERATION COUNCIL (GCC) VAT TAXATION 1 DISCUSSION Background of the GCC VAT reforms

FDI in the GCC Countries SAMA Quarterly Workshop 7 March 2016 Salah Alsayaary 1 Agenda:

Implementing Dodd-Frank: The CFTCs Moderating Actions, as of December 16, 2012 ISDAs

IE1206 Embedded Electronics PIC-block Documentation, Seriecom Pulse sensors Le1 Le2 I , U , R ,

Exclusive Neutrino Cross Sections From Mi MiniBooNE Martin Tzanov University of Colorado PANIC

Market Access and Competitiveness of the Swiss Financial Center October 26, 2018 Content A

A Propagation Engine for GCC Diego Novillo Red Hat Canada GCC Developers Summit Jun 23,

Cyber Resiliency Office for Weapon Systems (CROWS) Mr. Dennis Miller, SES Mr. Danny Holtzman,

On the structural theory of II 1 factors of negatively curved groups IONUT CHIFAN (joint with

Honeypot technologies 2006 First Conference / tutorial Junes 2006

C Programming Basics GCC 8.2.0 & GDB 8.1.1 nike.cs.uga.edu brew macOS gcc-8