IR Simone Campanoni simonec@eecs.northwestern.edu Outline IR - PowerPoint PPT Presentation

IR Simone Campanoni simonec@eecs.northwestern.edu

Outline • IR • Explicit control flows • Explicit data types

A compiler High level programming language Front-end IR Middle-end IR Today: translating explicit control flow and data types Back-end Instruction selection Register allocation Assembly generation Machine code

L3 IR define :main (){ define void :main (){ %myRes <- call :myF(5) :entry %v1 <- %myRes * 4 int64 %myRes %v2 <- %myRes + %v1 int64 %v1 return %v2 int64 %v2 } %myRes <- call :myF(5) define :myF (%p1){ %v1 <- %myRes * 4 %p2 <- %p1 + 1 %v2 <- %myRes + %v1 return %p2 return %v2 } } define int64 :myF (int64 %p1){ :myLabel int64 %p1 int64 %p2 %p2 <- %p1 + 1 return %p2 }

L3 L3 p ::= f + ::= define label ( vars ) { i + } f i ::= var <- s | var <- t op t | var <- t cmp t | var <- load var | store var <- s | return | return t| label | br label | br var label | call callee ( args ) | var <- call callee ( args ) callee ::= u | print | allocate | array-error vars ::= | var | var (, var)* args ::= | t | t (, t)* s ::= t | label t ::= var | N u ::= var | label op ::= + | - | * | & | << | >> cmp ::= < | <= | = | >= | > N ::= (+|-)? [1-9][0-9]* label ::= :name var ::= %name name::= sequence of chars matching [a-zA-Z_][a-zA-Z_0-9]*

IR IR p ::= f + ::= define T label ( (type var)* ) { bb + } f define int64 :myF (int64 %p1){ bb ::= label i * te :myLabel te ::= br label | br t label label | return | return t int64 %p1 i ::= type var | var <- s | var <- t op t | var <- var([t]) + | var([t]) + <- s | var <- length var t | int64 %p2 call callee ( args? ) | var <- call callee ( args? ) | return %p2 var <- new Array(args) | var <- new Tuple(t) } T ::= type | void type ::= int64([])* | tuple | code callee ::= u | print | array-error args ::= t | t (, t)* s ::= t | label t ::= var | N u ::= var | label N ::= (+|-)? [1-9][0-9]* op ::= + | - | * | & | << | >> | < | <= | = | >= | > label ::= :[a-zA-Z_][a-zA-Z_0-9]* var ::= sequence of chars matching %[a-zA-Z_][a-zA-Z_0-9]*

IR IR p ::= f + ::= define T label ( (type var)* ) { bb + } f define int64 :myF (int64 %p1){ bb ::= label i * te :myLabel te ::= br label | br t label label | return | return t int64[] %v i ::= type var | var <- s | var <- t op t | var <- var([t]) + | var([t]) + <- s | var <- length var t | %v <- new Array(7) call callee ( args? ) | var <- call callee ( args? ) | return 0 var <- new Array(args) | var <- new Tuple(t) } T ::= type | void type ::= int64([])* | tuple | code callee ::= u | print | array-error args ::= t | t (, t)* s ::= t | label t ::= var | N u ::= var | label N ::= (+|-)? [1-9][0-9]* op ::= + | - | * | & | << | >> | < | <= | = | >= | > label ::= :[a-zA-Z_][a-zA-Z_0-9]* var ::= sequence of chars matching %[a-zA-Z_][a-zA-Z_0-9]*

IR IR p ::= f + ::= define T label ( (type var)* ) { bb + } f define int64 :myF (int64 %p1){ bb ::= label i * te :myLabel te ::= br label | br t label label | return | return t int64 %c i ::= type var | var <- s | var <- t op t | var <- var([t]) + | var([t]) + <- s | var <- length var t | %c <- %p1 >= 3 call callee ( args? ) | var <- call callee ( args? ) | br %c :true :false var <- new Array(args) | var <- new Tuple(t) T ::= type | void :true type ::= int64([])* | tuple | code return 1 callee ::= u | print | array-error args ::= t | t (, t)* s ::= t | label :false t ::= var | N return 0 u ::= var | label } N ::= (+|-)? [1-9][0-9]* op ::= + | - | * | & | << | >> | < | <= | = | >= | > label ::= :[a-zA-Z_][a-zA-Z_0-9]* var ::= sequence of chars matching %[a-zA-Z_][a-zA-Z_0-9]*

Now that you know the IR language Rewrite your L3 programs in IR and write a new IR program with more than 40 instructions

Outline • IR • Explicit control flows • Explicit data types

IR features • Basic blocks and control Flow Graph (CFG) • The middle-end job: analyze, analyze, analyze , and transform • To help analyzing the IR: explicit control flow • Liveness analysis is a simple example of what the middle-end does • Your liveness analysis had to “learn” who were the successors of an instruction • Successor/predecessor of an instruction: control flows • If I have 1000 code analyses, do they all have to “learn” the control flows? • Control flows need to be explicit in the code to simplify the middle-end

Representing the control flow of the program • Most instructions • Jump instructions • Branch instructions

Representing the control flow of the program A graph where nodes are instructions • Very large • Lot of straight-line connections • Can we simplify it? Basic block Sequence of instructions that is always entered at the beginning and exited at the end

Basic blocks A basic block is a maximal sequence of instructions such that • Only the first one can be reached from outside this basic block • All* instructions within are executed consecutively if the first one get executed • Only the last instruction can be a branch/jump • Only the first instruction can be a label • The storing sequence = execution order in a basic block

Inst = F.entryPoint() B = new BasicBlock() Basic blocks in compilers While (Inst){ if Inst is Label && B ∉𝟙 { • Automatically identified What about calls? B = new BasicBlock() • Algorithm: - Program exits } • Code changes trigger the re-identification - Exceptions B.add(Inst) • Increase the compilation time if Inst is Branch/Jump{ B = new BasicBlock() • Enforced by design } • Instruction exists only within the context of its basic block Inst = F.nextInst(Inst) • To define a function: } • you define its basic blocks first Add missing labels • Then you define the instructions of each basic block Add explicit jumps Delete empty basic blocks

Control Flow Graph (CFG) • A CFG is a graph G = <Nodes, Edges> • Nodes: Basic blocks Predecessor • Edges: (x,y) ϵ Edges iff … first instruction in basic block y might be executed ... just after the last instruction of the basic block x Ix Successor Iy ... ...

Control Flow Graph (CFG) • Entry node: block with the first instruction of the function • All basic blocks beside the first can be stored in any order • Exit nodes: blocks with the return instruction • Some compilers make a single exit node by adding a special node ret ret

IR IR p ::= f + define void :main (){ ::= define T label ( (type var)* ) { bb + } f bb ::= label i * te :entry te ::= br label | br t label label | return | return t call :myF(1, 2) i ::= type var | var <- s | var <- t op t | var <- var([t]) + | var([t]) + <- s | var <- length var t | return call callee ( args? ) | var <- call callee ( args? ) | } var <- new Array(args) | var <- new Tuple(t) define int64 :myF (int64 %p1, int64 %p2){ T ::= type | void type ::= int64([])* | tuple | code :entry callee ::= u | print | array-error int64 %v1 vars ::= var | var (, var)* args ::= t | t (, t)* %v1 = %p1 + %p2 s ::= t | label return %v1 t ::= var | N u ::= var | label } op ::= + | - | * | & | << | >> | < | <= | = | >= | > label ::= :[a-zA-Z_][a-zA-Z_0-9]* var ::= sequence of chars matching %[a-zA-Z_][a-zA-Z_0-9]*

From CFG to a sequence of instructions • CFG is a 2-dimension representation • L3 is a 1-dimension representation • We need to linearize CFG to generate L3 • Any order will preserve the original semantics as long as the entry point BB is the first one (property of the CFG) %v1 <- 5 What is the A A %v2 <- %v1 = 3 A No jump best linearization? B br %v2 :L B B %v3 <- 1 C C :L … C D D D

Naïve solution (not ok for your homework) • Ignore the problem • In other words: the sequence of basic blocks described in the L3 program file is going to be the sequence chosen • Translate a two labels IR branch into 2 branches in L3 br %cond :TRUE :FALSE br %cond :TRUE Your work br :FALSE

From CFG to a sequence of instructions • CFG is a 2-dimension representation • L3 is a 1-dimension representation • We need to linearize CFG to generate L3 • Any order will preserve the original semantics as long as the entry point BB is the first one (property of the CFG) • Different orders will have a different #branches • We want to select the one with the lowest #branches • Run-time vs. compile-time

The tracing problem How many jumps (conditional and unconditional) A A will be executed per loop iteration? B B 2 C C D D How many jumps (conditional and unconditional) A will be executed per loop iteration? 1 C B D

CFG linearization • A trace is a sequence of basic blocks (instructions) that could be executed at run time • It can include conditional branches • A program has many overlapping traces • For our goal: • Find a set of traces that cover the whole function without any overlapping • Each basic block belongs to exactly 1 trace • Remove unconditional branches within the same trace

IR Simone Campanoni simonec@eecs.northwestern.edu Outline IR - PowerPoint PPT Presentation

IR Simone Campanoni simonec@eecs.northwestern.edu Outline IR Explicit control flows Explicit data types A compiler High level programming language Front-end IR Middle-end IR Today: translating explicit control flow and data

For the LBECA Collaboration: Rafael F. Lang, Purdue University, rafael@purdue.edu S2 Only

Chiral Magnetic Effect with Wigner Functions Dniel Bernyi 1 , Vladimir Skokov 2 , Pter Lvai

Bayes Decision Theory - II Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175 Winter 2012 - UCSD

Charged Hadrons IP resolution in -decays and reconstruction of CP- mixing angle in H A.

Logistics Final projects due this Friday, but extensions are possible Grading: 40%

1 A Low-Level IR Example Register Transfer Language (RTL) Source code High-level IR (AST)

Containment Problem for points on another reducible conic Mike Janssen (joint with A. Denkert)

Augmented Lagrangians and Decomposition in Convex and Nonconvex Programming Terry Rockafellar

Disclosure Statement of Financial Interest I currently have, or have had over the last two years,

Quantum Integer Programming 47-779 Ising Model 1 William Larimer Mellon, Founder Agenda o

Information Retrieval Modeling Russian Summer School in Information Retrieval Djoerd Hiemstra

Proof Complexity of Quantified Boolean Formulas Olaf Beyersdorff School of Computing, University

::::: E F EXT " Some - c x . PI roots of unity "

Brian Siana Najmeh Emami , Anahita Alavi, Timothy Gburek Johan Richard, Dan Stark, Dan Weisz, Ben

This weeks Message: Behold, I am making everything new 1 Peter 5:8-9 Context 1 Peter

Layered Image Representation Chuck Dyer Based on the paper Representing moving images with

INGLS Adjetivos / Adjectives Comparativo e Superlativo / Comparative and Superlative Parte 2/

Scattered multi-static array imaging and eigenvalue problems Wagner B. Muniz Department of

WAVE ENERGY UTILIZATION Antnio F. O. Falco Instituto Superior Tcnico, Universidade

Quantum Binding in Newton Potential: a Source for Dark Energy? T. S. Bir, P . Vn: Foundations

A dispersive property of the Euler-Korteweg system Corentin Audiard Laboratoire Jacques-Louis

IMPACT OF NUCLEAR MOMENTS OF INERTIA ON FISSION OBSERVABLES 6th Workshop on Nuclear Fission and

Power Aware low-power systems Techniques: In the last few years, hardware was largely

ASSEMIC : A European Project for Advanced Microhandling and - assembly Dr. Ana Almansa ASSEMIC

IR Simone Campanoni simonec@eecs.northwestern.edu Outline IR - PowerPoint PPT Presentation

IR Simone Campanoni simonec@eecs.northwestern.edu Outline IR Explicit control flows Explicit data types A compiler High level programming language Front-end IR Middle-end IR Today: translating explicit control flow and data

For the LBECA Collaboration: Rafael F. Lang, Purdue University, rafael@purdue.edu S2 Only

Chiral Magnetic Effect with Wigner Functions Dniel Bernyi 1 , Vladimir Skokov 2 , Pter Lvai

Bayes Decision Theory - II Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175 Winter 2012 - UCSD

Charged Hadrons IP resolution in -decays and reconstruction of CP- mixing angle in H A.

Logistics Final projects due this Friday, but extensions are possible Grading: 40%

1 A Low-Level IR Example Register Transfer Language (RTL) Source code High-level IR (AST)

Containment Problem for points on another reducible conic Mike Janssen (joint with A. Denkert)

Augmented Lagrangians and Decomposition in Convex and Nonconvex Programming Terry Rockafellar

Disclosure Statement of Financial Interest I currently have, or have had over the last two years,

Quantum Integer Programming 47-779 Ising Model 1 William Larimer Mellon, Founder Agenda o

Information Retrieval Modeling Russian Summer School in Information Retrieval Djoerd Hiemstra

Proof Complexity of Quantified Boolean Formulas Olaf Beyersdorff School of Computing, University

::::: E F EXT &quot; Some - c x . PI roots of unity &quot;

Brian Siana Najmeh Emami , Anahita Alavi, Timothy Gburek Johan Richard, Dan Stark, Dan Weisz, Ben

This weeks Message: Behold, I am making everything new 1 Peter 5:8-9 Context 1 Peter

Layered Image Representation Chuck Dyer Based on the paper Representing moving images with

INGLS Adjetivos / Adjectives Comparativo e Superlativo / Comparative and Superlative Parte 2/

Scattered multi-static array imaging and eigenvalue problems Wagner B. Muniz Department of

WAVE ENERGY UTILIZATION Antnio F. O. Falco Instituto Superior Tcnico, Universidade

Quantum Binding in Newton Potential: a Source for Dark Energy? T. S. Bir, P . Vn: Foundations

A dispersive property of the Euler-Korteweg system Corentin Audiard Laboratoire Jacques-Louis

IMPACT OF NUCLEAR MOMENTS OF INERTIA ON FISSION OBSERVABLES 6th Workshop on Nuclear Fission and

Power Aware low-power systems Techniques: In the last few years, hardware was largely

ASSEMIC : A European Project for Advanced Microhandling and - assembly Dr. Ana Almansa ASSEMIC

::::: E F EXT " Some - c x . PI roots of unity "