SSA and DFAs Simone Campanoni simonec@eecs.northwestern.edu SSA - PowerPoint PPT Presentation

SSA and DFAs Simone Campanoni simonec@eecs.northwestern.edu

SSA Outline • SSA and why? • Reaching definitions, constant propagation with SSA forms • SSA in LLVM • Generate SSA code

Def-use chains v = 3 Within your CAT: you can follow def-use chains … e.g., i->getUses() … = v + 1 … … in both directions e.g., i->getDefinitions() … = v * 2 … CFG

Def-use chains v = 3 Within your CAT: you can follow def-use chains … e.g., i->getUses() … = v + 1 v = 5 … in both directions e.g., i->getDefinitions() … = v * 2 An use can get data from multiple definitions • … depending on the control flow executed This is why we need to propagate • data-flow values CFG through all possible control flows

Def-use chain and DFA OUT[ENTRY] = { }; for (each instruction i other than ENTRY) OUT[i] = { }; while (changes to any OUT occur) for (each instruction i other than ENTRY) { IN[ i ] = ∪ p a predecessor of i OUT[ p ]; OUT[ i ] = GEN[ i ] ∪ (IN[ i ] ─ KILL[ i ]); } } Given a variable t, we need to find all definitions of t in the CFG i: t <- … i: … GEN[ i ] = {i} GEN[ i ] = {} KILL[ i ] = defs(t) – {i} KILL[ i ] = {}

Static Single Assignment (SSA) Form • A variable is set only by one instruction in the function body %myVar = … A static assignment can be executed more than once start While (…){ %myVar = ... } def • The definition always dominates all its uses • Code analyses and transformations that assume SSA are (typically) use faster, they use less memory, and they include less code (compared to their non-SSA versions)

Compilers using SSA • Go • LLVM (IR) • WebKit • Swift (SIL) • Erlang • Recent GCC (GIMPLE IR) • LuaJit • Mono • IBM open source JVM • Portable.NET • … • Mozilla Firefox SpiderMonkey JavaScript engine (IR) • Chromium V8 JavaScript engine (IR) • PyPy • Android’s new optimizing compiler • PhP

LLVM IR: SSA and not SSA example float myF (float par1, float par2, float par3){ return (par1 * par2) + par3; } define float @myF(float %par1, float %par2, float %par3) { A S S %1 = fmul float %par1, %par2 T O %1 = fadd float %1, %par3 N ret float %1 } define float @myF(float %par1, float %par2, float %par3) { %1 = fmul float %par1, %par2 %2 = fadd float %1, %par3 SSA ret float %2 }

Consequences of SSA • Unrelated uses of the same variable in source code become different variables in the SSA form No WAW, WAR v = 5; v1 = 5 print(v); call print(v1) data dependencies To SSA IR v = 42; v2 = 42 between variables! print(v); call print(v2) • Use—def chain are greatly simplified • Data-flow analysis are simplified (… in the next slides) • Code analysis (e.g., data flow analysis) can be designed to run faster

Motivation for SSA • Code analysis needs to represent facts at every program point define float @myF(float %par1, float %par2, float %par3) { %1 = fmul float %par1, %par2 Definition of %1 reaches here %2 = fadd float %1, %par3 Definition of %1 reaches here ret float %2 } • What if • There are a lot of facts and there are a lot of program points? • Potentially takes a lot of space/time • Code analyses run slow • Compilers run slow

Example: reaching definition We iterate over instructions and if a new instruction doesn’t redefine x, This is a dense representation then, we keep propagating “x=3” of data-flow values This is needed to know whether this x can/must/cannot be equal to 3

Sparse representation • Instead, we’d like to use a sparse representation • Only propagate facts about x where they’re needed • Exploit static single assignment form • Each variable is defined (assigned to) exactly once • Definitions dominate their uses

Static Single Assignment (SSA) Add SSA edges from definitions to uses • No intervening statements define variable • Safe to propagate facts about x only along SSA edges Why can’t we do in non-SSA IRs? No guarantee that • def dominates use No guarantee • about which def will be the last def before an use

What about join nodes in the CFG? • Add Φ functions to model joins • One argument for each incoming branch • Operationally • selects one of the arguments based on how control flow reach this node • The backend needs to eliminate Φ nodes b = d + 1 b2 = d + 1 b2 = d + 1 b = c + 1 b1 = c + 1 b1 = c + 1 b3=Φ(b1, b2) If (b > N) If (? > N) If (b3 > N) SSA Still not SSA Not SSA

Eliminating Φ in the back-end • Basic idea: Φ represents facts that value of join may come from different paths • So just set along each possible path b2 = d + 1 b1 = c + 1 b2 = d + 1 b1 = c + 1 b3 = b2 b3 = b1 b3=Φ(b1, b2) If (b3 > N) If (b3 > N) Not SSA

Eliminating Φ in practice • Copies performed at Φ may not be useful • Joined value may not be used later in the program (So why leave it in?) • Use dead code elimination to kill useless Φs • Subsequent register allocation will map the variables onto the actual set of machine register

SSA efficiency in practice

Consequences of SSA • Unrelated uses of the same variable in source code become different variables in the SSA form v = 5; v1 = 5 print(v); call print(v1) To SSA IR v = 42; v2 = 42 print(v); call print(v2) • Use—def chain are greatly simplified • Data-flow analysis are simplified • Code analysis (e.g., data flow analysis) can be designed to run faster

Def-use chain OUT[ENTRY] = { }; for (each instruction i other than ENTRY) OUT[i] = { }; while (changes to any OUT occur) for (each instruction i other than ENTRY) { IN[ i ] = ∪ p a predecessor of i OUT[ p ]; OUT[ i ] = GEN[ i ] ∪ (IN[ i ] ─ KILL[ i ]); } } i: t <- … i: … GEN[ i ] = {i} GEN[ i ] = {} KILL[ i ] = defs(t) – {i} KILL[ i ] = {}

Def-use chain with SSA OUT[ENTRY] = { }; for (each instruction i other than ENTRY) OUT[i] = { }; while (changes to any OUT occur) for (each instruction i other than ENTRY) { IN[ i ] = ∪ p a predecessor of i OUT[ p ]; OUT[ i ] = GEN[ i ] ∪ (IN[ i ] ─ KILL[ i ]); } } i: t <- … i: … GEN[ i ] = {i} GEN[ i ] = {} KILL[ i ] = {} KILL[ i ] = {}

Code example i: b0 = 1 ?: b0 = b0 + 2 j:b1 = b0 + 1 Question answered by reaching definition analysis: does the definition “i” reach “j”?

Code example How should we design i: b0 = 1 constant propagation for SSA IRs? k:b2 = 2 j:b1 = 1 + 1 p:b3=Φ(b1, b2) z:return b3 Does it mean we can always propagate constants to variable uses? What are the definitions of b3 that reach “z”?

SSA in LLVM • The IR must be in SSA all the time • Checked at boundaries of passes • No time wasted converting automatically IR to its SSA form • CAT designed with this constraint in mind • Φ instructions only at the top of a basic block

SSA in LLVM: Φ instructions When the predecessor just executed is %4 store the constant 1 to %.0

SSA in LLVM: Φ instructions When the predecessor just executed is %5 store %6 to %.0

SSA in LLVM: Φ instructions • A PHI instruction can have many (predecessor,value) pairs as inputs • A PHI instruction must have one pair per predecessor • A PHI instruction must have at least one pair • A PHI instruction is a definition • Hence, it must dominates all its uses

SSA in LLVM: Variable def-use chains • Iterate over users of a definition: for (auto &user : i.users()){ i is the definition of %v i: %v = … if (auto j = dyn_cast<Instruction>(&user)){ j is a user of i … This fact is called “use” } } j: … = %v • Iterate over uses for (auto &use : i.uses()){ Use User User *user = use.getUser(); if (auto j = dyn_cast<Instruction>(user)){ … Instruction Constant … } }

SSA in LLVM: Basic block def-use chains • Def = definition of a basic block • User = ?

SSA in LLVM: Function def-use chains • Def = definition of a function • User = ?

SSA in LLVM: variables • Let’s say we have the following C code: • The equivalent bitcode is the following: • %3, %5, and %.0 are variables. How can we access them? E.g., Function::getVariable(%3) E.g., Instruction::getVariableDefined() • It seems variables do not exist from the LLVM API!

SSA in LLVM: variables (2) Value Instruction Argument I.getOperand(0) returns an instruction pointer ( llvm::Instruction * ) I.getOperand(0) returns an argument pointer ( llvm::Argument * ) The variable defined by an instruction is represented by the instruction itself! This is thanks to the SSA representation Value * Instruction::getOperand(unsigned i) Value * CallInst::getArgOperand(unsigned i)

SSA in LLVM: variables (3) • The variable defined by an instruction is represented by the instruction itself • How can we find out the type of the variable defined? Type *varType = inst->getType() if (varType->isIntegerTy()) … if (varType->isIntegerTy(32)) … if (varType->isFloatingPointTy()) … Type PointerType … IntegerType

SSA and DFAs Simone Campanoni simonec@eecs.northwestern.edu SSA - PowerPoint PPT Presentation

SSA and DFAs Simone Campanoni simonec@eecs.northwestern.edu SSA Outline SSA and why? Reaching definitions, constant propagation with SSA forms SSA in LLVM Generate SSA code Def-use chains v = 3 Within your CAT: you can follow

SSA Technicalities Last Time Introduced SSA Today Aliasing in SSA Building SSA

SSA form SSA form Michel Schinz SSA form Straight-line code Transforming a piece of

SSA Form & SSA-form: x 17-4 Each name is defined exactly once. Dead Code Elimination

CO444H SSA SSA Construction SSA-based analysis Ben Livshits 1 Refresher: Reaching Definitions

CPSC 121: Models of Computation 2018S DFAs in Depth Meghan Allen, based on notes by Steve

SSA form Static single assignment (SSA) form Michel Schinz Advanced Compiler Construction

CPSC 121: Models of Computation Module 11: Sets, Functions and DFAs. CPSC 121 2016W T1 1

CSE 311: Foundations of Computing highlights Fall 2014 DFAs Regular Expressions Lecture

Chapter Four: DFA Applications Formal Language, chapter 4, slide 1 1 We have seen how DFAs

Tree SSA Tree SSA Design and Implementation Design and Implementation Diego Novillo Red Hat

Abstract This report examines the results of seventeen SSA ACQ/PEAKUP observations executed

Scalable Array SSA and Array Dataflow Analysis Silvius Rus Guobin He Lawrence Rauchwerger SSA

Clil project An English splash into the Mediterranean Sea Prof.ssa Caterina Moccia Prof.ssa

SSA in Scheme Static single assignment (SSA) : assignment conversion (boxing),

2016 Age Friendly SSA Office of Research & Evaluation Survey Findings Conducted by

E ssa y Que stio n De ve lo pme nt & Asse ssme nt Wor kshops - Oc t. 10, 2014 E ast AM

[Galois-disjoint] Polycyclic codes over finite chain rings LAWCI 2018, Campinas Thomas

Decoding Homology A lexicon for the uninitiated Tutorial Lecture Dan Browne - University

On the Implementation of Model Predictive Control for On-line Walking Pattern Generation Dimitar

Une attaque par rejeu sur le protocole SEND Tony Cheneau mail: tony.cheneau@it-sudparis.eu

CSSE 220 Coupling and Cohesion Scoping Please turn in your assignment in the back Review of

A generic worklist analysis algorithm Advanced program representations Maintain a mapping from

tP t r P ts st

Image Analysis The topics of this lecture are (chapter 11) Representation and description Chain

SSA and DFAs Simone Campanoni simonec@eecs.northwestern.edu SSA - PowerPoint PPT Presentation

SSA and DFAs Simone Campanoni simonec@eecs.northwestern.edu SSA Outline SSA and why? Reaching definitions, constant propagation with SSA forms SSA in LLVM Generate SSA code Def-use chains v = 3 Within your CAT: you can follow

SSA Technicalities Last Time Introduced SSA Today Aliasing in SSA Building SSA

SSA form SSA form Michel Schinz SSA form Straight-line code Transforming a piece of

SSA Form &amp; SSA-form: x 17-4 Each name is defined exactly once. Dead Code Elimination

CO444H SSA SSA Construction SSA-based analysis Ben Livshits 1 Refresher: Reaching Definitions

CPSC 121: Models of Computation 2018S DFAs in Depth Meghan Allen, based on notes by Steve

SSA form Static single assignment (SSA) form Michel Schinz Advanced Compiler Construction

CPSC 121: Models of Computation Module 11: Sets, Functions and DFAs. CPSC 121 2016W T1 1

CSE 311: Foundations of Computing highlights Fall 2014 DFAs Regular Expressions Lecture

Chapter Four: DFA Applications Formal Language, chapter 4, slide 1 1 We have seen how DFAs

Tree SSA Tree SSA Design and Implementation Design and Implementation Diego Novillo Red Hat

Abstract This report examines the results of seventeen SSA ACQ/PEAKUP observations executed

Scalable Array SSA and Array Dataflow Analysis Silvius Rus Guobin He Lawrence Rauchwerger SSA

Clil project An English splash into the Mediterranean Sea Prof.ssa Caterina Moccia Prof.ssa

SSA in Scheme Static single assignment (SSA) : assignment conversion (boxing),

2016 Age Friendly SSA Office of Research &amp; Evaluation Survey Findings Conducted by

E ssa y Que stio n De ve lo pme nt &amp; Asse ssme nt Wor kshops - Oc t. 10, 2014 E ast AM

[Galois-disjoint] Polycyclic codes over finite chain rings LAWCI 2018, Campinas Thomas

Decoding Homology A lexicon for the uninitiated Tutorial Lecture Dan Browne - University

On the Implementation of Model Predictive Control for On-line Walking Pattern Generation Dimitar

Une attaque par rejeu sur le protocole SEND Tony Cheneau mail: tony.cheneau@it-sudparis.eu

CSSE 220 Coupling and Cohesion Scoping Please turn in your assignment in the back Review of

A generic worklist analysis algorithm Advanced program representations Maintain a mapping from

tP t r P ts st

Image Analysis The topics of this lecture are (chapter 11) Representation and description Chain

SSA Form & SSA-form: x 17-4 Each name is defined exactly once. Dead Code Elimination

2016 Age Friendly SSA Office of Research & Evaluation Survey Findings Conducted by

E ssa y Que stio n De ve lo pme nt & Asse ssme nt Wor kshops - Oc t. 10, 2014 E ast AM