Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep - PowerPoint PPT Presentation

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep 05

Call graphs • Statically compute a precise call graph • Maps call sites to functions called • Challenge: • Methods • Higher-order functions • Can use precise call graph for: • optimization • reduce dispatch overhead • convert calls to lambdas to direct jumps • reduce code size • program understanding 2

Various techniques • Unique Name [Calder and Grunwald, POPL’94] • Class Hierarchy Analysis [Dean, Grove, Chambers, ECOOP’95] [Fernandez, PLDI’95] • Optimistic Reachability Analysis • Rapid Type Analysis [Bacon and Sweeney, OOPSLA’96] • Propagation-based analysis • 0-CFA [Shivers, PLDI’88] • k -CFA [Shivers ‘91] • Unification-based analysis [Steensgaard, POPL’96] • Interprocedural Class Analysis [DeFouw, Grove, Chambers, POPL’98] 3

Unique Name • Does not build call graph, but does resolve virtual calls class A { int foo() { return 1; } • If only one method named m in } entire program class B extends A { • Replace all virtual calls to a int foo() { return 2; } int bar(int i) { return i+1; } method named m with a non- } virtual call void main() { • Do at link time on object files B p = new B(); • Can resolve (1) only int r1 = p.bar(1); // 1: B.bar int r2 = p.foo(); // 2: B.foo • For C++ benchmarks, resolves 15% A q = p; of virtual calls int r3 = q.foo(); // 3: B.foo • Can’t handle same method name in } different classes 4

Class Hierarchy Analysis class A { int foo() { return 1; } • Use static type of receiver and the } class hierarchy to narrow set of class B extends A { possible targets int foo() { return 2; } int bar(int i) { return i+1; } • Whole program analysis } • Flow insensitive void main() { • O ( N ) B p = new B(); • Can resolve (1) and (2) int r1 = p.bar(1); // 1: B.bar int r2 = p.foo(); // 2: B.foo • For C++ benchmarks, resolves 51% A q = p; of virtual calls int r3 = q.foo(); // 3: B.foo } 5

Rapid Type Analysis class A { int foo() { return 1; } • Do CHA to build call graph } • If no object of class C allocated in class B extends A { the program, int foo() { return 2; } int bar(int i) { return i+1; } • Remove edges to methods of C } • O ( N ) void main() { • Slightly more expensive than CHA B p = new B(); • Can resolve (1), (2), and (3) int r1 = p.bar(1); // 1: B.bar int r2 = p.foo(); // 2: B.foo • For C++ benchmarks, resolves 71% A q = p; of virtual calls int r3 = q.foo(); // 3: B.foo } 6

Disjoint polymorphism • Multiple related object types used class Base { independently void m() { assert(false); } • e.g., Square and Circle objects void p() { assert(false); } are never mixed together in, say, } a Collection of Shapes class Derived1 extends Base { • Pathological case: void m() { ... } • Derived1 and Derived2 are } disjoint class Derived2 extends Base { • No Base objects allocated void p() { ... } • All calls are through Base } pointers 7

Unification-based analysis • Partitions variables in program and maps each partition to a set of classes class A { • Initialize with each variable in own int foo() { return 1; } partition } • If classes can flow between variables, class B extends A { int foo() { return 2; } unify the classes for those variables } target = source; void main() { A p = new B(); T1 m(T2 target) { ... } int r1 = p.foo(); // 4: B.foo m(source); A q = new A(); q = new B(); • Resolves (4), but not (5) int r2 = q.foo(); // 5: B.foo • O ( Nα ( N , N )) } 8

Interprocedural class analysis • Framework integrates • propagation-based analysis (0-CFA) • unification-based analysis • optimistic reachability analysis (RTA) • Computes set of classes for each program variable • Builds call graph as side effect 9

Flow graph representation • Node for each variable, method, new, call • Algorithm computes set of classes for each node • Edge between two nodes if classes can flow between them target = source; source target T1 m(T2 target) { ... } m(source); 10

Basic algorithm (0-CFA) • Construct nodes and edges for top-level variables, statements, and expressions (e.g., main) • Propagate classes through flow graph starting with main and top-level new expressions • When call encountered, add edge to target and construct flow graph for target method (if not already done) • If method not reachable, it will be pruned (as in RTA) 11

Edge filters • Edges may have a filter set • encode constraints ensured by type declarations or by dynamic dispatch • Don’t propagate class if filter does not include that class • Makes algorithm more precise than 0-CFA B.m() filter: {B} class B { m() { ... this ... } } this B class C ext B { m() { ... this ... } } o this C filter: {C} B o = new C(); C.m() o.m() 12

Call merging • Analysis parameterized by MergeCalls • When MergeCalls = false: o1 this B o1.m(a1) B.m(x1) { ... } a1 x1 o2 this C o2.m(a2) C.m(x2) { ... } a2 x2 • When MergeCalls = true: o1 this B o1.m(a1) B.m(x1) { ... } a1 m0 x1 o2 m1 this C o2.m(a2) C.m(x2) { ... } a2 x2 13

Node merging • Can speedup analysis by merging nodes into supernodes • Nodes merged with successors source target target = source; T1 m(T2 target) { ... } m(source); source target • Always merging is equivalent to unification-based analysis 14

Merging parameters • Analysis parameterized by P and MergeWithGlobal • When P = k , merge node with its successors if node visited more than k times • When P = 0, always merge • When P = N , never merge • When MergeWithGlobal = true, use only one global supernode 15

Instantiations Algorithm P MergeWithGlobal MergeCalls Complexity 0-CFA N N/A false O ( N 3 ) linear-edge 0-CFA N N/A true O ( N 2 ) bounded 0-CFA O (1) false false O ( N 2 α ( N , N )) bounded linear-edge 0-CFA O (1) false true O ( Nα ( N , N )) simply bounded 0-CFA O (1) true false O ( N 2 ) simply bounded linear-edge 0-CFA O (1) true true O ( N ) equivalence class analysis 0 false true O ( Nα ( N , N )) RTA 0 true true O ( N ) 16

Analysis time • Analysis time increases slightly with P • Mostly flat when P small, finite • MergeWithGlobal = true (simply bounded) • saves ~10% on Cecil • negligible improvement for Java • but all the benchmarks are Java compilers • 250% for one case when P = N • MergeCalls = true (linear edge) • up to 3x for Cecil, or more • only 5-20% savings for Java • no multimethods, so less edge filtering? • some programs can only be analyzed with linear edge (or small P ) 17

Precision • Larger P more precise (less merging) • Run-time speedup 0-10% for P = 0, 10-350% for P = N • MergeCalls = true (linear edge) • About as precise as quadratic edge • Less so for Java, but no difference in speedup • MergeWithGlobal = true (simply bounded) • Slightly less precision • but on some Cecil benchmarks, improved precision of MergeWithGlobal = false caused 2.5x speedup • precision lost on hot virtual calls? 18

Questions • All of these analyses are whole-program • Can they be modularized? • Integrating alias analysis, or more precise points to analysis • Extend class analysis to incorporate context as in k -CFA 19

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep - PowerPoint PPT Presentation

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep 05 Call graphs Statically compute a precise call graph Maps call sites to functions called Challenge: Methods Higher-order functions Can use precise call

Interprocedural Analysis Last time Interprocedural analysis Today Interprocedural alias

Interprocedural Analysis Last time Alias analysis Today Interprocedural analysis CS553

Interprocedural Analysis 15819M 15819M Program Analysis Jonathan Aldrich Interprocedural

Interprocedural Analysis and Abstract Interpretation cs6463 1 Outline Interprocedural

Interprocedural Analysis and Optimization Mod/Ref Analysis Alias Analysis Constant Propagation

1 What Is Control-Flow Analysis? Loop Concepts Control-flow analysis discovers the flow of

Towards Dynamic Interprocedural Analysis in JVMs Feng Qian and Laurie Hendren { fqian,hendren }

Flow Analysis Data-flow analysis, Control-flow analysis, Abstract interpretation, AAM Helpful

Alias Analysis Last time Interprocedural analysis Today Intro to alias analysis (pointer

Program Analysis Chris Hankin(clh) Efficiency Type and Effect Control Flow Analysis Systems

Control Flow CPU Sean Barker 1 Physical Control Flow Physical control flow <startup>

Class 10 Review; questions Questions about project Arbitrary interprocedural control

Class 9 Review; questions Discussion of Semester Project Arbitrary interprocedural

Programming in C 1 Flow of Control Flow of control The order in which statements are

V3 1/3/2015 Programming in C 1 Flow of Control Flow of control The order in which

Data-flow analysis Introduction to data-flow analysis Michel Schinz based on material by

CS293S GCSE Yufei Ding Review So far, we have seen Local Value Numbering Finds

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

Graph IRs Control Flow Graphs + The three-address IR looks like assembly language including

Analysis and Optimizations Analysis and Optimizations Program Analysis Program Analysis

CS406: Compilers Spring 2020 Week 13: Control Flow Graphs (Slides courtesy: Prof. Milind

CMSC 430 Introduction to Compilers Fall 2018 Data Flow Analysis Applications and

CS 251 Fall 2019 CS 240 Principles of Programming Languages Foundations of Computer Systems

CFA Simone Campanoni simonec@eecs.northwestern.edu Problems with Canvas? Problems with slides?

Sambuz

Useful Links

Newsletter

Mail Us

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep - PowerPoint PPT Presentation

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep 05 Call graphs Statically compute a precise call graph Maps call sites to functions called Challenge: Methods Higher-order functions Can use precise call

Interprocedural Analysis Last time Interprocedural analysis Today Interprocedural alias

Interprocedural Analysis Last time Alias analysis Today Interprocedural analysis CS553

Interprocedural Analysis 15819M 15819M Program Analysis Jonathan Aldrich Interprocedural

Interprocedural Analysis and Abstract Interpretation cs6463 1 Outline Interprocedural

Interprocedural Analysis and Optimization Mod/Ref Analysis Alias Analysis Constant Propagation

1 What Is Control-Flow Analysis? Loop Concepts Control-flow analysis discovers the flow of

Towards Dynamic Interprocedural Analysis in JVMs Feng Qian and Laurie Hendren { fqian,hendren }

Flow Analysis Data-flow analysis, Control-flow analysis, Abstract interpretation, AAM Helpful

Alias Analysis Last time Interprocedural analysis Today Intro to alias analysis (pointer

Program Analysis Chris Hankin(clh) Efficiency Type and Effect Control Flow Analysis Systems

Control Flow CPU Sean Barker 1 Physical Control Flow Physical control flow &lt;startup&gt;

Class 10 Review; questions Questions about project Arbitrary interprocedural control

Class 9 Review; questions Discussion of Semester Project Arbitrary interprocedural

Programming in C 1 Flow of Control Flow of control The order in which statements are

V3 1/3/2015 Programming in C 1 Flow of Control Flow of control The order in which

Data-flow analysis Introduction to data-flow analysis Michel Schinz based on material by

CS293S GCSE Yufei Ding Review So far, we have seen Local Value Numbering Finds

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

Graph IRs Control Flow Graphs + The three-address IR looks like assembly language including

Analysis and Optimizations Analysis and Optimizations Program Analysis Program Analysis

CS406: Compilers Spring 2020 Week 13: Control Flow Graphs (Slides courtesy: Prof. Milind

CMSC 430 Introduction to Compilers Fall 2018 Data Flow Analysis Applications and

CS 251 Fall 2019 CS 240 Principles of Programming Languages Foundations of Computer Systems

CFA Simone Campanoni simonec@eecs.northwestern.edu Problems with Canvas? Problems with slides?

Sambuz

Useful Links

Newsletter

Mail Us

Control Flow CPU Sean Barker 1 Physical Control Flow Physical control flow <startup>