Pointer Analysis in the Presence of Dynamic Class Loading Martin - PowerPoint PPT Presentation

Pointer Analysis in the Presence of Dynamic Class Loading Martin Hirzel, Amer Diwan University of Colorado at Boulder Michael Hind IBM T.J. Watson Research Center 1

Pointer analysis motivation Code a = new C( ); // G What does it do? b = new C( ); // H a = b ; a . f = b ; Points-to sets What is it good for? pointsTo ( a ) == { G,H } pointsTo ( b ) == { H } Code browsing pointsTo ( G . f ) == { H } Tools Code transformations pointsTo ( H.f ) == { H } Error detection Clients Devirtualization Optimizations Load elimination Parallelization Connectivity-based garbage collection 2

Static flow- and context-insensitive pointer analysis by Andersen Analysis components Analysis data structures Call graph builder Code Constraint finder Method compilation Constraint propagator Points-to sets Constraint graph Clients 3

Static analysis can not deal with all of Java • Class loading may be implicitly triggered by any … – Constructor call – Static field access – Static method call • Classes may come from the web or be generated on the fly � Pretending a “static world” fails for most real-world applications 4

Java challenges Analysis components Analysis data structures 1. Online call Call graph builder graph building Constraint finder Method compilation 3. Unresolved 4. Reflection and types native code 2. Re-propagation Constraint propagator Constraint graph Clients 5

1. Online call graph building caller a = x.m ( b ); callee A:: m ( c ) { return d ; } 6

1. Online call graph building caller a = x.m ( b ); e = y.m ( f ); callee A:: m ( c ) { return d ; } 7

1. Online call graph building caller a = x.m ( b ); e = y.m ( f ); callee A:: m ( c ) { B:: m ( g ) { return d ; return h ; } } 8

Architecture for online call graph building Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Method compilation Constraint propagator Constraint graph Clients 9

Java challenges Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Method compilation 3. Unresolved 4. Reflection and types native code 2. Re-propagation Constraint propagator Constraint graph Clients 10

2. Focused re-propagation Code Points-to sets a = new C( ); // G pointsTo ( a ) == { G } b = new C( ); // H pointsTo ( b ) == { H } a.f = b ; pointsTo ( G . f ) == { H } pointsTo ( H . f ) == { } a = b ; 11

2. Focused re-propagation Code Points-to sets a = new C( ); // G pointsTo ( a ) == { G ,H } b = new C( ); // H pointsTo ( b ) == { H } a.f = b ; pointsTo ( G . f ) == { H } pointsTo ( H . f ) == { } a = b ; 12

2. Focused re-propagation Code Points-to sets a = new C( ); // G pointsTo ( a ) == { G ,H } b = new C( ); // H pointsTo ( b ) == { H } a.f = b ; pointsTo ( G . f ) == { H } pointsTo ( H . f ) == { H } a = b ; 13

Architecture for focused re-propagation Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Method compilation Propagator worklist Constraint propagator Constraint graph Clients 14

Java challenges Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Method compilation 3. Unresolved 4. Reflection and types native code Propagator worklist Constraint propagator Constraint graph Clients 15

3. Unresolved types X x = …; caller a = x.m ( b ); ? Can X have a subclass that inherits m from Y ? callee Y:: m ( c ) { ! Cannot tell before X is return d ; resolved! } 16

Architecture for managing unresolved types Virtual machine events Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Deferred constraints Method compilation Resolution manager Propagator worklist Constraint propagator Type resolution Constraint graph Clients 17

Java challenges Virtual machine events Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Deferred constraints Method compilation 4. Reflection and Resolution manager native code Propagator worklist Constraint propagator Type resolution Constraint graph Clients 18

4. Reflection and native code Reflection Field f = B. class . getField (“…”); B b = …; f . set ( b , v ); Native code Java-side code Native-side code 00100101 a = b . m ( c ); 01001110 10010011 Object VM_JNIFunctions. 01001001 CallObjectMethod ( method , args ) { 10001111 return method . invoke ( args ); 10001111 } 00100101 19

Architecture for dealing with reflection and native code Virtual machine events Analysis components Analysis data structures Call graph builder Caller/callee look-up Constraint finder Deferred constraints Method compilation Reflection execution Resolution manager Propagator worklist Native code execution Constraint propagator Type resolution Constraint graph Clients 20

Other events leading to constraints Virtual machine events Analysis components Analysis data structures Building and start-up Call graph builder Caller/callee look-up Bytecode attributes Constraint finder Deferred constraints Method compilation Reflection execution Resolution manager Propagator worklist Native code execution Constraint propagator Type resolution Constraint graph Clients 21

Clients using our pointer analysis Virtual machine events Analysis components Analysis data structures Call graph builder Building and start-up Caller/callee look-up Bytecode attributes Constraint finder Deferred constraints Method compilation Reflection execution Resolution manager Propagator worklist Native code execution Constraint propagator Type resolution Constraint graph Clients 22

Dealing with invalidated results Many techniques from prior work – Guard optimized code (extant analysis) – Pre-existence based inlining – On-stack replacement – and more Connectivity-based garbage collection – Trigger propagator only before collection – Merge partitions if necessary 23

Evaluation methodology Java virtual machine – Jikes RVM from IBM, is itself written in Java Benchmarks – SPECjvm98 suite, xalan, hsql Results not comparable to static analysis – Analyze more code: Jikes RVM adds a lot of Java code – Analyze less code: Not all application classes get loaded 24

Propagation cost Eager At GC At End Count Avg. Total Count Avg. Total Total hsql 391 10.1s 1h06m 6 1m17s 7m40s 7m07s jess 734 16.8s 3h26m 3 1m58s 5m53s 3m02s javac 1,103 12.5s 3h50m 5 1m54s 9m32s 6m27s xalan 1,726 11.2s 5h22m 1 2m01s 2m01s 7m45s � Eagerness trades off average cost against total cost � On average, focused re-propagation is much cheaper than full propagation � Total cost is a function of code size and propagator eagerness 25

How long does a program have to run to amortize the analysis cost? Eager At GC At End Count Avg. Total Count Avg. Total Total hsql 391 10.1s 1h06m 6 1m17s 7m40s 7m07s jess 734 16.8s 3h26m 3 1m58s 5m53s 3m02s javac 1,103 12.5s 3h50m 5 1m54s 9m32s 6m27s xalan 1,726 11.2s 5h22m 1 2m01s 2m01s 7m45s � Long-running Overall analysis overhead Application applications runtime can amortize 10% 5% 2.5% not-too-eager 5m 50m 1h40m 3h20m analysis cost Analysis cost 15m 2h30m 5h 10h to amortize 1h 10h 20h 1d16h 26 5h 1d16h 4d04h 8d08h

Validation Virtual machine events Analysis components Analysis data structures Call graph builder Building and start-up Caller/callee look-up Bytecode attributes Constraint finder Deferred constraints Method compilation Reflection execution Resolution manager Propagator worklist Native code execution Constraint propagator Type resolution Constraint graph Clients 27 Validation

Validation • Piggy-back validation on garbage collection • For each pointer, check consistency with analysis results • Incorrect analysis would lead to tricky bugs in clients 28

Related work Andersen’s analysis for “static Java” [RountevMilanovaRyder’01] [LiangPenningsHarrold’01] [WhaleyLam’02] [LhotakHendren’03] Weaker analyses with dynamic class loading DOIT – [PechtchanskiSarkar’01] XTA – [QianHendren’04] Ruf’s escape analysis – [BogdaSingh’01, King’03] Demand-driven / incremental analysis 29

Conclusions • 1 st non-trivial pointer analysis for all of Java • Identified and solved the challenges: 1. Online call graph building 2. Focused re-propagation 3. Managing unresolved types 4. Reflection and native code • Evaluated efficiency • Validated correctness 30

Pointer Analysis in the Presence of Dynamic Class Loading Martin - PowerPoint PPT Presentation

Pointer Analysis in the Presence of Dynamic Class Loading Martin Hirzel, Amer Diwan University of Colorado at Boulder Michael Hind IBM T.J. Watson Research Center 1 Pointer analysis motivation Code a = new C( ); // G What does it do? b =

Opaque Pointer Types To a world without pointer to pointer bitcasts Motivation Proximal

Pointer arithmetic arrays only arrays only Pointer arithmetic Can add or subtract an

Pointer Basics Lecture 13 COP 3014 Fall 2019 November 7, 2019 What is a Pointer? A pointer

Dangling Pointer Dangling Pointer Jonathan Afek, 2/ 8/ 07, BlackHat USA 1 Table of Contents

Pointers and Memory 1 Pointer values Pointer values are memory addresses

Alias Analysis Last time Reuse optimization Today Alias analysis (pointer analysis)

Hierarchical Pointer Analysis for Distributed Programs Distributed Programs Amir Kamil and

Precision-Guided Context Sensitivity for Pointer Analysis Yue Li, Tian Tan, Anders Mller,

A Probabilistic Pointer Analysis A Probabilistic Pointer Analysis for Speculative Optimization

Making k- Object-Sensitive Pointer Analysis More Precise with Still k -Limiting Tian Tan , Yue Li

Alias Analysis Last time Alias analysis I (pointer analysis) Address Taken FIAlias,

Pentalift Pentalift Equipment Equipment Corporation Corporation Loading Dock Loading Dock

Presence Presence Presence When we wake up in the morning we may automatically leave our

CS711 Advanced Programming Languages Pointer Analysis Overview and Flow-Sensitive Analysis Radu

HD- -SCR and Tele SCR and Tele- -Pointer Pointer HD Nobuhiro Torata, Kyushu University

CS 103 Unit 11 Linked Lists Mark Redekopp 2 NULL Pointer Just like there was a null

Network Core Mechanisms of Exponence 2 nd Network Meeting, January 2008 Bernd Wiese The

Te Teachings s of f th the Pro rophet t (s) s) Appreciating Appr ng Others on on Family

Measuring Semantic Coherence of a Conversation Svitlana Vakulenko , Maarten de Rijke, Michael

Network Services on Service Extensible Routers Lukas Ruf Computer Engineering and Networks

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Outline Probability Calculations Using the Normal Distribution (5.1) Linear Combinations

STA 103: Probability and Statistical Inference Paul Marriott paul@stat.duke.edu 223C, Old

Encoding Separation Logic in Coq and Its Application Reynald Affeldt Nicolas Marti AIST-RCIS