Making Context-sensitive Points-to Analysis with Heap Cloning - PowerPoint PPT Presentation

Making Context-sensitive Points-to Analysis with Heap Cloning Practical For The Real World Chris Lattner Andrew Lenharth Vikram Adve Apple UIUC UIUC

What is Heap Cloning? Distinguish objects by acyclic call path void foo() { list* mkList(int num) { c1: list* L1 = mkList(10); list* L = NULL; c2: list* L2 = mkList(10); while (--num) } list_1: L = new list(L); } Without heap cloning: With heap cloning: Lists are allocated in a common Disjoint data structure place so they are the same list instances are discovered list_1 c1/list_1 c2/list_1 L1 L2 L1 L2

Why Heap Cloning? ● Discover disjoint data structure instances – able to process and/or optimize each instance ● More precise alias analysis ● Important in discovering coarse grain parallelism* ● More precise shape analysis? But widely considered non-scalable and rarely used * Ryoo et. al., HiPEAC '06

Some Uses of Our Analysis Data Structure Analysis (DSA) is well tested, used for major program transformations ● Automatic Pool Allocation – PLDI 2005 – Best Paper ● Pointer Compression – MSP 2005 ● SAFECode – PLDI 2006 ● Less conservative GC ● Per-instance profiling ● Alias Analysis – optimizations that use alias results Available at llvm.org

Key Contributions Heap cloning (with unification) can be scalable and fast ● Many algorithmic choices, optimizations necessary – We measure several of them ● Sound and useful analysis on incomplete programs ● New techniques – Fine-grained completeness tracking solves 3 practical issues – Call graph discovery during analysis, no iteration – New engineering optimizations

Outline ● Algorithm overview ● Results summary ● Optimizations and their effectiveness

Design Decisions Fast analysis and scalable for production compilers! Improves Speed, Improves Precision Hurts Precision Scalable Algorithms Scalable Algorithms Common Design of Common Design of ● Unification based ● Flow insensitive ● Field sensitive ● Drop context-sensitivity ● Context sensitive ● Heap cloning in SCCs of call graph ● Fine-grained completeness ● Use-based type inferencing for C

DS Graph Properties {G,H,S,U} : Storage Each pointer field has a int Z; class single outgoing edge void twoLists() { list *X = makeList(10); list *Y = makeList(100); addGToList(X); int: GMRC Y X addGToList(Y); Z freeList(X); freeList(Y); } list: HMRC list: HMRC Object type list* int list* int Field-sensitive for “type-safe” nodes These data have been proven (a) disjoint ; (b) confined within twoLists()

Algorithm Fly-by 3 Phase Algorithm ● Local – Field-sensitive intra-procedural summary graph ● Bottom-up on SCCs of the call graph – Clone and inline callees into callers – summary of full effects of calling the function ● Top-down on SCCs of the call graph – Clone and inline callers into callees

Completeness A graph node is complete if we can prove we have seen all operations on its objects 1. Support incomplete programs 2. Safely speculate on type safety 3. Construct call graph incrementally

Incompleteness - Sources Incompleteness is a transitive closure starting from escaping memory: Externally visible globals list* ExternGV; static int LocalGV; Return values and arguments int* escaping_fun(list*) {...} of escaping functions static int* local_fun(list*) { ... Return value and arguments x = extern_fun(L1); of external or unresolved ... } indirect calls

Call Graph Discovery ● Discover call targets in a context-sensitive way ● Incompleteness ensures correctness of points-to graphs with unresolved call sites ● SCCs may be formed by resolving an indirect call – Key insight: safe to process SCC even if some of its functions are already processed – See paper for details

Methodology ● Benchmarks: Benchmark kLOC siod 12.8 – SPEC 95 and 2000 134.perl 26.9 – Linux 2.4.22 252.eon 35.8 – povray 3.1 – Ptrdist 255.vortex 67.2 ● Presenting 9 benchmarks with 254.gap 71.3 253.perlbmk 85.1 slowest analysis time povray31 108.3 – Except 147.vortex and 126.gcc 176.gcc 222.2 – Lots more in paper vmlinux 355.4 ● Machine: 1.7 Ghz AMD Athlon, 1 GB Ram

Results - Speed < 5% of GCC -O3 time

Results – Memory Usage

Avoiding Bad Behavior ● Equivalence classes – Avoid N^2 space and time for globals not used in most functions ● Globals Graph* – Avoid N^2 replication of globals in nodes ● SCC collapsing* – Avoid recursive inlining – hurts precision ● Optimized Cloning and Merging* – Avoid lots of allocation traffic * used by others also

Slowdowns – No Optimizations 21.8x 1x == fully optimized ` 7.5x

No SCC Collapsing Naive Merging Optimizations Effects No Globals Graph No Equivalence Classes

Results – By Size Speedup due to optimizations grows as program size does Average LOC Average Speedup Largest 4 programs 280k 10.8x Second largest 4 72k 4.4x Third largest 4 52k 2.7x Optimizations are essential for scalability, not just speed

Summary ● Context sensitive analyses with heap cloning can be efficient enough for production compilers ● Sound and useful analysis is possible on incomplete programs ● Many optimizations necessary for speed and scalability

Questions? Rob: Why heap cloning? Andrew: It's better than sheep cloning. Rob: Yes, heap cloning raises none of the ethical concerns of sheep cloning, and sometimes the sheep have strange developmental issues that you don't get with heap cloning.

Related – Ruf Similarities Differences ● Unification ● Requires whole ● Heap cloning program ● Field sensitive ● For type safe language ● Globals graph ● Requires call graph ● Intelligent inlining – used context insensitive ● Drop context sensitivity in SCC

Related – Liang (FICS) Similarities Differences ● Iterates during Bottom Up ● Unification ● No heap cloning ● Context sensitive ● Field sensitive ● Requires call graph

Related – Liang (MOPPA) Similarities Differences ● Unification ● Iterates during Bottom Up ● Context sensitive ● Requires call graph or ● Field sensitive iterates to construct it ● Globals graph ● Memory intensive ● Heap Cloning

Related - Whaley-Lam Similarities Differences ● Context sensitive ● Constraint solving algorithm ● Call graph is input to context-sensitive alg – discovered by context- insensitive alg ● For type safe language ● No heap cloning ● Much slower on similar hardware

Related - Bodik Similarities Differences ● Context sensitive ● Subset based ● Heap cloning ● Requires call graph ● SCC collapsing ● Demand driven ● Requires whole program ● For type safe language ● Much slower on similar hardware

Related - Nystron ● Top-down, bottom-up ● Subset based ● Some codes cause structure ● Context sensitive runtime explosion ● Heap cloning ● SCC collapsing ● Behavior of Globals stored in side structure

Why Heap Cloning? Part 2! ● Rob: Why heap cloning? ● Andrew: It's better than sheep cloning. ● Rob: Yes, heap cloning raises none of the ethical concerns of sheep cloning, and sometimes the sheep have strange developmental issues that you don't get with heap cloning.

Making Context-sensitive Points-to Analysis with Heap Cloning - PowerPoint PPT Presentation

Making Context-sensitive Points-to Analysis with Heap Cloning Practical For The Real World Chris Lattner Andrew Lenharth Vikram Adve Apple UIUC UIUC What is Heap Cloning? Distinguish objects by acyclic call path void foo() { list*

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap

PROGRAM HEAP 2020-2021 WHAT IS HEAP? HEAP is a federally funded program that assists low

Heapsort Build-Max-Heap Next we build a full heap from an unsorted sequence Build-Max-Heap(A)

Windows 8 Heap Internals Windows 8 Heap Internals Windows 8 Heap Internals INTRODUCTION Windows 8

Fibonacci Heap Group Minus One Second December 6, 2016 Group Minus One Second Fibonacci Heap

Chapter 19: Binomial Heaps We will study another heap structure called, the binomial heap. The

1 Fib- -Heap Heap- -Extract Extract- -Min Min Example: Fib- -Heap Heap- -Extract

Context Sensitivity Example of a CSG Informatics 2A: Lecture 26 2 Context in Programming

A6: Sensitive Data Exposure A6 Sensitive Data Exposure Sensitive data stored or transmitted

heaps and heapsort on n elements height of a heap is in (log n ) building a heap bottum-up in O (

Context Sensitive Solutions Context Context Sensitive Solutions (CSS) is a collaborative approach

Context-sensitive Analysis Attribute Grammar And Type Checking cs5363 1 Context-Sensitive

Understanding the heap by breaking it A case study of the heap as a persistent data structure

A heap, a stack, a bottle and a rack The Stack Canary Birds The heap Answer: The first three

In the Beginning was the Word Stephen Heap In the Beginning was the Word

When Testing in Production is a Good Idea Dan Robinson CTO, Heap whoami Joined as Heap's

Welcome #sqlclone sqlclonefeedback@red-gate.com What problems do we face when provisioning

Cloning Tools Photoshop Tutorials Introduction In a skilled and experienced hand, the cloning

GROWING HE MP IN NE W ME XICO Ne w Me xic o He mp Se r vic e s F ull Se r vic e Co mpany

An Empirical Study of Long-Lived Code Clones Dongxiang Cai Hong Kong University of Science and

How to Insert iTunes Music into PPT Presentation Have you ever come across such problem that the

Responses of Microbial Communities to Increases in Nitrogen Loads in Salt Marshes Jennifer L.

Technology Coordinator Webinar Illinois State Board of Education January 31, 2018 Whole Child

4/2/2019 EOY LPACs Presentation for Teachers 1 4/2/2019 Composition of the LPAC The LPAC is