Sebastian Hack, Christian Hammer, Jan Reineke Saarland University - PowerPoint PPT Presentation

Sebastian Hack, Christian Hammer, Jan Reineke Saarland University Static Program Analysis Introduction Winter Semester 2014 Slides based on: • H. Seidl, R. Wilhelm, S. Hack: Compiler Design, Volume 3, Analysis and Transformation, Springer Verlag, 2012 • F. Nielson, H. Riis Nielson, C. Hankin: Principles of Program Analysis, Springer Verlag, 1999 • R. Wilhelm, B. Wachter: Abstract Interpretation with Applications to Timing Validation. CAV 2008: 22-36 • Helmut Seidl’s slides 1

A Short History of Static Program Analysis • Early high-level programming languages were implemented on very small and very slow machines. • Compilers needed to generate executables that were extremely efficient in space and time. • Compiler writers invented efficiency-increasing program transformations, wrongly called optimizing transformations. • Transformations must not change the semantics of programs. • Enabling conditions guaranteed semantics preservation. • Enabling conditions were checked by static analysis of programs. 2

Theoretical Foundations of Static Program Analysis • Theoretical foundations for the solution of recursive equations: Kleene (30s), Tarski (1955) • Gary Kildall (1972) clarified the lattice-theoretic foundation of data-flow analysis. • Patrick Cousot (1974) established the relation to the programming-language semantics. 3

Static Program Analysis as a Verification Method • Automatic method to derive invariants about program behavior, answers questions about program behavior: – will index always be within bounds at program point p ? – will memory access at p always hit the cache? • answers of sound static analysis are correct, but approximate: don’t know is a valid answer! • analyses proved correct wrt. language semantics, 4

Introduction 1 a simple imperative programming language with: • / / variables registers • R = e ; / / assignments • R = M [ e ]; / / loads • M [ e 1 ] = e 2 ; / / stores • if ( e ) s 1 else s 2 / / conditional branching • goto L ; / / no loops An intermediate language into which (almost) everything can be translated. In particular, no procedures. So, only intra-procedural analyses! 5

Example — Rules-of-Sign Analysis 2 Problem: Determine at each program point the sign of the values of all variables of numeric type. Example program: 1: x = 0; 2: y = 1; 3: while (y > 0) do 4: y = y + x; 5: x = x + (-1); 6

Program representation as control-flow graphs 0 x = 0 1 y = 1 2 true(y>0) false(y>0) 4 3 y = y+x 5 x = x+(-1) 7

We need the following ingredients: • a set of information elements, each a set of possible signs, • a partial order, “ ⊑ ”, on these elements, specifying the ”relative strength” of two information elements, • these together form the abstract domain, a lattice, • functions describing how signs of variables change by the execution of a statement, abstract edge effects, • these need an abstract arithmetic, an arithmetic on signs. 8

We construct the abstract domain for single variables starting with the Signs = 2 {− , 0 , + } with the relation “ ⊑ ” =“ ⊆ ”. lattice {-,0,+} {-,0} {-,+} {0,+} {-} {0} {+} { } 9

The analysis should ”bind” program variables to elements in Signs . So, the abstract domain is D = ( Vars → Signs ) ⊥ , a Sign-environment. ⊥ ∈ D is the function mapping all arguments to {} . D 1 ⊑ D 2 The partial order on D is iff D 1 = ⊥ or D 1 x ⊆ D 2 x ( x ∈ Vars ) Intuition? D 1 is at least as precise as D 2 since D 2 admits at least as many signs as D 1 10

The analysis should ”bind” program variables to elements in Signs . So, the abstract domain is D = ( Vars → Signs ) ⊥ . a Sign-environment. ⊥ ∈ D is the function mapping all arguments to {} . D 1 ⊑ D 2 The partial order on D is iff D 1 = ⊥ or D 1 x ⊆ D 2 x ( x ∈ Vars ) Intuition? D 1 is at least as precise as D 2 since D 2 admits at least as many signs as D 1 11

How did we analyze the program? In particular, how did we walk the 0 lattice for y at program point 5? x = 0 1 {-,0,+} y = 1 {-,0} {-,+} {0,+} 2 true(y>0) false(y>0) {-} {0} {+} 4 3 y = y+x { } 5 x = x+(-1) 12

How is a solution found? Iterating until a fixed-point is reached 0 x = 0 1 0 1 2 3 4 5 y = 1 x y x y x y x y x y x y 2 true(y>0) false(y>0) 4 3 y = y+x 5 x = x+(-1) 13

Idea: • We want to determine the sign of the values of expressions. 14

Idea: • We want to determine the sign of the values of expressions. • { + , − , 0 } , For some sub-expressions, the analysis may yield which means, it couldn’t find out. 15

Idea: • We want to determine the signs of the values of expressions. • { + , − , 0 } , For some sub-expressions, the analysis may yield which means, it couldn’t find out. • We replace the concrete operators working on values by ✷ ✷ ♯ abstract operators working on signs: 16

Idea: • We want to determine the signs of the values of expressions. • { + , − , 0 } , For some sub-expressions, the analysis may yield which means, it couldn’t find out. • We replace the concrete operators working on values by ✷ ✷ ♯ abstract operators working on signs: • The abstract operators allow to define an abstract evaluation of expressions: ] ♯ : [ [ e ] ( Vars → Signs ) → Signs 17

Determining the sign of expressions in a Sign-environment works as follows:  { + } if c > 0    ] ♯ D [ [ c ] = {−} if c < 0   { 0 } if c = 0  ] ♯ [ [ v ] = D ( v ) ] ♯ D ] ♯ D ✷ ♯ [ ] ♯ D [ [ e 1 ✷ e 2 ] = [ [ e 1 ] [ e 2 ] ] ♯ D ] ♯ D ✷ ♯ [ [ [ ✷ e ] = [ e ] 18

Abstract operators working on signs (Addition) + # {0} {+} {-} {-, 0} {-, +} {0, +} {-, 0, +} {0} {0} {+} {+} {-} {-, 0} {-, +} {0, +} {-, 0, +} {-, 0, +} 19

Abstract operators working on signs (Multiplication) × # {0} {+} {-} {-, 0} {-, +} {0, +} {-, 0, +} {0} {0} {0} {+} {-} {-, 0} {-, +} {0, +} {-, 0, +} {0} Abstract operators working on signs (unary minus) − # {0} {+} {-} {-, 0} {-, +} {0, +} {-, 0, +} {0} {-} {+} {+, 0} {-, +} {0, -} {-, 0, +} 20

Working an example: D = { x �→ { + } , y �→ { + }} ] ♯ D ] ♯ D + ♯ [ ] ♯ D [ [ x + 7] = [ [ x ] [7] { + } + ♯ { + } = = { + } ] ♯ D { + } + ♯ ( − ♯ [ ] ♯ D ) [ [ x + ( − y )] = [ y ] { + } + ♯ ( − ♯ { + } ) = { + } + ♯ {−} = = { + , − , 0 } 21

] ♯ is the abstract edge effects associated with edge k . [ [ lab ] It depends only on the label lab : ] ♯ D [ [;] = D ] ♯ D [ [true ( e )] = D ] ♯ D [ [false ( e )] = D ] ♯ D ] ♯ D } [ [ x = e ;] = D ⊕ { x �→ [ [ e ] ] ♯ D [ [ x = M [ e ];] = D ⊕ { x �→ { + , − , 0 }} ] ♯ D [ [ M [ e 1 ] = e 2 ;] = D D � = ⊥ ... whenever These edge effects can be composed to the effect of a path π = k 1 . . . k r : ] ♯ = [ ] ♯ ◦ . . . ◦ [ ] ♯ [ [ π ] [ k r ] [ k 1 ] 22

Consider a program node v : → For every path π from program entry start to v the analysis should determine for each program variable x the set of all signs that the values of x may have at v as a result of executing π . → Initially at program start, no information about signs is available. → The analysis computes a superset of the set of signs as safe information. = = ⇒ For each node v , we need the set: ] ♯ ⊥ | π : start → ∗ v } � S [ v ] = { [ [ π ] 23

Question: How do we compute S [ u ] for every program point u ? Idea: Collect all constraints on the values of S [ u ] into a system of constraints: S [ start ] ⊇ ⊥ ] ♯ ( S [ u ]) S [ v ] ⊇ [ [ k ] k = ( u, _ , v ) edge 24

Question: How can we compute S [ u ] for every program point u ? Idea: Collect all constraints on the values of S [ u ] into a system of constraints: S [ start ] ⊇ ⊥ ] ♯ ( S [ u ]) S [ v ] ⊇ [ [ k ] k = ( u, _ , v ) edge Why ⊇ ? 25

Wanted: • a least solution (why least?) • an algorithm that computes this solution Example: 26

0 x = 0 1 S [0] ⊇ ⊥ y = 1 S [1] ⊇ S [0] ⊕ { x �→ { 0 }} S [2] ⊇ S [1] ⊕ { y �→ { + }} 2 ] ♯ S [5] } S [2] ⊇ S [5] ⊕ { x �→ [ [ x + ( − 1)] true(y>0) false(y>0) S [3] ⊇ S [2] 4 3 S [4] ⊇ S [2] ] ♯ S [4] } S [5] ⊇ S [4] ⊕ { y �→ [ [ y + x ] y = y+x 5 x = x+(-1) 27

Sebastian Hack, Christian Hammer, Jan Reineke Saarland University - PowerPoint PPT Presentation

Sebastian Hack, Christian Hammer, Jan Reineke Saarland University Static Program Analysis Introduction Winter Semester 2014 Slides based on: H. Seidl, R. Wilhelm, S. Hack: Compiler Design, Volume 3, Analysis and Transformation, Springer

Static Program Analysis Foundations of Abstract Interpretation Sebastian Hack, Christian Hammer,

Static Program Analysis Foundations of Abstract Interpretation Sebastian Hack, Christian Hammer,

Incremental Interactive Computation Matthew Hammer hammer@cs.umd.edu Tuesday, October 7, 2014

SLIPPERY SLIDES HACK.|100% WORKING!|NEW METHOD|HACK TOOL. Free No Ads Hard-Boiled Egg Hack Goes

Safely Remove a Hammer From the Equation Conventional Hammer Union Problems associated with

SSA Introduction Sebastian Hack hack@cs.uni-saarland.de Compiler Construction 2013 saarland

Global Value Numbering Sebastian Hack hack@cs.uni-saarland.de 17. Dezember 2013 saarland

Global Value Numbering Sebastian Hack hack@cs.uni-saarland.de 5. Dezember 2017 Saarland

Code Generation: Intro Sebastian Hack hack@cs.uni-saarland.de Compiler Construction 2017

FC2015 RSDYK Dyke quality assessment by remote sensing Robert Hack Robert Hack 14-Apr-09 1

Hack The CPython Batuhan Taskaya @isidentical What is hacking? Why do we hack? Yes, we want

PORTING THE HAMMER FILE SYSTEM TO LINUX Daniel Lorch June 10, 2009 Outline 2/13 Motivation 1.

and Workflows Objective Items in our Toolbox Screwdriver Hammer Box Cutter Items in our

the context of Reverse Engineering Sebastian Porst (sebastian.porst@zynamics.com) Christian

Instruction Selection on SSA Graphs Sebastian Hack, Sebastian Buchwald, Andreas Zwinkau Compiler

Abstract Interpretation of FIFO Replacement Daniel Grund Jan Reineke Saarland University,

Survey Design Principles Luke Kachersky, Ph.D. Why Understand beyond customer behavior

Allan Schougaard Director of Application Management Dreamhammer, Inc. Company Focus 2

Applying Recursion: Grammars and Parsing Time flies like an arrow. Fruit flies like a banana.

Number of New COVID-19 Cases in Arkansas 160 140 120 100 NUMBER OF NEW CASES 80 60 40 20

Curling: Why The _ Do You _? Zaheen Ahmad Rational Behaviour Rational agents play to maximize

Mathematical Tools for the Masses Joseph Malkevitch Professor Emeritus York College (CUNY)

RowHammer in 15' Nicolas RUFF nruff+sstic15@google.com Life of an electron SRAM: static RAM

Integration of general-purpose automated theorem provers in Lean Gabriel Ebner Formal Methods in