Pushdown Control-Flow Analysis of Higher Order Programs Christopher - PowerPoint PPT Presentation

Pushdown Control-Flow Analysis of Higher Order Programs Christopher Earl 1 Matthew Might 1 David Van Horn 2 1 University of Utah { cwearl,might } @cs.utah.edu 2 Northeastern University dvanhorn@ccs.neu.edu August 21, 2010

Who uses function (calls)?

Who uses function (calls)? Pushdown control-flow analysis models function calls precisely.

Simple example of merging return-points (let* ((id (lambda (x) x)) (a (id 3)) (b (id 4))) a)

The big picture Classical control-flow analysis is not precise enough.

The big picture Classical control-flow analysis is not precise enough. Pushdown control-flow analysis has better precision.

The big picture Classical control-flow analysis is not precise enough. Pushdown control-flow analysis has better precision. We generalize k-CFA to a pushdown control-flow analysis.

The big picture Classical control-flow analysis is not precise enough. Pushdown control-flow analysis has better precision. We generalize k-CFA to a pushdown control-flow analysis. Our approach has several advantages: Direct-style Polyvariant Polynomial

Control-flow analysis < pushdown control-flow analysis Expressiveness of k-CFA = NFA

Control-flow analysis < pushdown control-flow analysis Expressiveness of k-CFA = NFA Expressiveness of PDCFA = PDA

Our approach

Target language/stack behavior ( let (( x e 1 )) e 2 ) = Push frame ( x , e 2 , . . . ) onto stack. ⇒

Target language/stack behavior ( let (( x e 1 )) e 2 ) = Push frame ( x , e 2 , . . . ) onto stack. ⇒ a = Pop top of stack. ⇒

Target language/stack behavior ( let (( x e 1 )) e 2 ) = Push frame ( x , e 2 , . . . ) onto stack. ⇒ a = Pop top of stack. ⇒ ( f a ) = Stack no-op. ⇒

Concrete Semantics A CESK machine.

Concrete Semantics A CESK machine. Configuration = State × Stack

Concrete Semantics A CESK machine. Configuration = State × Stack State = Expression × Environment × Store

Abstract Semantics Abstracted environment = ⇒

Abstract Semantics Abstracted environment = ⇒ environments = finite

Abstract Semantics Abstracted environment = ⇒ environments = finite Abstracted store = ⇒

Abstract Semantics Abstracted environment = ⇒ environments = finite Abstracted store = ⇒ stores = finite

Abstract Semantics Abstracted environment = ⇒ environments = finite Abstracted store = ⇒ stores = finite Abstracted state = ⇒

Abstract Semantics Abstracted environment = ⇒ environments = finite Abstracted store = ⇒ stores = finite Abstracted state = ⇒ states = finite

Size of the abstract configuration-space Using the stack = ⇒

Size of the abstract configuration-space Using the stack = ⇒ configuration-space = infinite

Size of the abstract configuration-space Using the stack = ⇒ configuration-space = infinite The configuration-space cannot be explicitly searched.

Size of the abstract state-space State-space = finite Always.

� � � � Finite model of pushdown control-flow analysis · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ � ˆ ǫ � ˆ ˆ ς 2 ς 3 ς 4

� � � � Finite model of pushdown control-flow analysis · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ � ˆ ǫ � ˆ ˆ ς 2 ς 3 ς 4 This representation is a PDA.

� � � � � � While finite, this naive PDA is inefficient: · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ � ˆ ǫ � ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7

� � � � � � While finite, this naive PDA is inefficient: · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ � ˆ ǫ � ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7 (Provably) unreachable configurations/states are included.

� � � � � � While finite, this naive PDA is inefficient: · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ � ˆ ǫ � ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7 (Provably) unreachable configurations/states are included. Legal path from initial configuration/state = ⇒

� � � � � � While finite, this naive PDA is inefficient: · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ � ˆ ǫ � ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7 (Provably) unreachable configurations/states are included. Legal path from initial configuration/state = ⇒ reachable

� � � � � � � � Shortcut edges: finding the top of the stack · · · · · · � � � � � � � � � � � � � � � � ˆ ˆ ς 1 ς 5 � � ǫ ˆ ˆ � φ + φ − � � � � � � � � � � � ǫ ǫ � ˆ ˆ ˆ ˆ ς 3 � � � ς 6 ς 2 ς 4 � ˆ φ ′ − � � ˆ φ ′′ � − ˆ ς 7

� � � � � � � � Shortcut edges: finding the top of the stack · · · · · · � � � � � � � � � � � � � � � � ǫ � ˆ ˆ ς 1 ς 5 � � ǫ ˆ ˆ � φ + φ − � � � � � � � � � � � ǫ ǫ � ˆ ˆ ˆ ˆ ς 3 � � � ς 6 ς 2 ς 4 � ˆ φ ′ − � � ˆ φ ′′ � − ˆ ς 7

� � � � � � � � Shortcut edges: finding the top of the stack ˆ ς 0 · · · � � � ˆ φ ′ � � + � � � � � � � � � � ǫ � ˆ ˆ ς 1 ς 5 � � ˆ ǫ ˆ � φ + φ − � � � � � � � � � � � ǫ ǫ � ˆ ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7

� � � � � � � � Shortcut edges: finding the top of the stack ˆ ˆ ς 0 ς 8 � ˆ � ˆ φ ′ φ ′ � � � + − � � � � � � � � � ǫ � ˆ ˆ ς 1 ς 5 � � ˆ ǫ ˆ � φ + φ − � � � � � � � � � � � ǫ ǫ � ˆ ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7

� � � � � � � � Shortcut edges: finding the top of the stack ǫ � ˆ ˆ ς 0 ς 8 � ˆ � ˆ φ ′ φ ′ � � � + − � � � � � � � � � ǫ � ˆ ˆ ς 1 ς 5 � � ˆ ǫ ˆ � φ + φ − � � � � � � � � � � � ǫ ǫ � ˆ ˆ ˆ ˆ � � � ς 2 ς 3 ς 4 ς 6 ˆ � φ ′ − � � ˆ φ ′′ � − ˆ ς 7

� � � � � � Dyck state graphs: a lean PDA representation ǫ � ˆ ˆ ς 0 ς 8 � ˆ ˆ � φ ′ φ ′ � � � + − � � � � � � � � � ǫ � ˆ ˆ ς 1 ς 5 � � ǫ ˆ � ˆ φ − φ + � � � � � � � � � � � ǫ ǫ � ˆ ˆ ˆ ς 2 ς 3 ς 4 Only reachable states and configurations are included.

Our contributions

Direct-style Polyvariant Polynomial

Direct-style:

Direct-style: by the language (A-Normal Form)

Direct-style: by the language (A-Normal Form) Polyvariant:

Direct-style: by the language (A-Normal Form) Polyvariant: the abstract semantics can use a parameter, k, identical to the k in k-CFA

Polynomial: monovariance and store-widening Standard (infinite) pushdown control-flow analysis: Configuration = Expression × Environment × Store × Stack Frame = Variable × Expression × Environment

Polynomial: monovariance and store-widening Dyck state graphs: State = Expression × Environment × Store Frame = Variable × Expression × Environment

Polynomial: monovariance and store-widening Monovariant Dyck state graphs: State = Expression × Store Frame = Variable × Expression

Polynomial: monovariance and store-widening Monovariant Dyck state graphs with store-widening: State = Expression (with a global store) Frame = Variable × Expression

Recap Pushdown control-flow analysis precisely models the stack.

Pushdown Control-Flow Analysis of Higher Order Programs Christopher - PowerPoint PPT Presentation

Pushdown Control-Flow Analysis of Higher Order Programs Christopher Earl 1 Matthew Might 1 David Van Horn 2 1 University of Utah { cwearl,might } @cs.utah.edu 2 Northeastern University dvanhorn@ccs.neu.edu August 21, 2010 Who uses function

Pushdown Automata Context Free Languages IV Input tape 1 2 Pushdown Automata 3 5 4 State

Pushdown Automata A pushdown automata (PDA) is essentially: Pushdown Automata An NFA with

Pushdown Automata Pushdown Automata p.1/25 Rationale CFG are specification

Parity to Safety in Polynomial Time for Pushdown and Collapsible Pushdown Games Matthew Hague

Reachability analysis of first-order definable pushdown systems (= pushdown systems in sets with

1 What Is Control-Flow Analysis? Loop Concepts Control-flow analysis discovers the flow of

A Pumping Lemma for Pushdown Graphs of Any Level Pawe Parys University of Warsaw Higher order

Pushdown Automata Reading: Chapter 6 1 Pushdown Automata (PDA) Informally: A PDA is

Pushdown Automata 7-0 Pushdown Automata The automata we saw so far were

Equivalence of Pushdown Automata with Context-Free Grammar Equivalence of Pushdown Automata with

Higher order complexity Hugo Fre Mathieu Hoyrup CCA 2013 Hugo Fre Higher order

Flow Analysis Data-flow analysis, Control-flow analysis, Abstract interpretation, AAM Helpful

Programming in C 1 Flow of Control Flow of control The order in which statements are

V3 1/3/2015 Programming in C 1 Flow of Control Flow of control The order in which

York University www.cs.york.ac.uk/~ndm First order vs Higher order Higher order:

Program Analysis Chris Hankin(clh) Efficiency Type and Effect Control Flow Analysis Systems

Fundamentals of (Static ic and Dynamic ic) ) Software Verif ific icatio ion Control-flow

1 Liveness by Example (cont) Control Flow Graphs (CFGs) Live range of a Definition a is live

Click to go to website: www.njctl.org Slide 2 / 156 Geometry GeoGeometr Geometry Logic and

Our history: 1956 Foundatjon of the Jandaia, with a small structure in the center of So

Programs analysis Programs analysis by SamCOTS by SamCOTS ( S tatic A nalyser of M alicious COTS

Program Slicing Gias Uddin Special Topic Lecture prepared from the Survey of Frank Tip on Program

Function Call Re-Vectorization Pupil: Rubens Emilio Alves Moreira Advisor: Fernando Magno Quinto

Big-Step Bounded Model Checking for So6ware Nishant Sinha ,

Sambuz

Useful Links

Newsletter

Mail Us