Decision Tree Based Learning of Program Invariants Deepak DSouza - PowerPoint PPT Presentation

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree Based Learning of Program Invariants Deepak D’Souza Department of Computer Science and Automation Indian Institute of Science, Bangalore. FM Update Meeting IIT Mandi 17 July 2017

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants What this talk is about Paper titled Learning invariants using decision trees and implication counterexamples, by Garg, Neider, Madhusudan, and Roth, in POPL 2016. A way to automate deductive-style program verification. Extends the Decision Tree classification technique in Machine Learning, to handle implication samples, with applications to finding proofs of programs. Also talk about some directions to extend this work.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Outline of this talk Floyd-Hoare Style Verification 1 Decision Tree Learning 2 ICE Learning 3 Proofs with Multiple Invariants 4

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Proving assertions in programs // Pre: 10 <= y // Pre: true // Pre: 0 <= n y := y + 1; if (a <= b) int a = m; z := x + y; min = a; int x = 0; else while (x < n) { // Post: x <= z min = b; a = a + 1; x = x + 1; // Post: min <= a && min <= b } // Post: a = m + n

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Proving assertions in programs // Pre: 10 <= y // Pre: true // Pre: 0 <= n y := y + 1; if (a <= b) int a = m; z := x + y; min = a; int x = 0; else while (x < n) { // Post: x <= z min = b; a = a + 1; x = x + 1; // Post: min <= a && min <= b } // Post: a = m + n Model-checking vs Deductive Reasoning.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Floyd-Hoare Style of Program Verification Robert W. Floyd: “Assigning meanings to programs” Proceedings of the American Mathematical Society Symposia on Applied Mathematics (1967) C A R Hoare: “An axiomatic basis for computer programming”, Communications of the ACM (1969).

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Example proof y > 10 y ≥ 0 y := y + 1 y ≥ 1 z := x + y y ≥ 1 ∧ z = x + y z > x

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Example proof of add program n ≥ 0 n ≥ 0 a := m; n ≥ 0 ∧ a = m x := 0 a = m + x ∧ x ≤ n while (x < n) { a := a + 1 x := x + 1 a = m + n

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Problems with automating such proofs To check: { y > 10 } y := y + 1; z := x + y; { x < z } Use the weakest precondition rules to generate the verification condition: ( y > 10) = ⇒ ( y > − 1) . Check the verification condition by asking a theorem prover / SMT solver if the formula ( y > 10) ∧ ¬ ( y > − 1) . is satisfiable.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants What about Programs with loops? Find an adequate and assume Pre inductive invariant Inv : S 1 Pre = ⇒ WP ( S 1 , Inv ) 1 (“inductive invariant”) invariant Inv while (b) { ( Inv ∧ b ) = ⇒ 2 WP ( S 2 , Inv ) (“inductive S 2 invariant”) } Inv ∧ ¬ b = ⇒ 3 WP ( S 3 , Post ) S 3 (“adequate”). assert Post

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Adequate loop invariant n ≥ 0 n ≥ 0 An adequate loop invariant needs to satisfy: a := m; { n ≥ 0 } a := m; x := 0 { a = m + x ∧ x ≤ n } . n ≥ 0 ∧ a = m { a = m + x ∧ x ≤ n ∧ x < n } a := a+1; x := x+1 { a = m + x ∧ x ≤ n } . x := 0 { a = m + x ∧ x ≤ n ∧ x ≥ n } skip a = m + x ∧ x ≤ n { a = m + n } . while (x < n) { Verification conditions are generated accordingly. a := a + 1 Note that a = m + x is not an adequate loop invariant. x := x + 1 a = m + n

� fi fi fi fi fi fi fi fi fi fi ffi fi fi Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Learning loop invariants Main hurdle in automating program verification is coming up with adequate loop invariants. Several white-box approaches have been used (CEGAR, Lazy Annotation, using interpolation, and tools like Slam/Blast, Synergy). Instead explore a black-box approach, based on a Teacher-Learner model. ---- --- + + + Learner Teacher Dynamic + + Program + + engine Constraint Solver H

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Black-box Learning for add program ( m �→ 2 , n �→ 3 , a �→ − , x �→ − ) ( m �→ 1 , n �→ 1 , − , − ) a := m; (2 , 3 , 2 , − ) x := 0 + (2 , 3 , 2 , 0) (1 , 1 , 1 , 0) (2 , 3 , 3 , 1) (1 , 1 , 2 , 1) (2 , 3 , 4 , 2) (2 , 3 , 5 , 3) − while (x < n) { (1 , 1 , 3 , 2) (2 , 3 , 2 , 0) a := a + 1 (2 , 3 , 3 , 0) x := x + 1 } (2 , 3 , 5 , 3) (1 , 1 , 2 , 1)

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree Based Learning Given a set of positive samples S + and negative samples S − , learn a predicate H from a given concept class. Example concept class: Boolean combinations of atomic predicates of the form x ≤ c , where x is a prog variable and c ≤ 10. Or octagonal constraints + x + y ≤ c ... A brute-force search is always possible, but we would like to be more efficient in practice.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree learning algorithm Maintain a tree whose nodes correspond to subsets of the sample points Root node contains all given samples Choose a non-finished node n , and an attribute a to split on. Create two children n a and n ¬ a of n with corresponding subset of samples. If a node is “homogeneous”, mark it pos/neg and finished. Recurse till all nodes are finished. Output predicate corresponding to disjunction of all positive nodes.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree learning by example 5 _ + _ _ _ + + + + 5 5 y ≤ 1 _ + 5 5 _ + _ _ _ _ + + + _ _ + + 5 5 x ≤ 3 + + 5 5 + _ + _ _ _ + 5 5 5 Predicate learnt: y ≤ 1 ∨ ( y > 1 ∧ x > 3).

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Choosing attribute based on entropy If n has P positive and N negative samples: Entropy ( n ) = P + N · log P + N P + N · log P + N P N − − N P Entropy measures reduction in uncertainty in number of bits. Gives us a measure of the “impurity” of a node. Choose attribute a which maximizes Entropy ( n ) − ( Entropy ( n a ) + Entropy ( n ¬ a )).

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree: Example where entropy does not do well 5 _ + _ + + _ + _ + 5 Best attribute would be y ≤ 1 followed by x ≤ 1, but entropy would choose x ≤ 3 as first split.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants ICE: The need for implication counterexamples Introduced by Garg, L¨ oding, Madhusudan, and Neider, in a assume Pre paper in CAV 2014. S 1 Just Examples (positive) and + invariant Inv Counterexamples (negative) are not _ while (b) { ? enough: the Teacher needs to give S 2 Implication samples as well. } This way the Teacher is honest, not precluding some candidate S 3 invariant by an arbitrary answer. assert Post Leads to a robust learning framework.

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants ICE learning by example Learner conjectures H : 1 m ≤ n ∧ x ≤ a ( m �→ 2 , n �→ 3 , a �→ − , x �→ − ) ( m �→ 1 , n �→ 1 , − , − ) a := m; (2 , 3 , 2 , − ) x := 0 + (2 , 2 , 4 , 1) (2 , 3 , 2 , 0) (1 , 1 , 1 , 0) (2 , 3 , 3 , 1) (1 , 1 , 2 , 1) (2 , 3 , 4 , 2) (2 , 2 , 5 , 2) (2 , 3 , 5 , 3) while (x < n) { − (1 , 1 , 3 , 2) (2 , 3 , 2 , 0) a := a + 1 (2 , 3 , 3 , 0) x := x + 1 } (2 , 3 , 5 , 3) (1 , 1 , 2 , 1)

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants ICE learning by example Learner conjectures H : 1 m ≤ n ∧ x ≤ a ( m �→ 2 , n �→ 3 , a �→ − , x �→ − ) 2 Teacher replies with Example: ( m �→ 1 , n �→ 1 , − , − ) (2 , 1 , 2 , 0). a := m; (2 , 3 , 2 , − ) x := 0 + (2 , 2 , 4 , 1) (2 , 3 , 2 , 0) (1 , 1 , 1 , 0) (2 , 3 , 3 , 1) (1 , 1 , 2 , 1) (2 , 3 , 4 , 2) (2 , 2 , 5 , 2) (2 , 3 , 5 , 3) while (x < n) { − (1 , 1 , 3 , 2) (2 , 3 , 2 , 0) a := a + 1 (2 , 3 , 3 , 0) x := x + 1 } (2 , 3 , 5 , 3) (1 , 1 , 2 , 1)

Decision Tree Based Learning of Program Invariants Deepak DSouza - PowerPoint PPT Presentation

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree Based Learning of Program Invariants Deepak DSouza Department of Computer Science and Automation Indian Institute of Science,

Decision Tree Decision Trees A decision tree is a decision support tool that uses a tree-like

Learning Decision Trees Representation is a decision tree. Bias is towards simple decision

Are Hybrid Physical Designs Important? 1 B+ tree 2 C O L B+ tree 3 ? C O L C O L B+ tree

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

The Learning Tree Workshop: The Learning Tree Workshop: Experience-based Learning Series on

Decision tree learning Aim: find a small tree consistent with the training examples Idea:

Tree-sitter @maxbrunsfeld What is Tree-sitter? Why I wrote Tree-sitter What were

Decision Tree R Greiner Cmput 466 / 551 Learning Decision Trees Def'n: Decision Trees

Decision Tree and Automata Learning Stefan Edelkamp 1 Overview - Decision tree representation

Searching for Program Invariants using Genetic Programming and Mutation Testing Sam Ratcliff,

A Brief History of Decision Tree Implementation MAX AUSTIN Overview Famous Decision Tree

Final Examples Announcements Trees Tree-Structured Data def tree(label, branches=[]): A tree

Session 12 Tree-based models: tree and rpart Two libraries The tree library is like the

K-theoretic Gromov-Witten invariants and derived algebraic geometry Marco Robalo (IMJ-PRG, UPMC)

Integer Invariants of Abelian Cayley Graphs Deelan Jalil James Madison University July 26, 2013

Characterizing Algebraic Invariants by Differential Radical Invariants Khalil Ghorbal Carnegie

PROGRAM OVERVIEW 03/26/2019 Page 1 of 16 FINAL PRESENTATION Prioritization Methodology

Deep Learning: multi-layer neural networks Recurrent Neural Networks: sequence data Long

By Herb Blank Over the past six months, I have led the team that developed the Thomson Reuters

The (Random) Forest for the (Decision) Trees William Warfel Office of Institutional Research

Using Openstreetmap crowdsourced data and La Landsat im imagery for la land cover mapping in

Evaluation of Park Harrison Brown for R244 Park: An Open Platform for Learning- Augmented

Colorado Assessment Tool Project Purposes and Operationalization of the Intake & Eligibility

Agenda Background and History What is the new proposal? Why this route? Why for us?

Decision Tree Based Learning of Program Invariants Deepak DSouza - PowerPoint PPT Presentation

Floyd-Hoare Style Verification Decision Tree Learning ICE Learning Proofs with Multiple Invariants Decision Tree Based Learning of Program Invariants Deepak DSouza Department of Computer Science and Automation Indian Institute of Science,

Decision Tree Decision Trees A decision tree is a decision support tool that uses a tree-like

Learning Decision Trees Representation is a decision tree. Bias is towards simple decision

Are Hybrid Physical Designs Important? 1 B+ tree 2 C O L B+ tree 3 ? C O L C O L B+ tree

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

The Learning Tree Workshop: The Learning Tree Workshop: Experience-based Learning Series on

Decision tree learning Aim: find a small tree consistent with the training examples Idea:

Tree-sitter @maxbrunsfeld What is Tree-sitter? Why I wrote Tree-sitter What were

Decision Tree R Greiner Cmput 466 / 551 Learning Decision Trees Def'n: Decision Trees

Decision Tree and Automata Learning Stefan Edelkamp 1 Overview - Decision tree representation

Searching for Program Invariants using Genetic Programming and Mutation Testing Sam Ratcliff,

A Brief History of Decision Tree Implementation MAX AUSTIN Overview Famous Decision Tree

Final Examples Announcements Trees Tree-Structured Data def tree(label, branches=[]): A tree

Session 12 Tree-based models: tree and rpart Two libraries The tree library is like the

K-theoretic Gromov-Witten invariants and derived algebraic geometry Marco Robalo (IMJ-PRG, UPMC)

Integer Invariants of Abelian Cayley Graphs Deelan Jalil James Madison University July 26, 2013

Characterizing Algebraic Invariants by Differential Radical Invariants Khalil Ghorbal Carnegie

PROGRAM OVERVIEW 03/26/2019 Page 1 of 16 FINAL PRESENTATION Prioritization Methodology

Deep Learning: multi-layer neural networks Recurrent Neural Networks: sequence data Long

By Herb Blank Over the past six months, I have led the team that developed the Thomson Reuters

The (Random) Forest for the (Decision) Trees William Warfel Office of Institutional Research

Using Openstreetmap crowdsourced data and La Landsat im imagery for la land cover mapping in

Evaluation of Park Harrison Brown for R244 Park: An Open Platform for Learning- Augmented

Colorado Assessment Tool Project Purposes and Operationalization of the Intake &amp; Eligibility

Agenda Background and History What is the new proposal? Why this route? Why for us?

Colorado Assessment Tool Project Purposes and Operationalization of the Intake & Eligibility