Decision Procedures for Algebraic Data Types with Abstractions - PowerPoint PPT Presentation

Decision Procedures for Algebraic Data Types with Abstractions Philippe Suter, Mirco Dotta and Viktor Kuncak

Verification of functional programs proof counterexample (input, trace)

sealed abstract class Tree case class Node(left: Tree, value: Int, right: Tree) extends Tree case class Leaf() extends Tree object BST { def add(tree: Tree, element: Int): Tree = tree match { case Leaf() ⇒ Node(Leaf(), element, Leaf()) case Node(l, v, r) if v > element ⇒ Node(add(l, element), v, r) case Node(l, v, r) if v < element ⇒ Node(l, v, add(r, element)) case Node(l, v, r) if v == element ⇒ tree } ensuring ( result ≠ Leaf()) } (tree = Node(l, v, r) ∧ v > element ∧ result ≠ Leaf()) ⇒ Node(result, v, r) ≠ Leaf() We know how to generate verification conditions for functional programs

Proving verification conditions (tree = Node(l, v, r) ∧ v > element ∧ result ≠ Leaf()) ⇒ Node(result, v, r) ≠ Leaf() D.C. Oppen, Reasoning about Recursively Defined Data Structures , POPL ’ 78 G. Nelson, D.C. Oppen, Simplification by Cooperating Decision Procedure , TOPLAS ’ 79 Previous work gives decision procedures that can handle certain verification conditions

sealed abstract class Tree case class Node(left: Tree, value: Int, right: Tree) extends Tree case class Leaf() extends Tree object BST { def add(tree: Tree, element: Int): Tree = tree match { case Leaf() ⇒ Node(Leaf(), element, Leaf()) case Node(l, v, r) if v > element ⇒ Node(add(l, element), v, r) case Node(l, v, r) if v < element ⇒ Node(l, v, add(r, element)) case Node(l, v, r) if v == element ⇒ tree } ensuring (content( result ) == content(tree) ∪ { element }) def content(tree: Tree) : Set[Int] = tree match { case Leaf() ⇒ ∅ case Node(l, v, r) ⇒ content(l) ∪ { v } ∪ content(r) } }

Complex verification condition Set Expressions t 1 = Node(t 2 , e 1 , t 3 ) ∧ content(t 4 ) = content(t 2 ) ∪ { e 2 } ∧ content(Node(t 4 , e 1 , t 3 )) ≠ content(t 1 ) ∪ { e 2 } where def content(tree: Tree) : Set[Int] = tree match { case Leaf() ⇒ ∅ case Node(l, v, r) ⇒ content(l) ∪ { v } ∪ content(r) } Recursive Function Algebraic Data Types

Our contribution Decision procedures for extensions of algebraic data types with certain recursive functions

Formulas we aim to prove Quantifier-free Formula t 1 = Node(t 2 , e 1 , t 3 ) ∧ content(t 4 ) = content(t 2 ) ∪ { e 2 } ∧ content(Node(t 4 , e 1 , t 3 )) ≠ content(t 1 ) ∪ { e 2 } where def content(tree: Tree) : Set[Int] = tree match { case Leaf() ⇒ ∅ case Node(l, v, r) ⇒ content(l) ∪ { v } ∪ content(r) } Generalized Fold Function Domain with a Decidable Theory

General form of our recursive functions empty : C combine : ( C , E , C) → C def content(tree: Tree) : Set[Int] = tree match { def α (tree: Tree) : C = tree match { case Leaf() ⇒ ∅ case Leaf() ⇒ empty case Node(l, v, r) ⇒ content(l) ∪ { v } ∪ content(r) case Node(l, v, r) ⇒ combine ( α (l), v, α (r)) } }

Scope of our result - Examples Tree content abstraction, as a: Set *Kuncak,Rinard’07+ Multiset *Piskac,Kuncak’08+ List *Plandowski’04+ Tree size, height, min *Papadimitriou’81+ Invariants (sortedness ,…) *Nelson,Oppen’79+

How do we prove such formulas? Quantifier-free Formula t 1 = Node(t 2 , e 1 , t 3 ) ∧ content(t 4 ) = content(t 2 ) ∪ { e 2 } ∧ content(Node(t 4 , e 1 , t 3 )) ≠ content(t 1 ) ∪ { e 2 } where def content(tree: Tree) : Set[Int] = tree match { case Leaf() ⇒ ∅ case Node(l, v, r) ⇒ content(l) ∪ { v } ∪ content(r) } Generalized Fold Function Domain with a Decidable Theory

Separate the Conjuncts t 1 = Node(t 2 , e 1 , t 3 ) ∧ content(t 4 ) = content(t 2 ) ∪ { e 2 } ∧ content(Node(t 4 , e 1 , t 3 )) ≠ content(t 1 ) ∪ { e 2 } t 1 = Node(t 2 , e 1 , t 3 ) ∧ t 5 = Node(t 4 , e 1 , t 3 ) ∧ c 4 = c 2 ∪ { e 2 } ∧ c 5 ≠ c 1 ∪ { e 2 } ∧ c 1 = content(t 1 ) ∧ … ∧ c 5 = content(t 5 )

1 1 t 1 t 1 t 4 t 4 7 7 4 4 = t 2 t 2 0 0 2 2 t 3 t 3 t 5 t 5 ∪ 4 content = c 4 = ∪ c 2 2 ∅ c 3 = c 4 = { 4 } ∪ { 2 } ∪ ∅ ∪ c 3 ∪ c 2

Overview of the decision procedure tree constraints from the input formula set constraints from the input formula c 4 = c 2 ∪ { e 2 } ∧ c 5 ≠ c 1 ∪ { e 2 } t 1 = Node(t 2 , e 1 , t 3 ) ∧ t 5 = Node(t 4 , e 1 , t 3 ) mappings from the input formula c i = content(t i ), i ∈ , 1, …, 5 - c 4 = c 2 ∪ { e 2 } c 5 ≠ c 1 ∪ { e 2 } ∧ c 1 = c 2 ∪ { e 1 } ∪ c 3 c 1 = c 2 ∪ { e 1 } ∪ c 3 ∧ ∧ c 5 = c 4 ∪ { e 1 } ∪ c 3 c 5 = c 4 ∪ { e 1 } ∪ c 3 ∧ resulting formula additional derived constraints The resulting formula is in the Decision decidable theory of sets Procedure for Sets

What we have seen is a simple correct algorithm But is it complete?

A verifier based on such procedure val c1 = content(t1) val c2 = content(t2) if (t1 ≠ t2) , if (c1 == ∅ ) { assert (c2 ≠ ∅ ) x = c2.chooseElement } } Warning : possible assertion violation c 1 = content(t 1 ) ∧ c 2 = content(t 2 ) ∧ t 1 ≠ t 2 ∧ c 1 = ∅ ∧ c 2 = ∅

Source of incompleteness c 1 = content(t 1 ) ∧ c 2 = content(t 2 ) ∧ t 1 ≠ t 2 ∧ c 1 = ∅ ∧ c 2 = ∅ t 1 ≠ t 2 … ∅ c 1 = ∅ ∧ c 2 = ∅ Models for the formula in the logic of sets must not contradict the disequalities over trees

How to make the algorithm complete • Case analysis for each tree variable: – is it Leaf ? – Is it not Leaf ? c 1 = content(t 1 ) ∧ c 2 = content(t 2 ) ∧ t 1 ≠ t 2 ∧ c 1 = ∅ ∧ c 2 = ∅ ∧ t 1 = Leaf ∧ t 2 = Node(t 3 , e, t 4 ) ∧ t 1 = Leaf ∧ t 2 = Leaf ∧ t 1 = Node(t 3 , e 1 , t 4 ) ∧ t 2 = Node(t 5 , e 2 , t 6 ) ∧ t 1 Node(t 3 , e, t 4 ) ∧ t 2 = Leaf This gives a complete decision procedure for the content function that maps to sets

What about other content functions? Tree content abstraction, as a: Set Multiset List Tree size, height, min Invariants (sortedness ,…)

Sufficient Surjectivity How and when we can have a complete algorithm

Choice of trees is constrained by sets tree constraints from the input formula set constraints from the input formula c 4 = c 2 ∪ { e 2 } ∧ c 5 ≠ c 1 ∪ { e 2 } t 1 = Node(t 2 , e 1 , t 3 ) ∧ t 5 = Node(t 4 , e 1 , t 3 ) mappings from the input formula c i = content(t i ), i ∈ , 1, …, 5 - c 4 = c 2 ∪ { e 2 } c 5 ≠ c 1 ∪ { e 2 } ∧ c 1 = c 2 ∪ { e 1 } ∪ c 3 c 1 = c 2 ∪ { e 1 } ∪ c 3 ∧ ∧ c 5 = c 4 ∪ { e 1 } ∪ c 3 c 5 = c 4 ∪ { e 1 } ∪ c 3 ∧ resulting formula additional derived constraints Decision Procedure for Sets

Inverse images • When we have a model for c 1 , c 2 , … how can we pick distinct values for t 1 , t 2 ,… ? t i ∈ content -1 (c i ) ⇔ c i = content(t i ) α α -1 The cardinality of α -1 (c i ) is what matters.

‘ Surjectivity ’ of set abstraction content -1 ∅ content -1 { 1, 5 } 5 5 5 1 … 1 1 1 5 |content -1 ( ∅ )| = 1 |content -1 (,1, 5-)| = ∞

In-order traversal 2 inorder - 1 [ 1, 2, 4, 7 ] 7 4

‘ Surjectivity ’ of in -order traversal inorder -1 [ ] 1 inorder -1 5 [ 1, 5 ] 5 1 |inorder -1 ( list )| = (number of trees of size n = length( list ))

More trees map to longer lists |inorder -1 ( list )| … … length( list )

An abstraction function α (e.g. content, inorder) is sufficiently surjective if and only if, for each number p > 0 , there exist, computable as a function of p : - a finite set of shapes S p - a closed formula M p in the collection theory such that M p (c) implies | α -1 (c)| > p such that, for every term t, M p ( α (t)) or š(t) in S p . Pick p sufficiently large. Guess which trees have a problematic shape. Guess their shape and their elements. By construction values for all other trees can be found.

Generalization of the Independence of Disequations Lemma For a conjunction of n disequalities over tree terms, if for each term we can pick a value from a set of trees of size at least n+1 , then we can pick values that satisfy all disequalities. We can make sure there will be sufficiently many trees to choose from.

Sufficiently surjectivity holds in practice Theorem: For every sufficiently surjective abstraction our procedure is complete. Theorem: The following abstractions are sufficiently surjective: set content, multiset content, list (any-order), tree height, tree size, minimum, sortedness A complete decision procedure for all these cases!

Related Work G. Nelson, D.C. Oppen, Simplification by Cooperating Decision Procedure , TOPLAS ’ 79 V. Sofronie-Stokkermans, Locality Results for Certain Extensions of Theories with Bridging Functions , CADE ’ 09 Some implemented systems: ACL2, Isabelle, Coq, Verifun, Liquid Types

Decision Procedures for Algebraic Data Types with Abstractions • Reasoning about functional programs reduces to proving formulas • Decision procedures always find a proof or a counterexample • Previous decision procedures handle recursion-free formulas • We introduced decision procedures for formulas with recursive fold functions

Thank you !

Extra Slides

Decision procedure for data structure hierarchy tree bag (multiset) mcontent setof msize set 7 ssize 3 Supports all natural operations on trees, multisets, sets, and homomorphisms between them

Decision Procedures for Algebraic Data Types with Abstractions - PowerPoint PPT Presentation

Decision Procedures for Algebraic Data Types with Abstractions Philippe Suter, Mirco Dotta and Viktor Kuncak Verification of functional programs proof counterexample (input, trace) sealed abstract class Tree case class Node(left: Tree, value:

Algebraic Data Types Mark Hibberd Mar 28, 2011 Mark Hibberd Algebraic Data Types Overview

An Implementation of Algebraic Data Types in Java using the Visitor Pattern Anton Setzer 1.

Algebraic Data Types Christine Rizkallah CSE, UNSW Term 3 2020 1 Composite Data Types as

The ABCs of ADTs Algebraic Data Types Justin Lubin January 18, 2018 Asynchronous Anonymous @

DataCamp Data Types for Data Science DataCamp Data Types for Data Science Data types Data type

Algebraic Data Types Christine Rizkallah CSE, UNSW (and data61) Term 3 2019 1 Composite Data

Algebraic Data Types Christine Rizkallah CSE, UNSW (and data61) Term 3 2019 1 Composite Data

Decision Procedures for Verification Combinations of Decision Procedures (3) 4.02.2019 Viorica

Decision Procedures for Verification Combinations of decision procedures (4) 10.02.2015 Viorica

Decision Procedures An Algorithmic Point of View Part I Decision Procedures for Propositional

Decision Procedures for Verification Decision Procedures (3) 12.01.2017 Viorica

Decision Procedures An Algorithmic Point of View Decision Procedures for Propositional Logic D.

The Calculus of Computation: Decision Procedures with 10. Combining Decision Procedures

Decision Procedures 1: Survey of decision procedures John Harrison Intel Corporation TYPES

Types Dynamic types Types are broken down into many categories Static types Duck typing

Rewrite-based decision procedures Maria Paola Bonacina Dipartimento di Informatica Universit` a

Raising to generic powers Jonathan Kirby University of Oxford Modnet conference, Barcelona, 2008

Complexity Optimal Decision Procedure for PDL with Parallel Composition Joseph Boudou IRIT,

Modular Type Checking With Decision Procedures Iavor S. Diatchki Galois Inc Iavor S. Diatchki

Separation logic and fragments: from expressive power to decision procedures St ephane Demri

Data structures with arithmetic constraints: non-disjoint combinations E. Nicolini, C.

Seminar Decision Procedures and Applications Instructor: Viorica Sofronie-Stokkermans Universit

Verified Decision Procedures for Equivalence of Regular Expressions Tobias Nipkow & Dmitriy

Software Verification : Introduction Ranjit Jhala, UC San Diego April 4, 2013 What is

Decision Procedures for Algebraic Data Types with Abstractions - PowerPoint PPT Presentation

Decision Procedures for Algebraic Data Types with Abstractions Philippe Suter, Mirco Dotta and Viktor Kuncak Verification of functional programs proof counterexample (input, trace) sealed abstract class Tree case class Node(left: Tree, value:

Algebraic Data Types Mark Hibberd Mar 28, 2011 Mark Hibberd Algebraic Data Types Overview

An Implementation of Algebraic Data Types in Java using the Visitor Pattern Anton Setzer 1.

Algebraic Data Types Christine Rizkallah CSE, UNSW Term 3 2020 1 Composite Data Types as

The ABCs of ADTs Algebraic Data Types Justin Lubin January 18, 2018 Asynchronous Anonymous @

DataCamp Data Types for Data Science DataCamp Data Types for Data Science Data types Data type

Algebraic Data Types Christine Rizkallah CSE, UNSW (and data61) Term 3 2019 1 Composite Data

Algebraic Data Types Christine Rizkallah CSE, UNSW (and data61) Term 3 2019 1 Composite Data

Decision Procedures for Verification Combinations of Decision Procedures (3) 4.02.2019 Viorica

Decision Procedures for Verification Combinations of decision procedures (4) 10.02.2015 Viorica

Decision Procedures An Algorithmic Point of View Part I Decision Procedures for Propositional

Decision Procedures for Verification Decision Procedures (3) 12.01.2017 Viorica

Decision Procedures An Algorithmic Point of View Decision Procedures for Propositional Logic D.

The Calculus of Computation: Decision Procedures with 10. Combining Decision Procedures

Decision Procedures 1: Survey of decision procedures John Harrison Intel Corporation TYPES

Types Dynamic types Types are broken down into many categories Static types Duck typing

Rewrite-based decision procedures Maria Paola Bonacina Dipartimento di Informatica Universit` a

Raising to generic powers Jonathan Kirby University of Oxford Modnet conference, Barcelona, 2008

Complexity Optimal Decision Procedure for PDL with Parallel Composition Joseph Boudou IRIT,

Modular Type Checking With Decision Procedures Iavor S. Diatchki Galois Inc Iavor S. Diatchki

Separation logic and fragments: from expressive power to decision procedures St ephane Demri

Data structures with arithmetic constraints: non-disjoint combinations E. Nicolini, C.

Seminar Decision Procedures and Applications Instructor: Viorica Sofronie-Stokkermans Universit

Verified Decision Procedures for Equivalence of Regular Expressions Tobias Nipkow &amp; Dmitriy

Software Verification : Introduction Ranjit Jhala, UC San Diego April 4, 2013 What is

Verified Decision Procedures for Equivalence of Regular Expressions Tobias Nipkow & Dmitriy