Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog - PowerPoint PPT Presentation

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog 1 , Marinela Parovic 1 , us Gallego Arias 2 , and Milos Gligoric 1 Emilio Jes´ ASE 2019 1 2 1 / 24

Program Verification Using Proof Assistants Verified software: encode program in formalism, write specifications for functions and prove them Proof assistants: prove specifications interactively Coq, HOL4, HOL Light, Isabelle/HOL, Lean, Nuprl, ... Verified executable systems built using proof assistants are reaching unprecedented scale CompCert (C compiler), 8 person years, 120k LOC seL4 (OS kernel), 25 person years, 200k LOC Verdi Raft (consensus protocol), 2 person years, 50k LOC These systems are being deployed CompCert – embedded systems seL4 – military autonomous vehicles 2 / 24

An Example of a Verified Function github.com/uwplse/StructTact Using Coq proof assistant Dependency of large verified systems, including Verdi Raft and Oeuf compiler From mathcomp Require Import all_ssreflect. Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v 3 / 24

An Example of a Verified Function github.com/uwplse/StructTact Using Coq proof assistant Dependency of large verified systems, including Verdi Raft and Oeuf compiler Import From mathcomp Require Import all_ssreflect. Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v 3 / 24

An Example of a Verified Function github.com/uwplse/StructTact Using Coq proof assistant Dependency of large verified systems, including Verdi Raft and Oeuf compiler From mathcomp Require Import all_ssreflect. Function Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v 3 / 24

An Example of a Verified Function github.com/uwplse/StructTact Using Coq proof assistant Dependency of large verified systems, including Verdi Raft and Oeuf compiler Specification From mathcomp Require Import all_ssreflect. Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v 3 / 24

An Example of a Verified Function github.com/uwplse/StructTact Using Coq proof assistant Dependency of large verified systems, including Verdi Raft and Oeuf compiler From mathcomp Require Import all_ssreflect. Proof Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v 3 / 24

Problem: Incomplete and Missing Specifications StructTact Verdi Raft incomplete spec Specifications might not cover the core parts of the function Some functions might have no specification at all Usage of such functions could lead to surprises and even bugs How do we detect incomplete and missing specifications? 4 / 24

Our Contributions 1 Introduce mutation proving for proof assistants libraries Mutate functions and check if any lemma fails No lemma fails (live mutant) may indicate incomplete spec Analogous to mutation testing 2 Implement mutation proving for Coq libraries, mCoq 3 Optimize mCoq with selective and parallel proof checking 4 Quantitatively evaluate mCoq on 12 popular Coq libraries 5 Qualitatively evaluate dozens of live mutants and report incomplete specifications 5 / 24

Mutation Proving Example, Original Code From mathcomp Require Import all_ssreflect. Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v Proof passes 6 / 24

Mutation Proving Example, Mutated Code From mathcomp Require Import all_ssreflect. Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false - | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) + | a :: l’ ⇒ (f a == true) || (g a == true && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v 7 / 24

Mutation Proving Example, Mutated Code From mathcomp Require Import all_ssreflect. Fixpoint before_func A (f g : A → bool) (l : list A) : bool := match l with | [::] ⇒ false - | a :: l’ ⇒ (f a == true) || (g a == false && before_func A f g l’) + | a :: l’ ⇒ (f a == true) || (g a == true && before_func A f g l’) end. Lemma before_func_app : ∀ A (f g : A → bool) (l l’ : list A), before_func A f g l → before_func A f g (l + + l’). Proof. intros;induction l ⇒ /=; intuition; move/orP: H; case; [by move/eqP → |]. by move/andP ⇒ [H1 H2]; rewrite H1 /=; apply/orP; right; apply IHl. Qed. Before.v Proof still passes → Live mutant → Incomplete specification 7 / 24

mCoq Architecture and Workflow sexp parser .v file 1 transformer sercomp 2 Coq QMutator SerAPI compser 5 3 4 sexp file 8 / 24

mCoq Architecture and Workflow sexp parser .v file 1 transformer sercomp 2 Coq QMutator SerAPI compser 5 3 4 sexp file Fixpoint.. Lemma.. Proof.. Qed. 8 / 24

mCoq Architecture and Workflow sexp parser .v file 1 transformer sercomp 2 Coq QMutator SerAPI compser 5 3 4 sexp file sexp 1 Fixpoint.. 2 Lemma.. Proof.. Qed. 8 / 24

mCoq Architecture and Workflow sexp parser .v file 1 transformer sercomp 2 Coq QMutator SerAPI compser 5 3 4 sexp file mutants sexp 1 3 Fixpoint.. 2 4 Lemma.. Proof.. Qed. 8 / 24

mCoq Architecture and Workflow sexp parser .v file 1 transformer sercomp 2 Coq QMutator SerAPI compser 5 3 4 sexp file mutants 5 live sexp 1 3 Fixpoint.. killed 2 4 live Lemma.. killed Proof.. live Qed. killed 8 / 24

mCoq Components (1): Serialization and Deserialization SerAPI extended OCaml library supporting full (de)serialization of Coq code sercomp command-line SerAPI -based OCaml program which takes Coq .v file and outputs lists of sexps compser command-line SerAPI -based program which takes lists of sexps and performs proof checking with Coq 9 / 24

mCoq Components (2): QMutator and Runner QMutator sexp transformation library in Java that performs mutations given mutation operator and location Runner driver program in Java and bash to orchestrate components and compute mutation scores 10 / 24

Mutation Operators Inspired by our experience (17 years cumulative) and mutation operators for functional languages Category Name Description Reorder branches in if-else expression GIB General GIC Reverse constructor order in inductive type GME Replace exp in the 2nd match case with 1st case exp LRH Replace list with head singleton list LRT Replace list with its tail LRE Replace list with empty list Lists Reorder arguments to the list append operator LAR LAF Replace list append expression with first argument Replace list append expression with second argument LAS Replace plus with minus NPM NZO Replace zero with one Numbers NSZ Replace successor constructor with zero NSA Replace successor constructor with its argument BFT Replace false with true Booleans BTF Replace true with false 11 / 24

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog - PowerPoint PPT Presentation

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog 1 , Marinela Parovic 1 , us Gallego Arias 2 , and Milos Gligoric 1 Emilio Jes ASE 2019 1 2 1 / 24 Program Verification Using Proof Assistants Verified software: encode

COQ DEVELOPMENT TEAM SESSION Coq Development Team Coq Workshop 2019 Portland Sep 8th, 2019

Coq Coq Codet! Towards a Verified Toolchain for Coq in MetaCoq Matthieu Sozeau . r 2 , Inria

The Coq Proof Script Visualiser (coq-psv) Coq Workshop 2020, Virtual Mario Frank

uf: Minimizing the Coq Extraction TCB Eric Mullen , Stuart Pernsteiner, James Wilcox, Zachary

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

The Coq proof assistant : From graphical presentation to principles and practice Coq syntax

Experience Report: Smuggling a Little Bit of Coq Inside a CAD Development Context Dimitur Krustev

a Coq retrospective at the heart of Coq architecture the genesis of version 7.0

The Coq proof assistant : principles and practice J.-F. Monin Universit Grenoble Alpes 2016

Drift, mutation, selection, and the evolutionary dispersion of mean phenotypes recombination

Equivalence Modulo States Bo Wang, Yingfei Xiong , Yangqingwei Shi, Lu Zhang, Dan Hao Peking

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 & Gregory

Impredicativity in Coq Yotam Dvir Tel-Aviv University 2019-11-20 Today 1. What is

Designing a state transaction machine for Coq Bruno Barras & Enrico Tassi 12 Aug 2012

Learning to Format Coq Code Using Language Models Pengyu Nie 1 , Karl Palmskog 2 , Junyi Jessy Li

Simultaneous Identification of Multiple Driver Pathways in Cancer Mark D. M. Leiserson, et.al

BRCA Founder Mutations Ashkenazi Jewish (Hungarian and Russian): BRCA1 - 185delAG and

Mutation testing at SAAB Niklas Pettersson niklas.pettersson2@saabgroup.com Joakim Brnnstrm

Mutation Testing Software Engineering Gordon Fraser Saarland University 1 The most common

Maximal green sequences of minimal mutation-infinite quivers John Lawson joint with Matthew

Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a protein is a replacement of

Mutation Testing How good your tests are 2017 whoami iOS Developer by day compiler

Semantic Mutation Testing John A. Clark, Haitao Dan, Robert M Hierons The 8th CREST Open

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog - PowerPoint PPT Presentation

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog 1 , Marinela Parovic 1 , us Gallego Arias 2 , and Milos Gligoric 1 Emilio Jes ASE 2019 1 2 1 / 24 Program Verification Using Proof Assistants Verified software: encode

COQ DEVELOPMENT TEAM SESSION Coq Development Team Coq Workshop 2019 Portland Sep 8th, 2019

Coq Coq Codet! Towards a Verified Toolchain for Coq in MetaCoq Matthieu Sozeau . r 2 , Inria

The Coq Proof Script Visualiser (coq-psv) Coq Workshop 2020, Virtual Mario Frank

uf: Minimizing the Coq Extraction TCB Eric Mullen , Stuart Pernsteiner, James Wilcox, Zachary

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

The Coq proof assistant : From graphical presentation to principles and practice Coq syntax

Experience Report: Smuggling a Little Bit of Coq Inside a CAD Development Context Dimitur Krustev

a Coq retrospective at the heart of Coq architecture the genesis of version 7.0

The Coq proof assistant : principles and practice J.-F. Monin Universit Grenoble Alpes 2016

Drift, mutation, selection, and the evolutionary dispersion of mean phenotypes recombination

Equivalence Modulo States Bo Wang, Yingfei Xiong , Yangqingwei Shi, Lu Zhang, Dan Hao Peking

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 &amp; Gregory

Impredicativity in Coq Yotam Dvir Tel-Aviv University 2019-11-20 Today 1. What is

Designing a state transaction machine for Coq Bruno Barras &amp; Enrico Tassi 12 Aug 2012

Learning to Format Coq Code Using Language Models Pengyu Nie 1 , Karl Palmskog 2 , Junyi Jessy Li

Simultaneous Identification of Multiple Driver Pathways in Cancer Mark D. M. Leiserson, et.al

BRCA Founder Mutations Ashkenazi Jewish (Hungarian and Russian): BRCA1 - 185delAG and

Mutation testing at SAAB Niklas Pettersson niklas.pettersson2@saabgroup.com Joakim Brnnstrm

Mutation Testing Software Engineering Gordon Fraser Saarland University 1 The most common

Maximal green sequences of minimal mutation-infinite quivers John Lawson joint with Matthew

Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a protein is a replacement of

Mutation Testing How good your tests are 2017 whoami iOS Developer by day compiler

Semantic Mutation Testing John A. Clark, Haitao Dan, Robert M Hierons The 8th CREST Open

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 & Gregory

Designing a state transaction machine for Coq Bruno Barras & Enrico Tassi 12 Aug 2012