The C standard formalized in Coq, whats next? Robbert Krebbers - PowerPoint PPT Presentation

The C standard formalized in Coq, what’s next? Robbert Krebbers Aarhus University, Denmark May 13, 2016 @ Cambridge Computer Laboratory, UK 1

What is this program supposed to do? The C quiz, question 1 int main() { int x; int y = (x = 3) + (x = 4); printf("x=%d,y=%d\n", x, y); } 2

What is this program supposed to do? The C quiz, question 1 int main() { int x; int y = (x = 3) + (x = 4); printf("x=%d,y=%d\n", x, y); } Let us try some compilers ◮ Clang prints x=4,y=7 , seems just left-right 2

What is this program supposed to do? The C quiz, question 1 int main() { int x; int y = (x = 3) + (x = 4); printf("x=%d,y=%d\n", x, y); } Let us try some compilers ◮ Clang prints x=4,y=7 , seems just left-right ◮ GCC prints x=4,y=8 , does not correspond to any order 2

What is this program supposed to do? The C quiz, question 1 int main() { int x; int y = (x = 3) + (x = 4); printf("x=%d,y=%d\n", x, y); } Let us try some compilers ◮ Clang prints x=4,y=7 , seems just left-right ◮ GCC prints x=4,y=8 , does not correspond to any order This program violates the sequence point restriction ◮ due to two unsequenced writes to x ◮ resulting in undefined behavior ◮ thus both compilers are right 2

Underspecification in C11 ◮ Unspecified behavior : two or more behaviors are allowed For example: order of evaluation in expressions (+57 more) ◮ Implementation defined behavior : like unspecified behavior, but the compiler has to document its choice For example: size and endianness of integers (+118 more) ◮ Undefined behavior: the standard imposes no requirements at all, the program is even allowed to crash For example: dereferencing a NULL or dangling pointer, signed integer overflow, . . . (+201 more) 3

Underspecification in C11 ◮ Unspecified behavior : two or more behaviors are allowed For example: order of evaluation in expressions (+57 more) Non-determinism ◮ Implementation defined behavior : like unspecified behavior, but the compiler has to document its choice For example: size and endianness of integers (+118 more) Parametrization ◮ Undefined behavior: the standard imposes no requirements at all, the program is even allowed to crash For example: dereferencing a NULL or dangling pointer, signed integer overflow, . . . (+201 more) No semantics/crash state 3

Why does C use underspecification that heavily? Pros for optimizing compilers: ◮ More optimizations are possible ◮ High run-time efficiency ◮ Easy to support multiple architectures 4

Why does C use underspecification that heavily? Pros for optimizing compilers: ◮ More optimizations are possible ◮ High run-time efficiency ◮ Easy to support multiple architectures Cons for programmers/formal methods people: ◮ Portability and maintenance problems ◮ Hard to capture precisely in a semantics ◮ Hard to formally reason about 4

Approaches to underspecification CompCert (Leroy et al. ) / VST (Appel et al. ) ◮ Main goal: verification of/w.r.t. CompCert compiler in Coq ◮ Semantics only needs to be correct for CompCert compiler For example: integer overflow and aliasing violations not UB KCC (Ellison & Rosu, Hathhorn et al. ) ◮ Main goal: compiler independent C11 semantics in K ◮ Describes most unspecified and undefined behavior ◮ No proof assistant support CH 2 O (Krebbers & Wiedijk) ◮ Main goal: compiler independent C11 semantics in Coq ◮ Describes all unspecified and undefined behavior ◮ Describes some implementation-defined behavior For example: no legacy architectures with 1s’ complement Cerberus (Sewell et al. ) ◮ Main goal: ‘ defacto ’ C11 semantics in LEM ◮ Improve standard to match the way C is used in practice 5

The CH 2 O project OCaml part Coq part C sources CH 2 O CH 2 O core C abstract C 6

The CH 2 O project OCaml part Coq part C sources Operational CH 2 O semantics CH 2 O core C abstract C Γ , δ ⊢ S 1 � S 2 6

The CH 2 O project OCaml part Coq part Typing judgment Γ ⊢ S : f main Type preservation C sources Type soundness & progress Operational CH 2 O semantics CH 2 O core C abstract C Γ , δ ⊢ S 1 � S 2 6

The CH 2 O project OCaml part Coq part Typing judgment Γ ⊢ S : f main Type preservation C sources Type soundness & progress Operational CH 2 O semantics CH 2 O core C abstract C Γ , δ ⊢ S 1 � S 2 Soundness & Completeness Executable semantics S 2 ∈ exec Γ ,δ S 1 6

The CH 2 O project OCaml part Coq part Typing judgment Γ ⊢ S : f main Type preservation C sources Type soundness & progress Operational CH 2 O semantics CH 2 O core C abstract C Γ , δ ⊢ S 1 � S 2 Soundness & Soundness & Completeness Completeness Pure expression Executable evaluation semantics [ [ e ] ] Γ ,ρ, m = ν S 2 ∈ exec Γ ,δ S 1 6

The CH 2 O project OCaml part Coq part Typing judgment Γ ⊢ S : f main Type preservation C sources Type soundness & progress Axiomatic Operational semantics Soundness CH 2 O semantics CH 2 O core C abstract C R , J , T ⊢ Γ ,δ Γ , δ ⊢ S 1 � S 2 { P } s { Q } Soundness & Soundness & Completeness Completeness Pure expression Executable evaluation semantics [ [ e ] ] Γ ,ρ, m = ν S 2 ∈ exec Γ ,δ S 1 6

The CH 2 O project OCaml part Coq part Refinement Typing judgment judgment S 1 ⊑ f Γ ⊢ S : f main Γ S 2 : f main Type preservation C sources Invariance Type soundness & progress Axiomatic Operational semantics Soundness CH 2 O semantics CH 2 O core C abstract C R , J , T ⊢ Γ ,δ Γ , δ ⊢ S 1 � S 2 { P } s { Q } Soundness & Soundness & Completeness Completeness Pure expression Executable evaluation semantics [ [ e ] ] Γ ,ρ, m = ν S 2 ∈ exec Γ ,δ S 1 6

Non-local control flow and block scope variables The C quiz, question 2 int *p = NULL; l: if (p) { return (*p); } else { int j = 17; p = &j; goto l; } 7

Non-local control flow and block scope variables The C quiz, question 2 int *p = NULL; memory: l: if (p) { p return (*p); } else { NULL int j = 17; p = &j; goto l; } 7

Non-local control flow and block scope variables The C quiz, question 2 int *p = NULL; memory: l: if (p) { p j return (*p); } else { NULL 17 int j = 17; p = &j; goto l; } 7

Non-local control flow and block scope variables The C quiz, question 2 int *p = NULL; memory: l: if (p) { p j return (*p); } else { • 17 int j = 17; p = &j; goto l; } 7

Non-local control flow and block scope variables The C quiz, question 2 int *p = NULL; memory: l: if (p) { p return (*p); } else { • int j = 17; p = &j; goto l; } 7

Non-local control flow and block scope variables The C quiz, question 2 int *p = NULL; memory: l: if (p) { p return (*p); } else { • int j = 17; p = &j; goto l; } C11, 6.2.4p2: the value of a pointer becomes indeterminate when the object it points to (or just past) reaches the end of its lifetime. = ⇒ Undefined behavior 7

Non-local control flow and block scope variables Goto considered harmful? http://xkcd.com/292/ 8

Non-local control flow and block scope variables Goto considered harmful? http://xkcd.com/292/ Not necessarily: ⊢ { P } . . . goto main_sub3; . . . { Q } 8

Non-local control flow and block scope variables Separation logic for non-local control Statement judgment: R , J , T ⊢ { P } s { Q } 9

Non-local control flow and block scope variables Separation logic for non-local control Statement judgment: R , J , T ⊢ { P } s { Q } where: ◮ { P } s { Q } is a Hoare triple, as usual 9

Non-local control flow and block scope variables Separation logic for non-local control Statement judgment: R , J , T ⊢ { P } s { Q } where: ◮ { P } s { Q } is a Hoare triple, as usual ◮ R has to hold to execute a return 9

Non-local control flow and block scope variables Separation logic for non-local control Statement judgment: R , J , T ⊢ { P } s { Q } where: ◮ { P } s { Q } is a Hoare triple, as usual ◮ R has to hold to execute a return ◮ J maps labels to their jumping condition When executing a goto l , the assertion J l has to hold 9

Non-local control flow and block scope variables Separation logic for non-local control Statement judgment: R , J , T ⊢ { P } s { Q } where: ◮ { P } s { Q } is a Hoare triple, as usual ◮ R has to hold to execute a return ◮ J maps labels to their jumping condition When executing a goto l , the assertion J l has to hold ◮ T maps break s/ continue s to their jumping condition 9

The C standard formalized in Coq, whats next? Robbert Krebbers - PowerPoint PPT Presentation

The C standard formalized in Coq, whats next? Robbert Krebbers Aarhus University, Denmark May 13, 2016 @ Cambridge Computer Laboratory, UK 1 What is this program supposed to do? The C quiz, question 1 int main() { int x; int y = (x = 3)

COQ DEVELOPMENT TEAM SESSION Coq Development Team Coq Workshop 2019 Portland Sep 8th, 2019

Coq Coq Codet! Towards a Verified Toolchain for Coq in MetaCoq Matthieu Sozeau . r 2 , Inria

The Coq Proof Script Visualiser (coq-psv) Coq Workshop 2020, Virtual Mario Frank

uf: Minimizing the Coq Extraction TCB Eric Mullen , Stuart Pernsteiner, James Wilcox, Zachary

Formalized Search for FC-Families c, Miodrag Filip Mari c, Bojan Vu ckovi Zivkovi

Experience Report: Smuggling a Little Bit of Coq Inside a CAD Development Context Dimitur Krustev

The Coq proof assistant : From graphical presentation to principles and practice Coq syntax

a Coq retrospective at the heart of Coq architecture the genesis of version 7.0

The Coq proof assistant : principles and practice J.-F. Monin Universit Grenoble Alpes 2016

A Separation Logic for Non-determinism and Sequence Points in C Formalized in Coq Robbert

Impredicativity in Coq Yotam Dvir Tel-Aviv University 2019-11-20 Today 1. What is

Designing a state transaction machine for Coq Bruno Barras & Enrico Tassi 12 Aug 2012

Learning to Format Coq Code Using Language Models Pengyu Nie 1 , Karl Palmskog 2 , Junyi Jessy Li

Learning to Format Coq Code Using Language Models Pengyu Nie 1 , Karl Palmskog 2 , Junyi Jessy Li

The Coq proof assistant : More on Prop and principles and practice Set J.-F. Monin Universit

The Coq proof assistant : inductive predicate principles and practice Well-founded induction

The Dual Simplex Method Combinatorial Problem Solving (CPS) Javier Larrosa Albert Oliveras

Outline Introduction to Parsing Regular languages revisited Ambiguity and Syntax Errors

Ubiquitous Computing Spring 2010 - Making Sense of Sensing

Arithmetic and Inference in a Large Theory Adam Pease, Infosys, Foothill Research Center

Taaltheorie en Taalverwerking BSc Artificial Intelligence Raquel Fernndez Institute for Logic,

Automatically Annotating Text with Linked Open Data Delia Rusu , Bla Fortuna, Dunja Mladeni

EQUAL Encyclopaedic QA for Lists Iustin Dornescu Research Group in Computational Linguistics,

Action recognition Cordelia Schmid INRIA Grenoble Action recognition examples Short

The C standard formalized in Coq, whats next? Robbert Krebbers - PowerPoint PPT Presentation

The C standard formalized in Coq, whats next? Robbert Krebbers Aarhus University, Denmark May 13, 2016 @ Cambridge Computer Laboratory, UK 1 What is this program supposed to do? The C quiz, question 1 int main() { int x; int y = (x = 3)

COQ DEVELOPMENT TEAM SESSION Coq Development Team Coq Workshop 2019 Portland Sep 8th, 2019

Coq Coq Codet! Towards a Verified Toolchain for Coq in MetaCoq Matthieu Sozeau . r 2 , Inria

The Coq Proof Script Visualiser (coq-psv) Coq Workshop 2020, Virtual Mario Frank

uf: Minimizing the Coq Extraction TCB Eric Mullen , Stuart Pernsteiner, James Wilcox, Zachary

Formalized Search for FC-Families c, Miodrag Filip Mari c, Bojan Vu ckovi Zivkovi

Experience Report: Smuggling a Little Bit of Coq Inside a CAD Development Context Dimitur Krustev

The Coq proof assistant : From graphical presentation to principles and practice Coq syntax

a Coq retrospective at the heart of Coq architecture the genesis of version 7.0

The Coq proof assistant : principles and practice J.-F. Monin Universit Grenoble Alpes 2016

A Separation Logic for Non-determinism and Sequence Points in C Formalized in Coq Robbert

Impredicativity in Coq Yotam Dvir Tel-Aviv University 2019-11-20 Today 1. What is

Designing a state transaction machine for Coq Bruno Barras &amp; Enrico Tassi 12 Aug 2012

Learning to Format Coq Code Using Language Models Pengyu Nie 1 , Karl Palmskog 2 , Junyi Jessy Li

Learning to Format Coq Code Using Language Models Pengyu Nie 1 , Karl Palmskog 2 , Junyi Jessy Li

The Coq proof assistant : More on Prop and principles and practice Set J.-F. Monin Universit

The Coq proof assistant : inductive predicate principles and practice Well-founded induction

The Dual Simplex Method Combinatorial Problem Solving (CPS) Javier Larrosa Albert Oliveras

Outline Introduction to Parsing Regular languages revisited Ambiguity and Syntax Errors

Ubiquitous Computing Spring 2010 - Making Sense of Sensing

Arithmetic and Inference in a Large Theory Adam Pease, Infosys, Foothill Research Center

Taaltheorie en Taalverwerking BSc Artificial Intelligence Raquel Fernndez Institute for Logic,

Automatically Annotating Text with Linked Open Data Delia Rusu , Bla Fortuna, Dunja Mladeni

EQUAL Encyclopaedic QA for Lists Iustin Dornescu Research Group in Computational Linguistics,

Action recognition Cordelia Schmid INRIA Grenoble Action recognition examples Short

Designing a state transaction machine for Coq Bruno Barras & Enrico Tassi 12 Aug 2012