Formal verification of a compiler front-end for mini-ML Zaynah - PowerPoint PPT Presentation

Formal verification of a compiler front-end for mini-ML Zaynah Dargaye, Xavier Leroy, Andrew Tolmach INRIA Paris-Rocquencourt WG 2.8, july 2007 Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 1 / 34

Formal verification of compilers Apply formal methods to a compiler. Prove a semantic preservation property: Theorem For all source codes S, if the compiler generates machine code C from source S, without reporting a compilation error, and if S has well-defined semantics, then C has well-defined semantics and S and C have the same observable behaviour. Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 2 / 34

Formal verification of compilers Motivations: Useful for high-assurance software, verified (at the source level) using formal methods. A challenge for mechanized program proof. For fun! (compilers + pure F.P. + mechanized proof, all in one easy-to-explain project). Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 3 / 34

The Compcert effort INRIA/CNAM/Paris 7, since 2003 Develop and prove correct a realistic compiler, usable for critical embedded software. Source language: a subset of C. Target language: PowerPC assembly. Generates reasonably compact and fast code ⇒ some optimizations. This is “software-proof codesign” (as opposed to proving an existing compiler). We use the Coq proof assistant to conduct the proof of semantic preservation and to write most of the compiler. Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 4 / 34

The Compcert effort – status A prototype compiler that executes (under MacOS X). From Clight AST to PowerPC assembly AST: entirely verified in Coq (40000 lines); entirely programmed in Coq, then automatically extracted to executable Caml code. Uses monads, persistent data structures, etc. Performances of generated code: better than gcc -O0 , close to gcc -O1 . Compilation times: comparable to those of gcc -O1 . References: X. Leroy, POPL 2006 (back-end); S. Blazy, Z. Dargaye, X. Leroy, Formal Methods 2006 (C front-end). Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 5 / 34

Front-ends for other source languages Clight Cminor PPC Cminor could be a reasonable I.L. for other source languages. Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 7 / 34

A flavor of Cminor "quicksort"(lo, hi, a): int -> int -> int -> void { var i, j, pivot, temp; if (! (lo < hi)) return; i = lo; j = hi; pivot = int32[a + hi * 4]; block { loop { if (! (i < j)) exit; block { loop { if (i >= hi || int32[a + i * 4] > pivot) exit; i = i + 1; } } /* ... */ } } temp = int32[a + i * 4]; int32[a + i * 4] = int32[a + hi * 4]; int32[a + hi * 4] = temp; "quicksort"(lo, i - 1, a) : int -> int -> int -> void; tailcall "quicksort"(i + 1, hi, a) : int -> int -> int -> void; } Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 9 / 34

Front-ends for other source languages Clight Cminor PPC Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 11 / 34

Front-ends for other source languages Clight Cminor PPC Reactive language? Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 11 / 34

Front-ends for other source languages Mini-ML Clight Cminor PPC Reactive language? Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 11 / 34

Front-ends for other source languages Coq specs Mini-ML Clight Cminor PPC Reactive language? Towards a trusted execution path for programs written and proved in Coq. This includes the Compcert compiler itself . . . (bootstrap!) Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 11 / 34

mini-ML: syntax Pure, call-by-value, datatypes + shallow pattern matching. Terms: a ::= n variable (de Bruijn) | λ. a | a 1 a 2 | µ.λ. a recursive function | let a 1 in a 2 | C ( a 1 , . . . , a n ) data constructor | match a with p 1 → a 1 . . . p n → a n p ::= C n Patterns: i.e. C ( n , . . . , 1) Also: constants and arithmetic operators. More or less the output language for Coq’s extraction, minus mutually-recursive functions. Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 12 / 34

mini-ML: dynamic semantics Big-step operational semantics with environments e ⊢ a ⇒ v with v ::= C ( v 1 , . . . , v n ) | ( λ. a )[ e ] | ( µ.λ. a )[ e ] and e = v 1 . . . v n . Entirely standard. Big-step semantics with substitutions also used in some of the proofs. Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 14 / 34

mini-ML: (no) type system Our Mini-ML is untyped: Makes it easier to translate various typed F.P.L. to mini-ML, e.g. Coq with its extremely powerful type system. We are doing semantic-preserving compilation, which subsumes all the guarantees that type-preserving compilation provides. Exception: we demand that constructors are grouped into “datatype declarations” to facilitate pattern-matching compilation (see example). Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 15 / 34

Example of mini-ML type list = Nil | Cons program let map = µ map. λ x. match x with | Nil -> Nil | Cons(hd, tl) -> Cons(f hd, map f tl) in map ( λ x. Cons(x, Nil)) Nil Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 17 / 34

Overview of the compiler w/ numbered uncurrying w/ n -ary pat-match mini-ML “compilation” constructors functions closure CPS conversion conversion w/ closed, Cminor g n i m a n toplevel fns s o t NQANF o r f o Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 18 / 34

Uncurrying let -bound curried functions are turned into n -ary functions. let f = λ x. λ y. ... in Pair(f 1 2, f 1) ⇓ let f = λ (x, y). ... in Pair(f(1, 2), (( λ x. λ y.f(x,y))(1))) Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 20 / 34

Generation of Cminor code Quite straightforward if Cminor had dynamic memory allocation with garbage collection. (Mostly, represent constructor applications and closures as pointers to appropriately-filled memory blocks.) But Cminor has no memory allocator, no GC, and no run-time system of any kind. . . Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 21 / 34

Run-time systems: the bane of high-level languages Run-time systems are big (e.g. 50000 lines), messy, written in C, system-dependent, often buggy, . . . Yet, the run-time system must be proved correct in the context of a verified compiler for a high-level language. Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 22 / 34

What needs to be done For the memory allocator and (tracing) garbage collector: The algorithms must be proved correct. (Mostly routine.) The actual implementation (typically in Cminor) must be proved correct. (Painful, like all proofs of imperative programs.) This proof must be connected to that of the compiler: Compiler-generated code must respect GC contract (Data representation conventions, don’t touch block headers, etc) GC must be able to find the memory roots (among the compiler-managed registers, call stack, etc) Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 23 / 34

Example: finding roots using frame descriptors Registers size=8 root #roots=1 r0=stk4 hash size=20 root Heap #roots=3 code addr r0=reg4 r1=stk8 hash root r2=stk12 root Frame code addr descriptors Stack Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 24 / 34

Formal verification of a compiler front-end for mini-ML Zaynah - PowerPoint PPT Presentation

Formal verification of a compiler front-end for mini-ML Zaynah Dargaye, Xavier Leroy, Andrew Tolmach INRIA Paris-Rocquencourt WG 2.8, july 2007 Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 1 / 34

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

SAE MINI BAJA Front & Rear End Rear End: Jacob Ruiz Front End: Will Preston Lucas Cramer

Formal Verification of RISC-V cores with riscv-formal Clifford Wolf CTO, Symbiotic EDA

COURSE OVERVIEW WEB SKILL SETS Front-End Back-End Design Front-End Back-End MY BLOG HTTP

COURSE OVERVIEW WEB SKILL SETS Front end Back end Design Front-End Back-End MY BLOG HTTP

Compiler verification for fun and profit Xavier Leroy Inria Paris-Rocquencourt FMCAD,

MINI OPENDRIVE 1 MINI MINI OPENDRIVE EXP OPENDRIVE EXP Experience, eXpertise, Performance The

Is End-to-End Integrity Verification Really End- to-End? Ahmed Alhussen, Batyr Charyyev, and Engin

Model-Checking Acknowledgment Formal Verification Formal verification means to apply

Front-end RESTful Back- end Connection Connecting AngularJS Front-end with RESTful Express-Node

Formal verification of an optimizing compiler or: a software-proof codesign approach to the

Optimizing Front End Checkout Merchandising Maximizing Shopper Interaction In A New Era Of

Baja SAE Preliminary Design Front End + Rear End Project Description Front/Rear End SAE

A GENERAL-PURPOSE A GENERAL-PURPOSE CRN-TO-DSD COMPILER FRAMEWORK CRN-TO-DSD COMPILER FRAMEWORK

DIVS DL/ID Verification Systems Verification of Legal Status DIVS Passport Verification

Formal Verification Methods 5: Floating Point Verification John Harrison Intel Corporation

Data Centers & Co-designed Distributed Systems A Data Center Inside a Data Center Data

CEKgo extensions M ::= . . . | go M | here M F ::= ( W ) | ( M E ) |

A model for the extended predicative Mahlo Universe Anton Setzer (joint work with Reinhard Kahle,

New Impossibility Results for Concurrent Composition and a Non-Interactive Completeness Theorem

Modeling Resource-Coupled Computations MarkHereld Computa0onIns0tute

MATH 12002 - CALCULUS I 1.4: Limit Laws Professor Donald L. White Department of Mathematical

Average - case Lower Bounds for Approximate Near - Neighbor fs om Isoperimetric Inequalities

Today Closed World Assumption & Negation as Failure. Clark completion Lloyd-Topor

Sambuz

Useful Links

Newsletter

Mail Us

Formal verification of a compiler front-end for mini-ML Zaynah - PowerPoint PPT Presentation

Formal verification of a compiler front-end for mini-ML Zaynah Dargaye, Xavier Leroy, Andrew Tolmach INRIA Paris-Rocquencourt WG 2.8, july 2007 Dargaye, Leroy, Tolmach (INRIA) Verification of a mini-ML compiler WG 2.8, july 2007 1 / 34

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

SAE MINI BAJA Front &amp; Rear End Rear End: Jacob Ruiz Front End: Will Preston Lucas Cramer

Formal Verification of RISC-V cores with riscv-formal Clifford Wolf CTO, Symbiotic EDA

COURSE OVERVIEW WEB SKILL SETS Front-End Back-End Design Front-End Back-End MY BLOG HTTP

COURSE OVERVIEW WEB SKILL SETS Front end Back end Design Front-End Back-End MY BLOG HTTP

Compiler verification for fun and profit Xavier Leroy Inria Paris-Rocquencourt FMCAD,

MINI OPENDRIVE 1 MINI MINI OPENDRIVE EXP OPENDRIVE EXP Experience, eXpertise, Performance The

Is End-to-End Integrity Verification Really End- to-End? Ahmed Alhussen, Batyr Charyyev, and Engin

Model-Checking Acknowledgment Formal Verification Formal verification means to apply

Front-end RESTful Back- end Connection Connecting AngularJS Front-end with RESTful Express-Node

Formal verification of an optimizing compiler or: a software-proof codesign approach to the

Optimizing Front End Checkout Merchandising Maximizing Shopper Interaction In A New Era Of

Baja SAE Preliminary Design Front End + Rear End Project Description Front/Rear End SAE

A GENERAL-PURPOSE A GENERAL-PURPOSE CRN-TO-DSD COMPILER FRAMEWORK CRN-TO-DSD COMPILER FRAMEWORK

DIVS DL/ID Verification Systems Verification of Legal Status DIVS Passport Verification

Formal Verification Methods 5: Floating Point Verification John Harrison Intel Corporation

Data Centers &amp; Co-designed Distributed Systems A Data Center Inside a Data Center Data

CEKgo extensions M ::= . . . | go M | here M F ::= ( W ) | ( M E ) |

A model for the extended predicative Mahlo Universe Anton Setzer (joint work with Reinhard Kahle,

New Impossibility Results for Concurrent Composition and a Non-Interactive Completeness Theorem

Modeling Resource-Coupled Computations MarkHereld Computa0onIns0tute

MATH 12002 - CALCULUS I 1.4: Limit Laws Professor Donald L. White Department of Mathematical

Average - case Lower Bounds for Approximate Near - Neighbor fs om Isoperimetric Inequalities

Today Closed World Assumption &amp; Negation as Failure. Clark completion Lloyd-Topor

Sambuz

Useful Links

Newsletter

Mail Us

SAE MINI BAJA Front & Rear End Rear End: Jacob Ruiz Front End: Will Preston Lucas Cramer

Data Centers & Co-designed Distributed Systems A Data Center Inside a Data Center Data

Today Closed World Assumption & Negation as Failure. Clark completion Lloyd-Topor