Programming language semantics Paul Jackson School of Informatics University of Edinburgh Formal Verification Spring 2018

Using maths to verify software
◮ First need ability to construct mathematical models of programs
  ◮ Highly non-trivial: most programming languages are complex and have no formal description
  ◮ Particularly difficult when handling concurrency
  ◮ Most focus on functional behaviour. Only a few handle performance
◮ To enable the automation of proof we then need systematic recipes for carrying out proof
◮ Notion of proof is broad: it might involve
  ◮ Applying rules of a program calculus
  ◮ Computing data-structures (e.g. BDDs in symbolic model checking)

Common themes in FV approaches
◮ A frequent theme is reducing program correctness to checking validity of formulas in propositional or first-order logic.
◮ This enables use of automated theorem prover technology
  ◮ SAT solvers in bounded model checking
  ◮ SMT solvers with the weakest precondition approach taken by the SPARK FV tools
◮ Just as with compilers, another theme is first translating to a simpler intermediate program language before engaging the core generation of logical formulas.
  ◮ The SPARK GNATprove tool uses the front end of the GNAT compiler.

IMP - a toy imperative programming language
◮ Numbers N: m, n ::= ... | −1 | 0 | 1 | 2 | ...
◮ Variables Var: x, y
◮ Arithmetic expressions Aexp: a ::= n | x | a₀ + a₁ | a₀ − a₁ | a₀ × a₁
◮ Boolean expressions Bexp: b ::= true | false | a₀ = a₁ | a₀ ≤ a₁ | ¬b | b₀ ∧ b₁ | b₀ ∨ b₁
◮ Commands Com: c ::= skip | x := a | c₀ ; c₁ | if b then c₀ else c₁ | while b do c
This is abstract syntax, ignoring parentheses.

Operational semantics
◮ Define the set of states Σ as all functions σ : Var → N
◮ Use relations to define how
  ◮ expressions evaluate to values in a given state
  ◮ commands execute, changing the program state.

Evaluation of arithmetic expressions
Use a 3-place relation ⟨a, σ⟩ → n, where a is an arithmetic expression, σ the current state and n the value of the expression.
The relation is defined in a syntax-directed way:

    ⟨n, σ⟩ → n

    ⟨x, σ⟩ → σ(x)

    ⟨a₀, σ⟩ → n₀    ⟨a₁, σ⟩ → n₁
    ------------------------------  where n is n₀ + n₁
    ⟨a₀ + a₁, σ⟩ → n

A relation for Boolean expressions can be defined similarly.
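The syntax-directed rules read directly as a recursive evaluator. The following is a minimal Python sketch, not part of the original slides: the tuple encoding of expressions and the name eval_aexp are assumptions made for illustration, and a state σ is approximated by a dict from variable names to integers.

```python
# Arithmetic expressions as nested tuples (an assumed encoding, not from the slides):
#   ("num", n) | ("var", x) | ("add", a0, a1) | ("sub", a0, a1) | ("mul", a0, a1)

def eval_aexp(a, sigma):
    """Return the n with <a, sigma> -> n; sigma maps variable names to ints."""
    tag = a[0]
    if tag == "num":                 # <n, sigma> -> n
        return a[1]
    if tag == "var":                 # <x, sigma> -> sigma(x)
        return sigma[a[1]]
    # Binary operators: evaluate both premises, then combine the values.
    n0 = eval_aexp(a[1], sigma)
    n1 = eval_aexp(a[2], sigma)
    if tag == "add":
        return n0 + n1
    if tag == "sub":
        return n0 - n1
    if tag == "mul":
        return n0 * n1
    raise ValueError(f"unknown arithmetic expression: {a!r}")

# (x + 2) × y evaluated in the state {x ↦ 3, y ↦ 4}
expr = ("mul", ("add", ("var", "x"), ("num", 2)), ("var", "y"))
result = eval_aexp(expr, {"x": 3, "y": 4})
print(result)
```

Each branch of the function corresponds to one rule, so there is exactly one applicable case per expression form.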

Big-step operational semantics for IMP
Relation ⟨c, σ⟩ → σ′ expresses that command c executed in initial state σ terminates in final state σ′.

    ⟨skip, σ⟩ → σ

    ⟨a, σ⟩ → m
    ------------------------
    ⟨x := a, σ⟩ → σ[m/x]

    ⟨c₀, σ⟩ → σ″    ⟨c₁, σ″⟩ → σ′
    ------------------------------
    ⟨c₀ ; c₁, σ⟩ → σ′

Big-step operational semantics cont.

    ⟨b, σ⟩ → true    ⟨c₀, σ⟩ → σ′
    --------------------------------
    ⟨if b then c₀ else c₁, σ⟩ → σ′

    ⟨b, σ⟩ → false    ⟨c₁, σ⟩ → σ′
    --------------------------------
    ⟨if b then c₀ else c₁, σ⟩ → σ′

    ⟨b, σ⟩ → false
    ------------------------
    ⟨while b do c, σ⟩ → σ

    ⟨b, σ⟩ → true    ⟨c, σ⟩ → σ″    ⟨while b do c, σ″⟩ → σ′
    --------------------------------------------------------
    ⟨while b do c, σ⟩ → σ′
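The big-step rules for commands can be sketched as a small interpreter. This is an illustrative Python sketch under stated assumptions: commands are encoded as tuples, expressions are represented by Python functions on states (standing in for the evaluation relations), and the name run is hypothetical. If a command diverges, no final state exists and the function does not return.

```python
# Commands as nested tuples (an assumed encoding, not from the slides):
#   ("skip",) | ("assign", x, a) | ("seq", c0, c1)
#   | ("if", b, c0, c1) | ("while", b, c)
# a and b are Python functions from a state (dict) to an int / bool.

def run(c, sigma):
    """Return the sigma' with <c, sigma> -> sigma'."""
    tag = c[0]
    if tag == "skip":
        return sigma
    if tag == "assign":
        _, x, a = c
        return {**sigma, x: a(sigma)}            # sigma[m/x], m the value of a
    if tag == "seq":
        _, c0, c1 = c
        return run(c1, run(c0, sigma))           # thread sigma'' through
    if tag == "if":
        _, b, c0, c1 = c
        return run(c0 if b(sigma) else c1, sigma)
    if tag == "while":
        _, b, body = c
        # Repeatedly apply the "true" rule, then finish with the "false" rule.
        while b(sigma):
            sigma = run(body, sigma)
        return sigma
    raise ValueError(f"unknown command: {c!r}")

# r := 1 ; while r × r ≤ n do r := r + 1
program = ("seq",
           ("assign", "r", lambda s: 1),
           ("while", lambda s: s["r"] * s["r"] <= s["n"],
            ("assign", "r", lambda s: s["r"] + 1)))
final = run(program, {"n": 10})
print(final)
```

The while case uses a Python loop rather than recursion only to avoid deep call stacks; it computes the same final state as unfolding the two while rules.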

Program specifications
A basic way of specifying desired program behaviour is using preconditions and postconditions. We commonly write

    {P} c {Q}

to express that if program c is started in a state satisfying precondition P and if it terminates, it will terminate in a state satisfying postcondition Q.
{P} c {Q} is known as a Hoare triple. It can be defined semantically in terms of the big-step operational semantics relation:

    ⊨ {P} c {Q}  ≐  for all σ, σ′ ∈ Σ, if σ ⊨ P and ⟨c, σ⟩ → σ′ then σ′ ⊨ Q

Doing proofs directly with the execution relation → is tedious.
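The semantic definition can be tried out on a finite set of states. The sketch below is only a bounded test, not a proof: it assumes the command is given as a terminating Python function from states to states, and the name triple_holds_on and the example predicates are hypothetical.

```python
def triple_holds_on(P, c, Q, states):
    """Bounded check of |= {P} c {Q}: every listed state satisfying P
    must be mapped by c to a state satisfying Q.
    Assumes c terminates on every listed state."""
    return all(Q(c(s)) for s in states if P(s))

def incr(s):            # x := x + 1
    return {**s, "x": s["x"] + 1}

def decr(s):            # x := x − 1
    return {**s, "x": s["x"] - 1}

def nonneg(s):          # 0 ≤ x
    return 0 <= s["x"]

def positive(s):        # 1 ≤ x
    return 1 <= s["x"]

states = [{"x": v} for v in range(-5, 6)]
ok = triple_holds_on(nonneg, incr, positive, states)   # {0 ≤ x} x := x + 1 {1 ≤ x}
bad = triple_holds_on(nonneg, decr, nonneg, states)    # {0 ≤ x} x := x − 1 {0 ≤ x}
print(ok, bad)
```

The second triple fails on the state with x = 0, which is the kind of counterexample the definition rules out.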

Hoare logics
An alternative to reasoning directly with the execution relation is using a calculus with Hoare triples. An example rule:

    {P} c₀ {R}    {R} c₁ {Q}
    --------------------------
    {P} c₀ ; c₁ {Q}

Such calculi are known as Hoare logics.
Hoare logics can be good for paper proofs and proofs using an interactive theorem prover, but are not the best for automation. In the above rule, what is a recipe for R?
Weakest pre-condition based approaches are better.

Weakest pre-condition
The weakest pre-condition function WP(·, ·) can be defined semantically:

    WP(c, Q)  ≐  { σ | for all σ′, if ⟨c, σ⟩ → σ′ then σ′ ⊨ Q }

where we identify predicates with the sets of states that satisfy them.
WP(·, ·) is closely related to Hoare triples. We have

    (for all σ, if σ ⊨ P then σ ∈ WP(c, Q))  iff  ⊨ {P} c {Q}

and in particular

    ⊨ {WP(c, Q)} c {Q}

WP(c, Q) is indeed the weakest pre-condition of c and Q.

How weakest pre-conditions can be used for verification
If we can compute WP(c, Q) as a formula, given a formula for Q, then proving the predicate logic formula

    ∀x̄. P ⇒ WP(c, Q)

is sufficient for establishing {P} c {Q}.
Here
◮ The ∀x̄ is a quantification over all the variables in Var: the syntactic equivalent of quantifying over all states
◮ ∀x̄. P ⇒ WP(c, Q) is called a verification condition or VC

Weakest precondition equations

    WP(skip, Q) = Q
    WP(x := a, Q) = Q[x ↦ a]
    WP(c₀ ; c₁, Q) = WP(c₀, WP(c₁, Q))
    WP(if b then c₀ else c₁, Q) = (b ⇒ WP(c₀, Q)) ∧ (¬b ⇒ WP(c₁, Q))
    WP(while b do c, Q) = (b ⇒ WP(c ; while b do c, Q)) ∧ (¬b ⇒ Q)

Here the left and right hand sides of the equations are now Boolean expressions in the program variables.
Given a formula Q and a command c without while loops, the equations specify how to compute WP(c, Q) as a formula.
If c has while loops, the computation would not terminate.
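For loop-free commands the equations can be run as a predicate transformer. The sketch below is an illustration under assumptions, not the slides' formula-based computation: predicates are represented as Python functions from states to booleans (so substitution Q[x ↦ a] becomes evaluating Q in the updated state), commands use an assumed tuple encoding, and the name wp is hypothetical.

```python
# Commands (assumed encoding): ("skip",) | ("assign", x, a) | ("seq", c0, c1)
#                              | ("if", b, c0, c1)
# a, b and all predicates are Python functions on states (dicts).

def wp(c, Q):
    """Weakest pre-condition of a loop-free command c with respect to Q."""
    tag = c[0]
    if tag == "skip":
        return Q
    if tag == "assign":
        _, x, a = c
        # Q[x ↦ a]: Q holds after x is given the value of a.
        return lambda s: Q({**s, x: a(s)})
    if tag == "seq":
        _, c0, c1 = c
        return wp(c0, wp(c1, Q))
    if tag == "if":
        _, b, c0, c1 = c
        w0, w1 = wp(c0, Q), wp(c1, Q)
        # (b ⇒ WP(c0, Q)) ∧ (¬b ⇒ WP(c1, Q))
        return lambda s: (not b(s) or w0(s)) and (b(s) or w1(s))
    raise ValueError("only loop-free commands are supported")

# WP(x := x + 1, x ≤ 5) should behave like x + 1 ≤ 5.
pre_incr = wp(("assign", "x", lambda s: s["x"] + 1), lambda s: s["x"] <= 5)

# WP(if x ≤ 0 then y := 0 − x else y := x, 0 ≤ y) should hold in every state.
abs_cmd = ("if", lambda s: s["x"] <= 0,
           ("assign", "y", lambda s: 0 - s["x"]),
           ("assign", "y", lambda s: s["x"]))
pre_abs = wp(abs_cmd, lambda s: 0 <= s["y"])
print(pre_incr({"x": 4}), pre_incr({"x": 5}))
```

There is deliberately no case for while: applying the while equation would unfold the loop forever, which is the issue loop invariants address.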

Addressing the loop issue
Rough idea:
1. Add a loop invariant assertion to every loop of a program c
   ◮ These assertions cut the control flow of c into loop-free segments
2. Show {P} c {Q} by showing {P′} c′ {Q′} for each segment c′ making up c.
   ◮ Each P′ is either P or a loop invariant.
   ◮ Each Q′ is either a loop invariant or Q.
3. Show {P′} c′ {Q′} by proving ∀x̄. P′ ⇒ WP(c′, Q′)
A detail: Segments might have multiple initial and final points. Must check {P′} c″ {Q′} for each path c″ in segment c′.

Program segments
To express segments, we need a new command that assumes a Boolean expression A:

    assume A

with

    ⟨b, σ⟩ → true
    ----------------------
    ⟨assume b, σ⟩ → σ

    WP(assume A, Q) = A ⇒ Q

A while loop with invariant I,

    {I} while b do c

has
◮ I terminating the segment for the code before the loop
◮ a segment assume b ; c starting and ending with I
◮ a segment assume ¬b starting with I and continuing with the code after the loop

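A program and its control flow graph

    {P}
    r := 1 ;
    if n > 0 then
      {I} while r × r ≤ n do r := r + 1
    else
      skip
    {Q}

[Figure: control flow graph of the program. From P, the edge r := 1 leads to a branch. The edge n > 0 leads to I; the edges ¬(n > 0) and then skip lead to Q. From I, the edges r × r ≤ n and then r := r + 1 lead back to I; the edge ¬(r × r ≤ n) leads to Q.]
where assume b is abbreviated to b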

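Splitting control flow graph into segments
Control flow graph with cycle for loop:
[Figure: the control flow graph of the program, with the cycle I, r × r ≤ n, r := r + 1, I.]
Splitting at loop invariant I yields acyclic segments:
[Figure: two acyclic segments. The first runs from I through r × r ≤ n and r := r + 1 back to I. The second has entry points P and I and exit points I and Q: from P through r := 1, then either n > 0 to I or ¬(n > 0) and skip to Q; and from I through ¬(r × r ≤ n) to Q.]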

Enumerating paths of each segment
With the segments obtained by splitting at I, the paths are:
◮ I ; r × r ≤ n ; r := r + 1 ; I
◮ P ; r := 1 ; n > 0 ; I
◮ I ; ¬(r × r ≤ n) ; Q
◮ P ; r := 1 ; ¬(n > 0) ; skip ; Q

VC generation
Define two functions Pre(·, ·) and VC(·, ·).
Pre(c, Q) is like WP(c, Q) except it only computes WP(c, Q) for the start segment of c.

    Pre(skip, Q) = Q
    Pre(x := a, Q) = Q[x ↦ a]
    Pre(c₀ ; c₁, Q) = Pre(c₀, Pre(c₁, Q))
    Pre(if b then c₀ else c₁, Q) = (b ⇒ Pre(c₀, Q)) ∧ (¬b ⇒ Pre(c₁, Q))
    Pre({I} while b do c, Q) = I

VC generation cont.
VC(c, Q) computes VCs for all but the start segment of c.

    VC(skip, Q) = true
    VC(x := a, Q) = true
    VC(c₀ ; c₁, Q) = VC(c₀, Pre(c₁, Q)) ∧ VC(c₁, Q)
    VC(if b then c₀ else c₁, Q) = VC(c₀, Q) ∧ VC(c₁, Q)
    VC({I} while b do c, Q) = (I ∧ b ⇒ Pre(c, I)) ∧ (I ∧ ¬b ⇒ Q) ∧ VC(c, I)

The conjunct VC(c, I) covers loops nested inside the loop body c.
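Pre and VC can be sketched in the same style as a predicate-based WP. The Python below is an illustration under assumptions rather than the slides' formula generator: predicates are functions on states, annotated loops are encoded as ("while", I, b, c), and validity is only tested over a small finite range of states, so it is a bounded check and not a proof. The loop case also conjoins vc(body, I), as in the Concrete Semantics formulation, so that invariants of nested loops are checked too. The names pre, vc, valid_on and the invariants tried are hypothetical.

```python
from itertools import product

def pre(c, Q):
    tag = c[0]
    if tag == "skip":
        return Q
    if tag == "assign":
        _, x, a = c
        return lambda s: Q({**s, x: a(s)})          # Q[x ↦ a]
    if tag == "seq":
        _, c0, c1 = c
        return pre(c0, pre(c1, Q))
    if tag == "if":
        _, b, c0, c1 = c
        p0, p1 = pre(c0, Q), pre(c1, Q)
        return lambda s: (not b(s) or p0(s)) and (b(s) or p1(s))
    if tag == "while":                              # ("while", I, b, body)
        return c[1]                                 # Pre of a loop is its invariant
    raise ValueError(f"unknown command: {c!r}")

def vc(c, Q):
    tag = c[0]
    if tag in ("skip", "assign"):
        return lambda s: True
    if tag == "seq":
        _, c0, c1 = c
        v0, v1 = vc(c0, pre(c1, Q)), vc(c1, Q)
        return lambda s: v0(s) and v1(s)
    if tag == "if":
        _, b, c0, c1 = c
        v0, v1 = vc(c0, Q), vc(c1, Q)
        return lambda s: v0(s) and v1(s)
    if tag == "while":
        _, inv, b, body = c
        p, v = pre(body, inv), vc(body, inv)
        return lambda s: ((not (inv(s) and b(s)) or p(s))           # I ∧ b ⇒ Pre(c, I)
                          and (not (inv(s) and not b(s)) or Q(s))   # I ∧ ¬b ⇒ Q
                          and v(s))                                 # nested loops
    raise ValueError(f"unknown command: {c!r}")

def valid_on(P, c, Q, states):
    """Bounded test of (P ⇒ Pre(c, Q)) ∧ VC(c, Q) on the given states."""
    p, v = pre(c, Q), vc(c, Q)
    return all((not P(s) or p(s)) and v(s) for s in states)

def make_program(inv):
    # r := 1 ; if n > 0 then {inv} while r × r ≤ n do r := r + 1 else skip
    loop = ("while", inv, lambda s: s["r"] * s["r"] <= s["n"],
            ("assign", "r", lambda s: s["r"] + 1))
    return ("seq", ("assign", "r", lambda s: 1),
            ("if", lambda s: s["n"] > 0, loop, ("skip",)))

def true(s):
    return True

def post(s):      # assumed Q: r² > n ∧ (n > 0 ⇒ (r − 1)² ≤ n)
    return s["r"] ** 2 > s["n"] and (s["n"] <= 0 or (s["r"] - 1) ** 2 <= s["n"])

def good_inv(s):  # assumed I: r ≥ 1 ∧ (r − 1)² ≤ n
    return s["r"] >= 1 and (s["r"] - 1) ** 2 <= s["n"]

def bad_inv(s):   # r² ≤ n is not preserved by the loop body
    return s["r"] ** 2 <= s["n"]

states = [{"n": n, "r": r} for n, r in product(range(-5, 6), repeat=2)]
good = valid_on(true, make_program(good_inv), post, states)
bad = valid_on(true, make_program(bad_inv), post, states)
print(good, bad)
```

In a real tool the conditions would be emitted as formulas and discharged by an SMT solver; the enumeration here only makes the structure of the conditions visible.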

Soundness of VC generation

    If ⊨ ∀x̄. (P ⇒ Pre(c, Q)) ∧ VC(c, Q) then ⊨ {P} c {Q}

Further reading
See Concrete Semantics by Nipkow and Klein, http://www.concrete-semantics.org
◮ Section 7.1 on IMP language
◮ Section 7.2 on big-step semantics
◮ Section 12.4 on VC generation
