Making TLA + Model Checking Symbolic Igor Konnov Joining Interchain - PowerPoint PPT Presentation

Making TLA + Model Checking Symbolic Igor Konnov Joining Interchain Foundation in August Jure Kukovec Thanh-Hai Tran VeriDis + Matryoushka seminar, Amsterdam, June 2019

Why TLA + ? Rich specification language TLA + is used in industry, e.g., TLA + tools maintained at and - an interactive proof system (TLAPS) - a model checker (TLC), state enumeration Raft Paxos (Synod), Egalitarian Paxos, Flexible Paxos Apache Kafka several bugs found Igor Konnov 3 of 46

TLA + First-order logic with sets (ZFC) Rich expression syntax: - operations on sets, functions, tuples, records, sequences Temporal operators: - ✷ (always), ✸ (eventually), ❀ (leads-to), no Nexttime Practice: safety properties, ✷ Invariant Igor Konnov 4 of 46

APALACHE-MC 0.5.0 github.com/konnov/apalache Symbolic model checker that works under the assumptions of TLC: Fixed and finite constants (parameters) Finite sets, function domains and co-domains TLC’s restrictions on formula structure Bounded model checking to check safety As few language restrictions as possible Technically, QF_UFNIA Quantifier-free formulas in SMT: ∀ x ∈ S : P as � P [ c / x ] Unfolding quantified expressions: c ∈ S Igor Konnov 6 of 46

an example

A service for reliable broadcast one process broadcasts a message bcast unforgeability : if no correct process received bcast , 000 . . . 0 then no correct process ever accepts bcast correctness : if all correct processes received bcast , 111 . . . 1 then some correct process eventually accepts bcast relay : if a correct process accepts bcast , 011 . . . 1 then all correct processes eventually accept bcast Igor Konnov 8 of 46

Reliable broadcast by Srikanth & Toueg 87 ✞ ☎ local myval i ∈ { 0 , 1 } -- did process i receive bcast ? while true do i f myval i = 1 and not sent ECHO before then send ECHO to all i f received ECHO from at least n-2t distinct processes and not sent ECHO before then send ECHO to all i f received ECHO from at least n - t distinct processes then accept od ✝ ✆ resilience: of n > 3 t processes, f ≤ t processes are Byzantine Igor Konnov 9 of 46

How to check its properties? I read that paper about Byzantine Model Checker Model the algorithm as a threshold automaton Verify safety and liveness for all n , t , f : n > 3 t ∧ t ≥ f ≥ 0 I have heard this talk by Leslie Lamport Let’s write it in TLA + Run the TLC model checker for fixed parameters Igor Konnov 10 of 46

Declaration and initialization EXTENDS Integers , FiniteSets △ △ △ N = 12 T = 3 F = 3 △ △ Corr = 1 . . (N − F − 1) Faulty = (N − F) . . N VARIABLES pc , rcvd , sent △ Init = ∧ pc ∈ [ Corr → { “ V 0 “ , “ V 1 “ } ] some processes receive the broadcast ∧ sent = {} no messages sent initially ∧ rcvd ∈ [ Corr → {} ] no messages received initially

Transition relation △ Next = ∃ p ∈ Corr : ∧ Receive ( p ) ∧ ∨ UponV 1 ( p ) ∨ UponNonFaulty ( p ) ∨ UponAccept ( p ) ∨ UNCHANGED � pc , sent � △ Receive ( p ) = ∃ newMessages ∈ SUBSET ( sent ∪ Faulty ) : rcvd ′ = [ rcvd EXCEPT ![ self ] = rcvd [ p ] ∪ newMessages ]

Actions △ UponV1( p ) = ∧ pc [ p ] = “V1” ∧ pc ′ = [ pc EXCEPT ![ p ] = “SE” ] sent ′ = sent ∪ { p } ∧ △ UponNonFaulty ( p ) = Cardinality ( rcvd ′ [ p ]) > = N − 2 ∗ T ∧ pc [ p ] ∈ { “V0” , “V1” } ∧ ∧ pc ′ = [ pc EXCEPT ![ p ] = “SE” ] sent ′ = sent ∪ { p } ∧ △ UponAccept ( p ) = Cardinality ( rcvd ′ [ p ]) > = N − T ∧ pc [ p ] ∈ { “V0” , “V1” , “SE” } ∧ ∧ pc ′ = [ pc EXCEPT ![ p ] = “AC” ] ∧ sent ′ = sent ∪ ( IF pc [ p ] � = “SE” THEN { p } ELSE {} )

Safety? unforgeability : if no correct process received bcast , 000 . . . 0 then no correct process ever accepts bcast \* a non-inductive invariant △ Unforg = ∀ p ∈ Corr : pc [ p ] � = “AC” \* restricted initial states △ InitNoBcast = Init ∧ pc ∈ [ Corr → { “V0” } ] Check that every state reachable from InitNoBcast satisfies Unforg

Breaking unforgeability 12 processes, 4 faults n = 3 f APALACHE-MC : a counterexample in 5 minutes - 12K SMT constants, 34K SMT assertions depth 6 TLC : a counterexample after 2 hrs 21 min - 600M states depth 6

how does it work?

What is hard about TLA + ? Rich data sets of sets, functions, records, tuples, sequences No types TLA + is not a programming language No imperative statements like assignments TLA + is not a programming language No standard control flow TLA + is not a programming language Igor Konnov 18 of 46

Essential steps Assignments TLA + Flat TLA + SMT Reduction Types & symbolic specification specification rules (UF_NIA) transitions Extracting assignments and symbolic transitions treat some x ′ ∈ { . . . } as assignments similar to TLC Simple type inference propagate types at every step x : Int gives us { x } : Set [ Int ] Bounded model checking overapproximate the contents of data structures Igor Konnov 19 of 46

assignments & symbolic transitions

Symbolic transitions [Kukovec, K., Tran, ABZ’18] △ Next = ∃ p ∈ Corr : ∧ Receive ( p ) ∧ ∨ UponV 1 ( p ) ∨ UponNonFaulty ( p ) ∨ UponAccept ( p ) ∨ UNCHANGED � pc , sent � Automatically partitioning Next into four transitions: ∃ p ∈ Corr : ∃ p ∈ Corr : ∧ Receive ( p ) ∧ Receive ( p ) ∧ UponV 1 ( p ) ∧ UponNonFaulty ( p ) ∃ p ∈ Corr : ∃ p ∈ Corr : ∧ Receive ( p ) ∧ Receive ( p ) ∧ UponAccept ( p ) ∧ UNCHANGED � pc , sent � Igor Konnov 21 of 46

How does TLC find assignments? TLC detects assignments as it explores a formula: - from left to right: x ′ = 1 ∧ x ′ ∈ { 1 , 2 , 3 } - treating action-level disjunctions as non-deterministic choice x ′ = 1 ∨ x ′ = 2 ∧ x ′ ≥ 2 � � - expecting the same kind of assignments on all branches ( x ′ = 1 ∧ y ′ = 2 ) ∨ x ′ = 3 Igor Konnov 22 of 46

Finding symbolic assignments (with SMT) Looking for assignment strategies that: - cover every Boolean branch - have exactly one assignment per variable per branch - do not contain cyclic assignments ( y ′ = x ′ ∧ x ′ ∈ { 2 , 3 , y ′ } ) ∨ ( x ′ = 2 ∧ y ′ ∈ { x ′ } ) ∧ x ′ = 3 � � Sometimes, we do better than TLC (above) Sometimes, worse, e.g., when x = 0: x > 0 ∨ ( x ′ = x + 1 ∨ y ′ = x − 1 ) Definitions and the framework in: [Kukovec, K., Tran, ABZ’18] Igor Konnov 23 of 46

Simple types Igor Konnov 24 of 46

Types: scalars and functions Basic: constants: Const “a”, “hello” integers: Int -1, 1024 Booleans: Bool FALSE , TRUE Finite sets: Set [ τ ] Set [ Set [ Int ]] Function-like: functions: τ 1 → τ 2 Int → Bool tuples: τ 1 × · · · × τ n Int × Bool × ( Int → Int ) records: [ Const �→ τ 1 , . . . , Const �→ τ n ] [ “a” �→ Int , “b” �→ Bool ] sequences: Seq ( τ ) Seq [ Int ] Igor Konnov 25 of 46

Simple type inference Knowing the types at the current state Compute the types of the expressions and of the primed variables if X has type Set [ Int ] X ′ ∈ [ X → X ] has type Int → Int y in { y ∈ X : y > 0 } has type Int {} and �� are polymorphic constructors for sets and sequences hence, we ask the user to specify the type, e.g., {} < : { Int } records also require type annotations Igor Konnov 26 of 46

Bounded model checking Igor Konnov 27 of 46

Old recipe for bounded symbolic computations Two symbolic transitions that assign values to x △ Next = A ∨ B Translate TLA + expressions to SMT with some � · � state 0 state 1 state 2 . . . x ′ �→ a 1 x ′ �→ a 2 � Init � x �→ i 0 � A [ i 0 / x ] � � A [ c 1 / x ] � x ′ �→ b 1 x ′ �→ b 2 � B [ i 0 / x ] � � B [ c 1 / x ] � . . . � x ′ ∈ { a 1 , b 1 } � x ′ �→ c 1 � x ′ ∈ { a 2 , b 2 } � x ′ �→ c 2 Igor Konnov 28 of 46

Making TLA + Model Checking Symbolic Igor Konnov Joining Interchain - PowerPoint PPT Presentation

Making TLA + Model Checking Symbolic Igor Konnov Joining Interchain Foundation in August Jure Kukovec Thanh-Hai Tran VeriDis + Matryoushka seminar, Amsterdam, June 2019 Why TLA + ? Rich specification language TLA + is used in industry, e.g.,

BMCMT Bounded Model Checking of TLA + Specifications with SMT Jure Kukovec Igor Konnov Thanh

INTRODUCTION TO TLA + Presented by : Kevin Yeh What is TLA+ Specification Language for

Modelling and validating distributed systems with TLA+ Carla Ferreira 29th April 2019 TLA+

TLA+ Quinceaera FLoC TLA+ Workshop David Langworthy Microsoft Engineer 2003: WS-Transaction

Model Checking TLA+ Specifications Shiji Bijo shijib@ifi.uio.no Institutt for informatikk,

Lecture 5. Symbolic model checking. NuSMV and SPIN model checkers. User-friendly model checking

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Exposing Design Flaws in Shared-Clock Systems using TLA+ Russell Mull, Auxon Corporation TLA+

Using TLA+ Tianxiang Lu Stephan Merz Christoph Weidenbach TLA+ Workshop at FM2012, Paris

Objective Explain basic concepts of TLA + modeling systems: static and dynamic aspects

+ Proofs Harnessing SMT solvers for TLA Stephan Merz and Hern an Vanzetto + Workshop, Paris,

Symbolic Model Checking 10 20 States and Beyond Burch Clarke McMillan Dill Hwang Seminal

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

TLAPS : The TLA + Proof System Stephan Merz joint work with K. Chaudhuri, D. Cousineau, D.

Attacking Authentication Professor Larry Heimann Web Application Security Information Systems

Something with implementations Peter Schwabe June 23, 2016 PQCRYPTO Summer School on

A template attack against Verify PIN algorithms Hlne Le Bouder, Thierno Barry, Damien

The TLA + proof system Stephan Merz Kaustuv Chaudhuri, Damien Doligez, Leslie Lamport INRIA

Breakout Session Creating High-Quality Professional Learning: Enabling Choice, Ownership, and

A High-Level Language for Modeling Algorithms and their Properties Sabina Akhtar Stephan Merz

Electrum Lightweight specification of behavioral models with rich configurations Julien Brunel 1