Chapter 14: Consensus and Agreement Ajay Kshemkalyani and Mukesh - PowerPoint PPT Presentation

Chapter 14: Consensus and Agreement Ajay Kshemkalyani and Mukesh Singhal Distributed Computing: Principles, Algorithms, and Systems Cambridge University Press A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 1 / 54

Distributed Computing: Principles, Algorithms, and Systems Assumptions System assumptions 1 Failure models G1 G2 Synchronous/ Asynchronous 0 1 communication Network connectivity 1 1 0 0 1 Sender identification 0 0 Channel reliability Authenticated vs. non-authenticated 0 G3 G4 messages 0 Agreement variable A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 2 / 54

Distributed Computing: Principles, Algorithms, and Systems Problem Specifications Byzantine Agreement (single source has an initial value) Agreement: All non-faulty processes must agree on the same value. Validity: If the source process is non-faulty, then the agreed upon value by all the non-faulty processes must be the same as the initial value of the source. Termination: Each non-faulty process must eventually decide on a value. Consensus Problem (all processes have an initial value) Agreement: All non-faulty processes must agree on the same (single) value. Validity: If all the non-faulty processes have the same initial value, then the agreed upon value by all the non-faulty processes must be that same value. Termination: Each non-faulty process must eventually decide on a value. Interactive Consistency (all processes have an initial value) Agreement: All non-faulty processes must agree on the same array of values A [ v 1 . . . v n ]. Validity: If process i is non-faulty and its initial value is v i , then all non-faulty processes agree on v i as the i th element of the array A . If process j is faulty, then the non-faulty processes can agree on any value for A [ j ]. Termination: Each non-faulty process must eventually decide on the array A . These problems are equivalent to one another! Show using reductions. A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 3 / 54

Distributed Computing: Principles, Algorithms, and Systems Overview of Results Failure Synchronous system Asynchronous system mode (message-passing and shared memory) (message-passing and shared memory) No agreement attainable; agreement attainable; failure common knowledge also attainable concurrent common knowledge attainable Crash agreement attainable agreement not attainable failure f < n processes Ω( f + 1) rounds Byzantine agreement attainable agreement not attainable failure f ≤ ⌊ ( n − 1) / 3 ⌋ Byzantine processes Ω( f + 1) rounds Table: Overview of results on agreement. f denotes number of failure-prone processes. n is the total number of processes. In a failure-free system, consensus can be attained in a straightforward manner A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 4 / 54

Distributed Computing: Principles, Algorithms, and Systems Some Solvable Variants of the Consensus Problem in Async Systems Solvable Failure model and overhead Definition Variants Reliable crash failures, n > f (MP) Validity, Agreement, Integrity conditions broadcast k -set crash failures. f < k < n . size of the set of values agreed consensus (MP and SM) upon must be less than k ǫ -agreement crash failures values agreed upon are n ≥ 5 f + 1 (MP) within ǫ of each other Renaming up to f fail-stop processes, select a unique name from n ≥ 2 f + 1 (MP) a set of names Crash failures f ≤ n − 1 (SM) Table: Some solvable variants of the agreement problem in asynchronous system. The overhead bounds are for the given algorithms, and not necessarily tight bounds for the problem. A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 5 / 54

Distributed Computing: Principles, Algorithms, and Systems Solvable Variants of the Consensus Problem in Async Systems Circumventing the impossibility results for consensus in asynchronous systems Message−passing Shared memory k set consensus k set consensus Consensus epsilon− consensus epsilon− consensus using more powerful Renaming Renaming objects than atomic registers. Reliable broadcast This is the study of using atomic registers and universal objects and atomic snapshot objects universal constructions. constructed from atomic registers A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 6 / 54

Distributed Computing: Principles, Algorithms, and Systems Consensus Algorithm for Crash Failures (MP, synchronous) Up to f ( < n ) crash failures possible. In f + 1 rounds, at least one round has no failures. Now justify: agreement, validity, termination conditions are satisfied. Complexity: O ( f + 1) n 2 messages f + 1 is lower bound on number of rounds (global constants) integer: f ; // maximum number of crash failures tolerated (local variables) integer: x ← − local value; (1) Process P i (1 ≤ i ≤ n ) executes the Consensus algorithm for up to f crash failures: (1a) for round from 1 to f + 1 do (1b) if the current value of x has not been broadcast then (1c) broadcast ( x ); (1d) y j ← − value (if any) received from process j in this round; (1e) x ← − min ( x , y j ); (1f) output x as the consensus value. A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 7 / 54

Distributed Computing: Principles, Algorithms, and Systems Upper Bound on Byzantine Processes (sync) Agreement impossible when f = 1 , n = 3. P P c c commander commander 0 0 0 1 1 1 P P P P a b a b 0 0 (b) (a) malicious process correct process first round message second round message Taking simple majority decision does not help because loyal commander P a cannot distinguish between the possible scenarios (a) and (b); hence does not know which action to take. Proof using induction that problem solvable if f ≤ ⌊ n − 1 3 ⌋ . See text. A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 8 / 54

Distributed Computing: Principles, Algorithms, and Systems Consensus Solvable when f = 1 , n = 4 P P d d 0 0 1 0 0 0 0 1 1 0 commander commander P P c c 1 0 0 0 0 0 P P P P a b a b 1 0 (a) (b) first round exchange second round exchange correct process malicious process There is no ambiguity at any loyal commander, when taking majority decision Majority decision is over 2nd round messages, and 1st round message received directly from commander-in-chief process. A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 9 / 54

Distributed Computing: Principles, Algorithms, and Systems Byzantine Generals (recursive formulation), (sync, msg-passing) (variables) boolean : v ← − initial value; integer : f ← − maximum number of malicious processes, ≤ ⌊ ( n − 1) / 3 ⌋ ; (message type) Oral Msg ( v , Dests , List , faulty ), where v is a boolean, Dests is a set of destination process ids to which the message is sent, List is a list of process ids traversed by this message, ordered from most recent to earliest, faulty is an integer indicating the number of malicious processes to be tolerated. Oral Msg(f ) , where f > 0: 1 The algorithm is initiated by the Commander, who sends his source value v to all other processes using a OM(v , N , � i � , f ) message. The commander returns his own value v and terminates. [Recursion unfolding:] For each message of the form OM(vj , Dests , List , f ′ ) received in this round from some process j , the process i uses the 2 value vj it receives from the source, and using that value, acts as a new source. (If no value is received, a default value is assumed.) To act as a new source, the process i initiates Oral Msg(f ′ − 1 ) , wherein it sends OM(vj , Dests − { i } , concat ( � i � , L ) , ( f ′ − 1) ) to destinations not in concat ( � i � , L ) in the next round. [Recursion folding:] For each message of the form OM(vj , Dests , List , f ′ ) received in Step 2, each process i has computed the agreement 3 value vk , for each k not in List and k � = i ,corresponding to the value received from Pk after traversing the nodes in List , at one level lower in the recursion. If it receives no value in this round, it uses a default value. Process i then uses the value majorityk �∈ List , k � = i ( vj , vk ) as the agreement value and returns it to the next higher level in the recursive invocation. Oral Msg(0): 1 [Recursion unfolding:] Process acts as a source and sends its value to each other process. 2 [Recursion folding:] Each process uses the value it receives from the other sources, and uses that value as the agreement value. If no value is received, a default value is assumed. A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP 2008 10 / 54

Chapter 14: Consensus and Agreement Ajay Kshemkalyani and Mukesh - PowerPoint PPT Presentation

Chapter 14: Consensus and Agreement Ajay Kshemkalyani and Mukesh Singhal Distributed Computing: Principles, Algorithms, and Systems Cambridge University Press A. Kshemkalyani and M. Singhal (Distributed Computing) Consensus and Agreement CUP

Consensus and Dissent or: Meta - Consensus Consensus about what we have consensus

Prior Work Consensus Consensus Reliable BGP Consensus Reliable BGP Consensus Routing

Consensus Building Consensus is Consensus is finding an acceptable proposal that all members

Prior Work Consensus Consensus Reliable BGP Consensus Reliable BGP Consensus Routing

When Aeron Met Raft Martin Thompson - @mjpt777 What does Consensus mean? consensus

Distributed Systems CS425/ECE428 03/06/2020 Todays agenda Consensus Consensus in

CONSENSUS Fall 2012 Ken Birman Consensus a classic problem Consensus abstraction underlies

Membership of the consensus group Membership of the consensus group Members of the group were

Distributed Algorithms (PhD course) Consensus SARDAR MUHAMMAD SULAMAN Consensus The

Number of rounds for Consensus Non-Uniform Consensus (Non-Uniform) Agreement: No two correct

Lecture 21: Consensus - 1 1 Consensus is agreement. A question is seen as settled. Sometimes a

RAFT Consensus Slide content borrowed from Diego Ongaro, John Ousterhout, and Alberto Montresor

Raft: A Consensus Algorithm for Replicated Logs Diego Ongaro and John Ousterhout Stanford

Proof of Luck: an Efficient Blockchain Consensus Protocol Mitar Milutinovic, Warren He, Howard

Distributed Algorithms (PhD course) Consensus SARDAR MUHAMMAD SULAMAN Consensus (Recapitulation)

Consensus Variants Usman Mazhar Mirza 6/17/2013 1 Consensus Variants In the variants we

What Use Is Verified Software? John Rushby Computer Science Laboratory SRI International Menlo

Facing Up to Faults Facing Up to Faults Facing Up to Faults (v.2.0.1) (v.2.0.1) (v.2.0.1)

Think outside the rack 2015-04-21 WRSC john wilkes / johnwilkes@google.com, Parthasarathy

Multipath Transport, Resource Pooling, and implications for Routing Mark Handley , UCL and XORP,

Building a Fault- Building a Fault- Tolerant Distributed Tolerant Distributed System with

Outline 1. Introduction to fault diagnosis of DES 2. Seminal work of

Turbine Availability TIM BEDFORD, ATHENA ZITROU, LESLEY WALLS and KEVIN WILSON Department of

REMINDER -- Failure Rate Determination Class Project Failure Rate Calculations: 1.