CS6100: Topics in Design and Analysis of Algorithms Fault Tolerant - PDF document

CS6100: Topics in Design and Analysis of Algorithms Fault Tolerant Consensus CS6100 (Even 2012): Fault Tolerant Consensus

Models Failure Types 1. Clean Crash Failure — completely fail 2. (Unclean) Crash Failure — fail after some messages sent. 3. Byzantine Failure — behave arbitrarily. Timing • Asynchronous — impossible (FLP 1985) • Synchronous — our focus We consider f -resilient synchronous complete networks, i.e, at most f nodes crash, in the LOCAL model. CS6100 (Even 2012): Fault Tolerant Consensus 1

Problem Definition • Each node i has an intial value called input i • Intermediate variable x i (not strictly required) • Each node i has an output write-once variable y i output • Consensus is reached when following guarantees are achieved. Termination Every non-faulty node i must eventually write into y i . Agreement y i = y j for all non faulty nodes Validity If all input i = 0 (resp., 1) for non faulty i , all non faulty i should decide 0 (resp., 1). • We want to minimize time (rounds) to termination. CS6100 (Even 2012): Fault Tolerant Consensus 2

Algorithm Each node i initially sets x = p i , then V = { x } . For every round k , 1 ≤ k ≤ f + 1 : 1. Send the set V (or elements not sent before) to every other node. 2. Denote set received from v j as S j , j � = i and j has not crashed yet. 3. V = V ∪ � j S j . 4. If k = f + 1 , then y = min( V ) . Lemma 1. In every execution, at the end of round f + 1 , V i = V j for every two non-faulty nodes i and j . The following theorem follows as a corollary. Theorem 2. Above algorithm solves consensus in f + 1 rounds. CS6100 (Even 2012): Fault Tolerant Consensus 3

Proof of Lemma 1 Suppose for contradiction that V i ⊂ V j . Consider x ∈ V j \ V i . For every round i , 1 ≤ i ≤ f + 1 , x is in the set of values held by at least one node. Otherwise, it cannot exist in V j . However, there are at most f faults, so by the pigeonhole principle, there must be a fault free round in [1 , f + 1] . During that fault free round, every node would have received x , a contradiction to our assumption that V i ⊂ V j . CS6100 (Even 2012): Fault Tolerant Consensus 4

Lower Bound on the Number of Rounds Intuition: If nodes decide too early, they could not have distinguished between two executions where the decisions must have been different. Some definitions. Execution (Informal definition.) Sequence of synchronous operations (i.e., computations, messages sent and messages received) executed by all the nodes. View of node i Subsequence of computation and messages sent/received by node i . Denoted α | i . Similarity of Executions W.R.T i . Two executions α 1 and α 2 are similar w.r.t. node i if α 1 | i = α 1 | i . We denote this similarity by α 1 ∼ i α 2 Failure Sparse Executions There is at most 1 node failure per round. In the rest of the lower bound proof, we assume failure sparse executions. CS6100 (Even 2012): Fault Tolerant Consensus 5

Valence of a Configuration C is the set of all consensus values that can be reached from a configuration C . Univalent/Bivalent Configuration C The valence of C is a set of cardinality one/two. 0-valent configuration if the valence set has only 0. Similarly, define 1-valent configurations. Theorem 3. Any consensus algorithm A on n nodes resilient to f crash failures requires at least f + 1 rounds in some execution. (Assume n ≥ f + 2 .) Proof. First we show that the initial configuration is bivalent. Then we show that there is an execution under which the configuration after the first f − 1 rounds is bivalent. Then, show that one more round does not suffice. CS6100 (Even 2012): Fault Tolerant Consensus 6

Initial Bivalent Configuration Assume all initial configurations are univalent. Consider the following possible initial values in the sequence of n (= 10) nodes. 0000000000 -> 0 : : : 0000011111 -> 0 0000001111 -> 1 : : : 1111111111 -> 1 If 6th node crashes then, should the consensus be 1 or 0? A contradition. CS6100 (Even 2012): Fault Tolerant Consensus 7

f − 1 Round Bivalent Executions Lemma 4. ∀ k, 0 ≤ k ≤ f − 1 , ∃ a k -round execution of A that ends in a bivalent configuration. Proof. Assume first k − 1 rounds are bivalent, k ≤ f − 1 . We need to show that k th round is also bivalent. Assume the contrary. WLOG assume one round extension without failure is 1-valent. Call this β k . Since the execution is bivalent at end of k − 1 th round, there is a 1 round failure sparse extension that is 0-valent with one node p turning faulty. Call it γ k . One can find two intermediate configurations (shown k and α i +1 as α i ) with different univalencies. However, k the two configurations only differ slightly. In particular, there is a node q i +1 that does not hear from p in α i +1 k while it hears from p in α i k . CS6100 (Even 2012): Fault Tolerant Consensus 8

1-valent β k (node p sends to all nodes) 1-valent α i (node p fails to send to ( q 1 , . . . , q i )) k 0 / 1 0-valent α i +1 (node p fails to send to ( q 1 , . . . , q i , q i +1 )) k 0 − -valent γ k (node p fails to send to ( q 1 , . . . , q i , q i +1 , . . . , q m )) Consider the case where q i +1 dies in round k + 1 . The network nodes cannot distinguish between α i k and α i +1 . k CS6100 (Even 2012): Fault Tolerant Consensus 9

One More Indecisive Round Lemma 5. If α f − 1 is an f − 1 round execution of A that ends in a bivalent configuration, there ∃ a one- round extension of α f − 1 in which some non-faulty node has not decided. Proof Sketch. 1-valent β f p sends to p j p sends to p k δ f p fails to send to p j p sends to p k 0 − -valent γ f p fails to send to p j p might send to p k Note that in the above possible one round extensions, p k cannot distinguish between β f and δ f . Whereas, p j cannot distinguish between δ f and γ f . The proof of the theorem follows. CS6100 (Even 2012): Fault Tolerant Consensus 10

Byzantine Agreement With Global Coin What if f < n/ 8 nodes can behave arbitrarily 1 ? For instance, they can collude and actively thwart the efforts of the algorithm! Of course, no hope of solving consensus in ≤ f rounds. Suppose the nodes have access to a global unbiased coin. Each round, this coin produces an outcome that every node sees, but future outcomes are not revealed. 1 We don’t allow bad nodes to spoof other nodes. CS6100 (Even 2012): Fault Tolerant Consensus 11

Algorithm ByzGen Require: A value input i for each node i . Ensure: A decision y i . 1: vote = input i 2: L = (5 n/ 8) + 1 3: H = (3 n/ 4) + 1 4: G = 7 n/ 8 5: for all rounds do Broadcast vote. 6: Receive votes from all processors. 7: maj = majority of values received including own 8: vote. { maj is a value from { 0 , 1 } . } tally = # of occurrences of maj among votes 9: received. if coin = heads then 10: threshold = L 11: else 12: threshold = H 13: end if 14: if tally ≥ threshold then 15: vote = maj 16: else 17: CS6100 (Even 2012): Fault Tolerant Consensus 12

vote = 0 18: end if 19: if tally ≥ G then 20: Set y i to maj permanently. 21: end if 22: 23: end for CS6100 (Even 2012): Fault Tolerant Consensus 13

Analysis What happens if all good processors begin with same value? Easy Case! If two good nodes compute different maj values, then their tally will not exceed threshold (either L or H ). They will vote for 0 and next round maj will be 0. Faulty processors foil a threshold x ∈ { L, H } in a round if tally exceeds x for one processor while tally is below x for another. Note that both L and H cannot be foiled in the same round. The probability that a chosen threshold is foiled in a round, therefore, is at most 1 / 2 . So in expected 2 rounds, we have an unfoiled threshold. Then all good players will chose the same vote v and in the next round, they will reach consensus because G ≥ H + n/ 8 > H > L CS6100 (Even 2012): Fault Tolerant Consensus 14

nodes will send out vote v . Theorem 6. The expected # of rounds for ByzGen to reach agreement is a constant. CS6100 (Even 2012): Fault Tolerant Consensus 15

CS6100: Topics in Design and Analysis of Algorithms Fault Tolerant - PDF document

CS6100: Topics in Design and Analysis of Algorithms Fault Tolerant Consensus CS6100 (Even 2012): Fault Tolerant Consensus Models Failure Types 1. Clean Crash Failure completely fail 2. (Unclean) Crash Failure fail after some messages

CS6100: Topics in Design and Analysis of Algorithms Range Searching John Augustine CS6100 (Even

CS6100: Topics in Design and Analysis of Algorithms Point Location John Augustine CS6100 (Even

CS6100: Topics in Design and Analysis of Algorithms Voronoi Diagram John Augustine CS6100 (Even

CS6100: Topics in Design and Analysis of Algorithms Guarding and Triangulating Polygons John

CS6100: Topics in Design and Analysis of Algorithms Line Segment Intersections John Augustine

CS6100: Topics in Design and Analysis of Algorithms All Pairs Shortest Paths John Augustine

CS6100: Topics in Design and Analysis of Algorithms Delaunay Triangulation John Augustine

Design & Analysis of Design & Analysis of Design & Analysis of Physical Design

CSE202: Design and Analysis of Algorithms Ragesh Jaiswal, CSE, UCSD Ragesh Jaiswal, CSE, UCSD

CSE202: Design and Analysis of Algorithms Ragesh Jaiswal, CSE, UCSD Ragesh Jaiswal, CSE, UCSD

Graph Algorithms Chapter 22 1 CPTR 430 Algorithms Graph Algorithms Why Study Graph Algorithms?

Algorithms Chapter 3 Chapter Summary Algorithms n Example Algorithms n Algorithmic Paradigms

Greedy Algorithms Chapter 16 1 CPTR 430 Algorithms Greedy Algorithms Greedy Algorithms For

CSE101: Design and Analysis of Algorithms Ragesh Jaiswal, CSE, UCSD Ragesh Jaiswal, CSE, UCSD

General remarks Algorithms Algorithms Oliver Oliver Week 8 Kullmann Kullmann Greedy Greedy

DESIGN & ANALYSIS METHODS DESIGN & ANALYSIS METHODS FOR DESIGN & ANALYSIS METHODS

The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu

Byzantine Generals Problem II & FLP Impossibility August 28, 2019 Recap Conditions to

The Consensus Problem Roger Wattenhofer thread a lot of kudos to memory Maurice Herlihy and

When Is Agreement Possible? CS 188 Distributed Systems February 24, 2015 Lecture 13 Page 1 CS

Tropical Secant Graphs of Monomial Curves Mar a Ang elica Cueto Shaowei Lin Department

Combinatorics of the Star Product in AQFT Eli Hawkins Kasia Rejzner The University of York July

The Liar Paradox and Other Puzzles: Games Logicians Play Dr. Sara L. Uckelman

H ajeks Basic Logic: Decision and Representation Simone Bova bova@unisi.it Department of

CS6100: Topics in Design and Analysis of Algorithms Fault Tolerant - PDF document

CS6100: Topics in Design and Analysis of Algorithms Fault Tolerant Consensus CS6100 (Even 2012): Fault Tolerant Consensus Models Failure Types 1. Clean Crash Failure completely fail 2. (Unclean) Crash Failure fail after some messages

CS6100: Topics in Design and Analysis of Algorithms Range Searching John Augustine CS6100 (Even

CS6100: Topics in Design and Analysis of Algorithms Point Location John Augustine CS6100 (Even

CS6100: Topics in Design and Analysis of Algorithms Voronoi Diagram John Augustine CS6100 (Even

CS6100: Topics in Design and Analysis of Algorithms Guarding and Triangulating Polygons John

CS6100: Topics in Design and Analysis of Algorithms Line Segment Intersections John Augustine

CS6100: Topics in Design and Analysis of Algorithms All Pairs Shortest Paths John Augustine

CS6100: Topics in Design and Analysis of Algorithms Delaunay Triangulation John Augustine

Design &amp; Analysis of Design &amp; Analysis of Design &amp; Analysis of Physical Design

CSE202: Design and Analysis of Algorithms Ragesh Jaiswal, CSE, UCSD Ragesh Jaiswal, CSE, UCSD

CSE202: Design and Analysis of Algorithms Ragesh Jaiswal, CSE, UCSD Ragesh Jaiswal, CSE, UCSD

Graph Algorithms Chapter 22 1 CPTR 430 Algorithms Graph Algorithms Why Study Graph Algorithms?

Algorithms Chapter 3 Chapter Summary Algorithms n Example Algorithms n Algorithmic Paradigms

Greedy Algorithms Chapter 16 1 CPTR 430 Algorithms Greedy Algorithms Greedy Algorithms For

CSE101: Design and Analysis of Algorithms Ragesh Jaiswal, CSE, UCSD Ragesh Jaiswal, CSE, UCSD

General remarks Algorithms Algorithms Oliver Oliver Week 8 Kullmann Kullmann Greedy Greedy

DESIGN &amp; ANALYSIS METHODS DESIGN &amp; ANALYSIS METHODS FOR DESIGN &amp; ANALYSIS METHODS

The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu

Byzantine Generals Problem II &amp; FLP Impossibility August 28, 2019 Recap Conditions to

The Consensus Problem Roger Wattenhofer thread a lot of kudos to memory Maurice Herlihy and

When Is Agreement Possible? CS 188 Distributed Systems February 24, 2015 Lecture 13 Page 1 CS

Tropical Secant Graphs of Monomial Curves Mar a Ang elica Cueto Shaowei Lin Department

Combinatorics of the Star Product in AQFT Eli Hawkins Kasia Rejzner The University of York July

The Liar Paradox and Other Puzzles: Games Logicians Play Dr. Sara L. Uckelman

H ajeks Basic Logic: Decision and Representation Simone Bova bova@unisi.it Department of

Design & Analysis of Design & Analysis of Design & Analysis of Physical Design

DESIGN & ANALYSIS METHODS DESIGN & ANALYSIS METHODS FOR DESIGN & ANALYSIS METHODS

Byzantine Generals Problem II & FLP Impossibility August 28, 2019 Recap Conditions to