Distributed Systems (CS425/ECE428), 03/04/2020
Logistics
- HW3
- Released on Monday.
- You should be able to solve it completely after today’s class.
- MP1
- Due date extended to Monday, March 9th, 11:59pm.
- MP2
- Will be released on Monday, March 9th (and not this Friday).
Recap: Leader Election
- In a group of processes, elect a Leader to undertake special tasks
- Let everyone know in the group about this Leader.
- Safety condition:
- During the run of an election, a correct process has either not yet elected
a leader, or has elected process with best attributes.
- Liveness condition:
- Election run terminates and each process eventually elects someone.
- Two classical algorithms:
- Ring-based algorithm
- Bully algorithm
- Difficulty of ensuring both safety and liveness in an asynchronous system under failures.
- Related to consensus!
Agenda for the next 2-3 weeks
- Consensus
- Consensus in synchronous systems
- Chapter 15.4
- Impossibility of consensus in asynchronous systems
- Impossibility of Distributed Consensus with One Faulty Process, Fischer-
Lynch-Paterson (FLP), 1985
- A good enough consensus algorithm for asynchronous systems:
- Paxos made simple, Leslie Lamport, 2001
- Other forms of consensus
- Blockchains
- Raft (log-based consensus)
Consensus
- Each process proposes a value.
- All processes must agree on one of the proposed values.
- Examples:
- The generals must agree on the time of attack.
- An object replicated across multiple servers in a distributed data store.
- All servers must agree on the current version of the object.
- Transaction processing on replicated servers
- Must agree on the order in which updates are applied to an object.
- …..
Consensus
- Each process proposes a value.
- All processes must agree on one of the proposed values.
- The final value can be decided based on any criteria:
- Pick minimum of all proposed values.
- Pick maximum of all proposed values.
- Pick the majority (with some deterministic tie-breaking rule).
- Pick the value proposed by the leader.
- All processes must agree on who the leader is.
- If reliable total-order can be achieved, pick the proposed value that gets
delivered first.
- All processes must agree on the total order.
- ……
Consensus Problem
- System of N processes (P1, P2, …, Pn)
- Each process Pi:
- begins in an undecided state.
- proposes value vi.
- at some point during the run of a consensus algorithm, sets a
decision variable di and enters the decided state.
Required Properties
- Termination: Eventually each process sets its decision variable.
- Agreement: The decision value of all correct processes is the same.
- If Pi and Pj are correct and have entered the decided state, then di = dj.
- Integrity: If the correct processes all proposed the same value, then any correct process in the decided state has chosen that value.
- (The definition of integrity differs across sources; a lack of consensus!)
Required Properties
- Termination: Eventually each process sets its decision variable.
- Agreement: The decision value of all correct processes is the same.
- If Pi and Pj are correct and have entered the decided state, then di = dj.
- Integrity: If the correct processes all proposed the same value, then any correct process in the decided state has chosen that value.
- Which of these properties is liveness, and which is safety?
Required Properties
- Termination: Eventually each process sets its decision variable.
- Liveness
- Agreement: The decision value of all correct processes is the same.
- If Pi and Pj are correct and have entered the decided state, then di = dj.
- Safety
- Integrity: If the correct processes all proposed the same value, then
any correct process in the decided state has chosen that value.
How do we agree on a value?
- Ring-based leader election
- Send the proposed value along with the elected message.
- Turnaround time: 3NT worst case and 2NT best case (without failures).
- T is the time taken to transmit a message on a channel.
- O(N·f·T) if up to f processes fail during the election run.
- Can we do better?
- Bully algorithm
- Send proposed value along with the coordinator message.
- Turnaround time: 4T in the worst case without failures.
- More than 2fT if up to f processes fail during the election run.
What’s the best we can do?
Consider the simplest algorithm
- Let’s assume the system is synchronous.
- Use a simple B-multicast:
- All processes B-multicast their proposed value to all other processes.
- Upon receiving all proposed values, pick the minimum.
- Time taken under no failures?
- One message transmission time (T)
- What can go wrong?
- If we consider process failures, is a simple B-multicast enough?
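The failure-free case above can be sketched in a few lines. A minimal simulation (function and variable names are my own): every process B-multicasts its proposal, receives everyone else's, and applies the deterministic minimum rule.

```python
def b_multicast_consensus(proposals):
    """proposals: dict mapping process id -> proposed value.
    Simulates one synchronous round with no failures: B-multicast delivers
    every proposal to every process, so all decide on the same minimum."""
    decisions = {}
    for pid in proposals:
        received = list(proposals.values())  # every proposal arrives within time T
        decisions[pid] = min(received)       # deterministic rule: pick the minimum
    return decisions

print(b_multicast_consensus({1: 7, 2: 3, 3: 9}))  # every process decides 3
```

With crashes mid-multicast, `received` can differ across processes, which is exactly the problem the next slides illustrate.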
B-multicast is not enough for this
[Diagram: P4 B-multicasts v4 but crashes after it reaches only one process; three surviving processes end with {v1, v2, v3, v5} while one ends with {v1, v2, v3, v4, v5}.]
Need R-multicast
[Diagram: with R-multicast, the receiving process relays v4, so all surviving processes end with {v1, v2, v3, v4, v5}.]
Handling failures
- P4 fails before sending v4 to anyone.
- What should other processes do?
- Detect failure. Timeout!
- Assume proposals are sent at time ‘s’.
- Worst-case skew is 𝜗.
- Maximum message transfer time
(including local processing) is T.
- What should the timeout value be?
[Diagram: P4 crashes before sending v4; P1, P2, P3, P5 each end with {v1, v2, v3, v5}.]
Handling failures
- Assume proposals are sent at time ‘s’.
- Worst-case skew is 𝜗.
- Maximum message transfer time
(including local processing) is T.
- What should the timeout value be?
- Option 1: 𝜗 + T
- Pi waits for (𝜗 + T) time units after
sending its proposal at time ‘s’.
- Any other process must have sent
proposed value before s + 𝜗.
- The proposed value should have
reached Pi by (s + 𝜗 + T).
- Will this work?
[Diagram: P4 crashes before sending v4 to anyone; all surviving processes time out and end with {v1, v2, v3, v5}.]
Handling failures
- Assume proposals are sent at time ‘s’.
- Worst-case skew is 𝜗.
- Maximum message transfer time
(including local processing) is T.
- What should the timeout value be?
- How about 𝜗 + T, measured on Pi's local clock?
- Any other process Pj must have sent its proposed value before time s + 𝜗.
- The proposed value should have reached Pi by (s + 𝜗 + T).
- Will this work? Not if a process crashes mid-multicast: if P4's value reaches only one process before P4 crashes, that process relays it (R-multicast), and the relayed copy may arrive after the 𝜗 + T timeout.
[Diagram: three surviving processes end with {v1, v2, v3, v5}, while one ends with {v1, v2, v3, v4, v5}: agreement is violated.]
Handling failures
- Assume proposals are sent at time ‘s’.
- Worst-case skew is 𝜗.
- Maximum message transfer time (including local processing) is T.
- What should the timeout value be?
- How about 𝜗 + 2*T?
- Will this work? Not if a relaying process also crashes mid-relay: the value then needs a second relay hop, which may arrive after the 𝜗 + 2*T timeout.
[Diagram: two surviving processes end with {v1, v2, v3, v4, v5}, while two end with {v1, v2, v3, v5}: agreement is violated.]
Handling failures
- Assume proposals are sent at time ‘s’.
- Worst-case skew is 𝜗.
- Maximum message transfer time (including local processing) is T.
- What should the timeout value be?
- How about 𝜗 + 3*T?
- Will this work? Each additional crash along the relay chain can add one more hop of delay T, so any fixed number of T's can be defeated by enough crashes.
[Diagram: three surviving processes end with {v1, v2, v3, v4, v5}, while one still has only {v1, v2, v3, v5}: agreement is violated.]
Handling failures
- Assume proposals are sent at time ‘s’.
- Worst-case skew is 𝜗.
- Maximum message transfer time (including local processing) is T.
- What should the timeout value be?
- Timeout = 𝜗 + (f+1)*T for up to f failed processes.
- This bound also holds for R-multicast from a single sender.
[Diagram: final value sets at P1–P5 with the chosen timeout.]
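The timeout bound above is simple arithmetic; a tiny helper (the function name is my own) makes the dependence on f explicit.

```python
def timeout(skew, max_delay, f):
    """Time a process must wait after sending its proposal at time s before
    concluding that any value not yet received will never arrive.
    skew: worst-case clock skew (theta); max_delay: max message transfer
    time including processing (T); f: max number of crashes tolerated."""
    return skew + (f + 1) * max_delay

# E.g. skew 1, transfer time 10, up to 2 failures: 1 + 3*10 = 31 time units.
print(timeout(1, 10, 2))
```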
Round-based algorithm
- For a system with at most f processes crashing
- All processes are synchronized and operate in “rounds” of time.
- One round of time is equivalent to 𝜗 + T units.
- At each process, the ith round
- starts at local time s + (i -1)*(𝜗 + T)
- ends at local time s + i*(𝜗 + T)
- The start or end time of a round in two different processes
differs by at most 𝜗.
- The algorithm proceeds in f+1 rounds.
- Assume communication channels are reliable.
Round-based algorithm

Values_i^r : the set of proposed values known to Pi at the beginning of round r.

Initially: Values_i^0 = {} and Values_i^1 = {v_i}

for round r = 1 to f+1 do
    B-multicast(Values_i^r − Values_i^(r−1))    // send only newly learned values
    Values_i^(r+1) ← Values_i^r
    wait until one round of time expires
    for each v_j received in this round do
        Values_i^(r+1) = Values_i^(r+1) ∪ {v_j}
end
d_i = minimum(Values_i^(f+2))
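The round-based algorithm can be simulated in memory. In this sketch (names and the crash model are my own assumptions), a crashing process sends its round message to only a prefix of its peers and then stops participating.

```python
def round_based_consensus(proposals, crash_schedule, f):
    """proposals: pid -> proposed value.
    crash_schedule: pid -> (round, k), meaning the process crashes in that
    round after sending to only its first k peers.
    f: maximum number of crashes tolerated; runs f+1 rounds."""
    pids = sorted(proposals)
    values = {p: {proposals[p]} for p in pids}   # Values_i^1 = {v_i}
    prev = {p: set() for p in pids}              # Values_i^0 = {}
    alive = set(pids)
    for r in range(1, f + 2):                    # rounds 1 .. f+1
        inbox = {p: set() for p in pids}
        for p in list(alive):
            new = values[p] - prev[p]            # B-multicast only new values
            targets = [q for q in pids if q != p]
            if p in crash_schedule and crash_schedule[p][0] == r:
                targets = targets[:crash_schedule[p][1]]  # partial send...
                alive.discard(p)                          # ...then crash
            for q in targets:
                inbox[q] |= new
        for p in alive:
            prev[p] = set(values[p])
            values[p] |= inbox[p]                # union values received this round
    return {p: min(values[p]) for p in alive}    # d_i = min(Values_i^{f+2})

# P1 crashes in round 1 after reaching only P2; with f=1 (two rounds),
# P2 relays v1 in round 2, so the survivors still agree.
print(round_based_consensus({1: 10, 2: 20, 3: 30}, {1: (1, 1)}, 1))
```

The extra round is exactly what lets a partially delivered value propagate past the crash, mirroring the f+1-round bound from the slides.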
Why does this work?
- After f+1 rounds, all non-faulty processes will have received the same set of values.
- Proof by contradiction:
- Assume two non-faulty processes, say Pi and Pj, differ in their final set of values (i.e., after f+1 rounds).
- Assume Pi possesses a value v that Pj does not possess.
- Pi must have received v in the very last round; otherwise, Pi would have sent v to Pj in that last round.
- So, in the last round, a third process, Pk, must have sent v to Pi, but crashed before sending v to Pj.
- Similarly, a fourth process that sent v in the last-but-one round must have crashed; otherwise, both Pk and Pj would have received v.
- This implies at least one (unique) crash in each of the preceding rounds.
- That means a total of f+1 crashes, contradicting our assumption of at most f crashes.
Consensus in synchronous systems
- Dolev and Strong proved that for a system with up to f failures (or faulty processes), at least f+1 rounds of information exchange are required to reach agreement.
- In asynchronous systems, time-based “rounds” or timeouts may not work:
- Cannot guarantee both completeness and accuracy of failure detection.
- Cannot differentiate between an extremely slow process and a failed process.
- Key intuition behind the famous FLP result on the impossibility of
consensus in asynchronous systems.
- Stopped many distributed system designers dead in their tracks.
- A lot of claims of “reliability” vanished overnight.
- We will discuss the detailed proof in Friday’s class.
What about asynchronous systems?
- We cannot “solve” consensus in asynchronous systems.
- We cannot meet both safety and liveness requirements.
- Maybe it is ok to guarantee just one requirement.
- Option 1:
- Let’s set a super conservative timeout for a terminating algorithm.
- Safety violated if a process (or the network) is very, very slow.
- Option 2:
- Let’s focus on guaranteeing safety under all possible scenarios.
- If the real situation is not too dire, hopefully the algorithm will
terminate.
What about asynchronous systems?
- Paxos algorithm for consensus in asynchronous systems.
- The most popular consensus algorithm.
- Many systems use it:
- Zookeeper (Yahoo!), Google Chubby, and many others.
- Not guaranteed to terminate, but never violates safety.
Paxos Consensus Algorithm
- Guess who invented it?
- Leslie Lamport!
- Original paper: The Part-time Parliament.
- Used the analogy of a “part-time parliament” on the ancient Greek island of Paxos.
- No one understood it.
- The paper was rejected.
- Published “Paxos made simple” 10 years later.
- Three types of roles:
- Proposers: propose values to acceptors.
- All or subset of processes.
- Having a single proposer (leader) may allow faster termination.
- Acceptors: accept proposed values (under certain conditions).
- All or subset of processes.
- Learners: learns the value that has been accepted by majority of
acceptors.
- All processes.
Paxos Algorithm: Try 1: Single Phase
- A proposer multicasts its proposed value to a large enough set
(larger than majority) of acceptors.
- An acceptor accepts the first proposed value it receives.
- If majority of acceptors have accepted the same value v, then v is the
decided value.
- What can go wrong here?
Paxos Algorithm: Try 1: Single phase
[Diagram: two proposers multicast different values; some acceptors accept the red value first and others accept the purple value first, so neither value is accepted by a majority.]
No decision reached!
Paxos Algorithm: Proposal numbers
- Allow an acceptor to accept multiple proposals.
- Accepting is different from deciding.
- Distinguish proposals by assigning unique ids (a proposal number) to
each proposal.
- Configure a disjoint set of possible proposal numbers for
different processes.
- Proposal number is different from proposed value!
- A higher number proposal overwrites and pre-empts a lower
number proposal.
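One common way to configure disjoint proposal numbers (an implementation choice, not prescribed by the slides) is to interleave them by process id, so numbers never collide across processes and grow with each new attempt.

```python
def proposal_number(round_no, pid, n_processes):
    """Attempt round_no at process pid yields number round_no*N + pid:
    unique across all processes, and strictly increasing per process."""
    return round_no * n_processes + pid

# With 3 processes: P0 uses 0, 3, 6, ...; P1 uses 1, 4, 7, ...; P2 uses 2, 5, 8, ...
print(sorted(proposal_number(r, p, 3) for r in range(3) for p in range(3)))
```

Because every proposal number maps back to exactly one (round, process) pair, a higher number always identifies a strictly later proposal, which is what the pre-emption rule relies on.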
Paxos Algorithm: Try 2: Proposal #s
[Diagram: proposal #1 (value 10) and proposal #2 (value 20); one acceptor accepts proposal #1, while the others accept the higher-numbered proposal #2, which pre-empts proposal #1.]
What can go wrong here?
Paxos Algorithm: Try 2: Proposal #s
[Diagram: proposal #1 (value 10) is accepted by two acceptors; proposal #2 (value 20) is then accepted by two acceptors; proposal #3 (value 30) is then accepted by two acceptors: acceptances keep getting pre-empted.]
When do we stop and decide on a value?
Paxos Algorithm
- Key condition:
- When majority of acceptors accept a single proposal with a
value v, then that value v becomes the decided value.
- This is an implicit decision. Learners may not know about it right away.
- Any higher-numbered proposal that gets accepted by majority of
acceptors after the implicit decision must propose the same decided value.
Paxos Algorithm
[Diagram: proposal #1 (value 10) is accepted by a majority of acceptors: the point of no return. Any proposal accepted by a majority of acceptors after this must propose the same value as proposal #1 (i.e., 10), so proposals #2 and #3 also carry value 10.]
Paxos Algorithm: Two phases
- Phase 1:
- A proposer selects a proposal number (n) and sends a prepare
request with n to majority of acceptors, requesting:
- Promise me you will not reply to any other proposal with a lower number.
- Promise me you will not accept any other proposal with a lower number.
- If an acceptor receives a prepare request for proposal #n, and it has not responded to a prepare request with a higher number, it replies back saying:
- OK! I will keep that promise for any request I receive in the future.
- (If applicable) I have already accepted a value v from a proposal with a lower number m < n. That proposal has the highest number among the ones I have accepted so far.
Paxos Algorithm: Two phases
- Phase 2:
- If a proposer receives an OK response for its prepare request
#n from a majority of acceptors, then it sends an accept request with a proposed value. What is the proposed value?
- The value v of the highest numbered proposal among the received
responses.
- Any value if no previously accepted value in the received responses.
- If an acceptor receives an accept request for proposal #n, and it has not responded to a prepare request with a higher number, it accepts the proposal.
- What if the proposer does not hear from majority of acceptors?
- Wait for some time, and then issue a new request with higher
number.
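The two phases above can be sketched as a minimal single-decree Paxos in memory. This is a sketch under my own naming assumptions (no real network, no retries, no learners); it shows only the promise/accept logic and the rule for choosing the proposed value in phase 2.

```python
class Acceptor:
    def __init__(self):
        self.promised = -1        # highest prepare number responded to
        self.accepted_n = -1      # number of the highest accepted proposal
        self.accepted_v = None    # value of the highest accepted proposal

    def prepare(self, n):
        """Phase 1: promise not to respond to proposals numbered below n,
        reporting any previously accepted proposal."""
        if n > self.promised:
            self.promised = n
            return ('ok', self.accepted_n, self.accepted_v)
        return ('reject', None, None)

    def accept(self, n, v):
        """Phase 2: accept unless a higher-numbered prepare was promised."""
        if n >= self.promised:
            self.promised = n
            self.accepted_n, self.accepted_v = n, v
            return 'ok'
        return 'reject'

def run_proposer(acceptors, n, value):
    """Run both phases of proposal #n; returns the accepted value or None."""
    majority = len(acceptors) // 2 + 1
    oks = [r for r in (a.prepare(n) for a in acceptors) if r[0] == 'ok']
    if len(oks) < majority:
        return None               # retry later with a higher number
    # Adopt the value of the highest-numbered previously accepted proposal,
    # if any; otherwise the proposer is free to use its own value.
    best = max(oks, key=lambda r: r[1])
    if best[2] is not None:
        value = best[2]
    acks = sum(1 for a in acceptors if a.accept(n, value) == 'ok')
    return value if acks >= majority else None
```

For example, after `run_proposer(accs, 1, 'x')` decides 'x' on three acceptors, a later `run_proposer(accs, 2, 'y')` returns 'x', not 'y': phase 1 reports the accepted value, so the higher-numbered proposal carries the decided value, as the slides require.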
Paxos Algorithm
- When majority of acceptors accept a single proposal with a value v,
then that value v becomes the decided value.
- Suppose this proposal has a number m.
- By design of the algorithm, any subsequent proposal with a number n higher than m will propose the value v.
- Proof by induction:
- Induction hypothesis: every proposal with a number in [m, …, n−1] proposes value v.
- Consider the set C of a majority of acceptors that accepted proposal #m (and value v).
- Every acceptor in C has accepted a proposal with a number in [m, …, n−1].
- By the hypothesis, every acceptor in C has accepted a proposal with value v.
- Any set consisting of a majority of acceptors has at least one member in C.
- Proposal #n's prepare request will therefore receive an OK reply carrying value v, so proposal #n also proposes v.
Paxos Algorithm
- When majority of acceptors accept a single proposal with a value v,
then that value v becomes the decided value.
- How do learners learn about it?
- Every time an acceptor accepts a value, send the value and proposal # to a
distinguished learner.
- This distinguished learner will check if a decision has been reached and will
inform other learners.
- Use a set of distinguished learners to better handle failures.
- What happens if a message is lost or all distinguished learners fail?
- May not know that a decision has been reached.
- A proposer will issue a new request (and will propose the same value).
Acceptors will accept the same value and will notify the learner again.
Paxos Algorithm
- Best strategy: elect a single leader who proposes values.
- Assume this leader is also the distinguished learner.
- What if we have multiple proposers? (Leader election is not perfect in asynchronous systems.)
- May livelock! Two proposers may keep pre-empting each other's requests by constantly sending new proposals with higher numbers.
- Safety is still guaranteed!
[Diagram: message sequence for a decision: Prepare #1, OK!, Accept #1 with value v, OK!, decided v.]
Paxos Algorithm
- What if majority of acceptors fail before a value is decided?
- Algorithm does not terminate.
- Safety is still guaranteed!
- What if a process fails and recovers again?
- If it is an acceptor, it must remember the highest-numbered proposal it has accepted.
- Acceptors log accepted proposals to disk.
- As long as this state can be retrieved after failure and recovery, the algorithm works fine and safety is still guaranteed.
- Exercise: think about what else can go wrong and how Paxos would handle that situation.
Summary
- Consensus is a fundamental problem in distributed
systems.
- Possible to solve consensus in synchronous systems.
- Algorithm based on time-synchronized rounds.
- Need at least (f+1) rounds to handle up to f failures.
- Impossible to solve consensus in asynchronous systems.
- Details in next class!
- Paxos algorithm:
- Guarantees safety but not liveness.
- Expected to terminate under good enough conditions.