Fault Tolerance, Replication, and Consistency 1 Motivation: Hadoop - PowerPoint PPT Presentation

Fault Tolerance, Replication, and Consistency 1

Motivation: Hadoop Cluster 2

Motivation: Hadoop Cluster Mostly retired desktops Intel Core 2: launched in 2008 Support is gathering old servers 3

Motivation: Hadoop Cluster Mostly retired desktops Intel Core 2: launched in 2008 Support is gathering old servers Test case for fault tolerance! 4

Fault Tolerance In any sufficiently large cluster, machines will fail. In any sufficiently large job, machines will fail. 5

Defining Failure Crashed: Node disappeared 6

Defining Failure Crashed: Node disappeared Slow: Too many students logged in, . . . 7

Defining Failure Crashed: Node disappeared Slow: Too many students logged in, . . . Omission: Drops a request Hard drive bad sector = ⇒ drops request for that file Intermittent network cable 8

Defining Failure Crashed: Node disappeared Slow: Too many students logged in, . . . Omission: Drops a request Hard drive bad sector = ⇒ drops request for that file Intermittent network cable Wrong: Returns bad data/does not follow protocol Defective RAM Undetected disk errors Wrong software version 9

Defining Failure Crashed: Node disappeared Slow: Too many students logged in, . . . Omission: Drops a request Hard drive bad sector = ⇒ drops request for that file Intermittent network cable Wrong: Returns bad data/does not follow protocol Defective RAM Undetected disk errors Wrong software version Byzantine: Many untrustworthy nodes, worst-case behavior Hacked Volunteer nodes (Tor, BitTorrent, Bitcoin) 10

Failure: An Outline 1 Timeouts 2 Replication 3 Consistency 4 Consensus 5 Recovery 11

Timeouts and Health Reports Detects crashed and possibly slow nodes. A node might omit specific requests, but pass health. 12

So A Node Times Out Mark the node offline, ask another? 13

So A Node Times Out Mark the node offline, ask another? “on Sunday morning, a portion of the metadata service responses exceeded the retrieval and transmission time allowed by storage servers.” –Amazon AWS outage 14

So A Node Times Out Mark the node offline, ask another? “on Sunday morning, a portion of the metadata service responses exceeded the retrieval and transmission time allowed by storage servers.” –Amazon AWS outage Service is loaded → Timeouts → Nodes marked offline → More load on remaining servers → Repeat. 15

Avoid cascading failure: drop incoming requests. 16

Avoid cascading failure: Capacity planning! Rate-limit machine failure Heuristics for small failures can backfire in larger failures 17

Replication Store several copies of the same data! In HDFS: 3 copies by default. Read from any copy = ⇒ better read performance. 18

Replicas for Fault Tolerance Crashed, slow, or omission: read from another replica 19

Replicas for Fault Tolerance Crashed, slow, or omission: read from another replica Wrong: checksums on server side or client side, try another BitTorrrent: checksums in torrent file 20

Replicas for Fault Tolerance Crashed, slow, or omission: read from another replica Wrong: checksums on server side or client side, try another BitTorrrent: checksums in torrent file Fine for read-only. What if the data changes? 21

Consistency? Web Pages Stale pages might be fine, but don’t mix old and new in one page. If somebody shares a link, it should work. Domain Names Caching with a time limit. Inconsistent answers are ok with time limit. Banking Reorder transactions to charge customers the most fees. A transaction succeeds or fails. E-Commerce Don’t assign the same seat on a plane (or do. . . ) 22

Consistency? Web Pages Stale pages might be fine, but don’t mix old and new in one page. If somebody shares a link, it should work. Domain Names Caching with a time limit. Inconsistent answers are ok with time limit. Banking Reorder transactions to charge customers the most fees. A transaction succeeds or fails. E-Commerce Don’t assign the same seat on a plane (or do. . . ) Consistency needs depend on the application! 23

Models for Consistency Strict: Absolute ordering of all accesses by time Linearisability: There exists some linear story (like a bank statement) Sequential: Nodes read in a consistent order 24

Example Time 1 Time 2 Time 3 Time 4 Time 5 Time 6 Alice Writes A Bob Writes B Carol Reads B Reads A Dan Reads B Reads A ✗ Strict ✓ Linearisabile ✓ Sequential: Carol and Dan saw the same order. 25

Example Time 1 Time 2 Time 3 Time 4 Time 5 Time 6 Alice Writes A Bob Writes B Carol Reads B Reads A Dan Reads B Reads A Eve Reads A Reads B ✗ Strict ✗ Linearisabile ✗ Sequential: Eve saw a different order. 26

Models for Consistency Strict: Absolute ordering of all accesses by time Linearisability: There exists some linear story (like a bank statement) Sequential: Nodes read in a consistent order Causal: Causually related events are ordered correctly FIFO: Writes from same node are ordered consistently But writes from different nodes can be inconsistently ordered 27

Explicit Consistency Options ( sync ) Weak: Only when programmer says so Entry: When a lock is acquired Release: When a lock is released 28

Eventual Consistency Update one replica, let the others update lazily. Some algorithms guarantee consistency eventually, depsite some failures. 29

Consistency: Two Generals Problem Two generals leading armies on opposite sides of a city. Need to both attack or both retreat. Only communication is messengers, who might be captured. 30

Consistency: Two Generals Problem Two generals leading armies on opposite sides of a city. Need to both attack or both retreat. Only communication is messengers, who might be captured. Theorem: no protocol ensures consensus. 31

Byzantine Generals Problem Multiple generals, majority vote: message exchange has to be 3x number of lost messages. Byzantine Fault Tolerance: need 3 m + 1 nodes to agree on a bit if m nodes are faulty. Want more/proof? Take distributed systems! 32

CAP Theorem : Consistency, Availability, Partition tolerance Consistency: Nodes see same data at the same time Availability: Node failures do not prevent system operation Partition Tolerance: Network failures do not prevent system operation Conjecture: pick two of the above. Related theorem for a special case. 33

Recovery Something failed, now what? Backward Recovery Checkpointing: return to previous. Can be expensive to store. Packet retransmission (when client does not ACK). Forward recovery Plan for some loss e.g. error correcting codes Backward recovery is more common. 34

Fail! Summary Ways to fail Ways to be consistent Redundancy by replicas or recomputing 35

Fault Tolerance, Replication, and Consistency 1 Motivation: Hadoop - PowerPoint PPT Presentation

Fault Tolerance, Replication, and Consistency 1 Motivation: Hadoop Cluster 2 Motivation: Hadoop Cluster Mostly retired desktops Intel Core 2: launched in 2008 Support is gathering old servers 3 Motivation: Hadoop Cluster Mostly retired

CSci 5105 Introduction to Distributed Systems Fault Tolerance Last Time Replication and

1 Applications ? Trading Consistency for Performance Applications ? Trading Consistency for

Distributed Systems 5. Fault Tolerant Systems Fault-Tolerance - 1 Lszl Bszrmnyi

Lecture 10: Fault Tolerance Fault Tolerant Concurrent Computing The main principles of fault

Adaptability and Fault Tolerance Adaptability and Fault Tolerance Rog rio rio de Lemos de

General Principles of Fault- Tolerance Daniel Gottesman Perimeter Institute Whats Left For

Roadmap for Section 10.1 The Notion of Fault-Tolerance Fault-Tolerance Support in NTFS Volume

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

Consistency - Chapter 5 Introduce several notions of Local Consistency: arc consistency,

Challenging Malicious Inputs with Fault Tolerance Techniques Bruno Luiz Agenda Threats

Fault Tolerance at Speed Todd L. Montgomery @toddlmontgomery About me What type of Fault

Rigorous fault-tolerance thresholds Ben Reichardt UC Berkeley N gate circuit 0/1 N gate

Fault Tolerance and Robustness in Concurrent Systems Faults, errors, failures, and fault

Constraint Programming - An overview Node-consistency Arc-consistency Path-consistency

RSM & Paxos Consensus Trilogy - Episode II Replicated State Machine What is the problem?

Fault Tolerance in Message Passing Fault Tolerance in Message Passing and in Action and in

AFT: A Serverless Fault- Tolerance Shim Vikram Sreekanti , Chenggang Wu, Saurav Chhatrapati,

Fault Tolerant Distributed Main Memory Systems CompSci 590.04

Distributed Systems Fault Tolerance Paul Krzyzanowski Except as otherwise noted, the content of

PPT Presentation designed for Training Purposes Expense Reduction Solutions for Credit Card

Why Is Random Testing Effective for Partition Tolerance Bugs? Rupak Majumdar, Filip Niksic Max

Automated combination of tolerance and control flow integrity countermeasures against multiple

Cross Link Insertion for Improving Tolerance to Variations in Clock Network Synthesis Tarun

Adjustable flat layouts for Two- Failure Tolerant Storage Systems Thomas Schwarz, SJ Marquette

Fault Tolerance, Replication, and Consistency 1 Motivation: Hadoop - PowerPoint PPT Presentation

Fault Tolerance, Replication, and Consistency 1 Motivation: Hadoop Cluster 2 Motivation: Hadoop Cluster Mostly retired desktops Intel Core 2: launched in 2008 Support is gathering old servers 3 Motivation: Hadoop Cluster Mostly retired

CSci 5105 Introduction to Distributed Systems Fault Tolerance Last Time Replication and

1 Applications ? Trading Consistency for Performance Applications ? Trading Consistency for

Distributed Systems 5. Fault Tolerant Systems Fault-Tolerance - 1 Lszl Bszrmnyi

Lecture 10: Fault Tolerance Fault Tolerant Concurrent Computing The main principles of fault

Adaptability and Fault Tolerance Adaptability and Fault Tolerance Rog rio rio de Lemos de

General Principles of Fault- Tolerance Daniel Gottesman Perimeter Institute Whats Left For

Roadmap for Section 10.1 The Notion of Fault-Tolerance Fault-Tolerance Support in NTFS Volume

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

Consistency - Chapter 5 Introduce several notions of Local Consistency: arc consistency,

Challenging Malicious Inputs with Fault Tolerance Techniques Bruno Luiz Agenda Threats

Fault Tolerance at Speed Todd L. Montgomery @toddlmontgomery About me What type of Fault

Rigorous fault-tolerance thresholds Ben Reichardt UC Berkeley N gate circuit 0/1 N gate

Fault Tolerance and Robustness in Concurrent Systems Faults, errors, failures, and fault

Constraint Programming - An overview Node-consistency Arc-consistency Path-consistency

RSM &amp; Paxos Consensus Trilogy - Episode II Replicated State Machine What is the problem?

Fault Tolerance in Message Passing Fault Tolerance in Message Passing and in Action and in

AFT: A Serverless Fault- Tolerance Shim Vikram Sreekanti , Chenggang Wu, Saurav Chhatrapati,

Fault Tolerant Distributed Main Memory Systems CompSci 590.04

Distributed Systems Fault Tolerance Paul Krzyzanowski Except as otherwise noted, the content of

PPT Presentation designed for Training Purposes Expense Reduction Solutions for Credit Card

Why Is Random Testing Effective for Partition Tolerance Bugs? Rupak Majumdar, Filip Niksic Max

Automated combination of tolerance and control flow integrity countermeasures against multiple

Cross Link Insertion for Improving Tolerance to Variations in Clock Network Synthesis Tarun

Adjustable flat layouts for Two- Failure Tolerant Storage Systems Thomas Schwarz, SJ Marquette

RSM & Paxos Consensus Trilogy - Episode II Replicated State Machine What is the problem?