CRaft: Building High-Performance Consensus Protocols with Accurate - PowerPoint PPT Presentation

CRaft: Building High-Performance Consensus Protocols with Accurate Clocks Feiran Wang*, Balaji Prabhakar*, Mendel Rosenblum*, Gene Zhang† *Stanford University, †eBay Inc.

Overview • CRaft: a multi-leader extension to Raft enabled by accurate clocks Existing protocol Synchronized clocks Better performance 2

State Machines • Maintain internal states • Respond to external requests • Examples: databases, storage systems State State State y ← x x ← 1 x: 2 x: 1 x: 1 y: 3 y: 1 y: 3 • How do we make them reliable? 3

Replicated State Machines Client Client x ← 1 Servers Consensus State Consensus State Consensus State Machine Machine Machine Log Log Log x: 2 x: 2 x: 2 x ← 1 y ← 1 … x ← 1 y ← 1 … x ← 1 y ← 1 … y: 3 y: 3 y: 3 • Consensus: ensures all servers agree on the same log • Continues to operate if at least a majority of servers are up Diego Ongaro and John Ousterhout. The Raft consensus algorithm. https://raft.github.io 4

The Raft Consensus Protocol • A widely used consensus protocol Client Client Client • Leader-based • Benefits: simple and efficient • Limitation: leader is the bottleneck for Leader x ← 1 y ← 1 … throughput and scalability Follower Leader Follower x ← 1 y ← 1 … x ← 1 y ← 1 … Diego Ongaro and John Ousterhout. 2014. In search of an understandable consensus algorithm. In USENIX Annual Technical Conference. 305–319. 5

Limitations with Single Leader • Single leader limits throughput and scalability Performance degrades with high load Decreasing throughput with larger cluster sizes Load increases 6

Challenge in a Multi-Leader Protocol Single leader Multiple leaders Replicate my log I have a log I have a log I have a log ok ok • Challenge: how to coordinate leaders? • Solution: agreement on time => agreement on order 7

Clock Synchronization • Achieving agreement on time is not trivial in a distributed system • Huygens: a software clock synchronization system Distribution of clock offsets between servers NTP precision: ~20ms (20 machines on CloudLab) Percentile 90th 99th 99.9th max Clock offset 7us 11us 15us 26us Huygens precision: ~20us Yilong Geng, Shiyu Liu, Zi Yin, Ashish Naik, Balaji Prabhakar, Mendel Rosenblum, and Amin Vahdat. Exploiting a natural network effect for scalable, fine-grained clock synchronization. In NSDI 2018. 81–94. 8

Our Approach: CRaft Raft CRaft (Clocks + Raft) Scalability Output A replicate log A replicated log ✓ ✓ Safety & Consistency Same guarantee as Raft ✓ ✓ Practicability A simple add-on to Raft; easy to implement 9

The CRaft Consensus Protocol

CRaft Overview Client Client Client Merged log Merged log Merged log Leader Follower Follower Group 1 Follower Follower Leader Group 2 Follower Follower Leader Group 3 Server 1 Server 2 Server 3 11

Life of a Request Replicated on a majority of servers • Safe and durable • Client Replicate Commit Execute Merge log log log Leader Follower Follower Follower Leader Follower Follower Follower Leader State Machine State Machine State Machine Server 1 Server 2 Server 3 12

Timestamp Management Log Merged log Safe time = 20 Leader 2 4 5 index 1 3 timestamp 1 4 6 17 18 Follower x ← 1 y ← 1 y ← x x ← 2 x ← 5 command Follower Server • CRaft guarantees monotonically increasing timestamps in each log • Safe time: indicates how up-to-date a log is 13

Safe Times How up-to-date is this log? Now Log … 1 4 6 17 18 23 25 Safe time = 20 Current entries: No entries come in timestamps <= with a timestamp safe time smaller than safe time 14

Merging index 1 2 3 4 5 Log 1 1 4 6 17 18 ts = 18 merged log … Log 2 2 5 12 ts = 12 5 6 8 10 12 Log 3 ts = 19 3 8 10 15 • Merge up to the smallest safe time • CRaft ensures merged log in monotonically increasing timestamp order 15

Optimization: Fast Path Replicate Commit Merge Execute Fast path: respond Normal path: respond before execution after execution • Fast path: respond to clients early for certain write operations 16

Evaluation

Experiment Setup • Implementation • Based on HashiCorp Raft – a popular and well-optimized implementation • Environment • CloudLab, single data center • Workload • In-memory key-value store • Multiple clients send get or set requests concurrently 18

Throughput vs Cluster Size • Up to ~2x read and ~2.5x write throughput compared to Raft 19

Latency vs Throughput Average latency vs throughput (3 servers) 99th percentile latency vs throughput (3 servers) Performance gain under high load Load increases Load increases • CRaft improves throughput and latency under high load 20

Performance vs Number of Clients Throughput Average Latency 2x Latency is bounded by clock difference 2x 2x • NTP precision: ~20ms, Huygens: ~20us 21

Conclusion Better performance Existing systems Stronger consistency Synchronized clocks • Accurate clocks enable better performance and/or consistency 22

Thank you!

CRaft: Building High-Performance Consensus Protocols with Accurate - PowerPoint PPT Presentation

CRaft: Building High-Performance Consensus Protocols with Accurate Clocks Feiran Wang, Balaji Prabhakar, Mendel Rosenblum, Gene Zhang Stanford University, eBay Inc. Overview CRaft: a multi-leader extension to Raft enabled by

Consensus Building Consensus is Consensus is finding an acceptable proposal that all members

Consensus and Dissent or: Meta - Consensus Consensus about what we have consensus

Prior Work Consensus Consensus Reliable BGP Consensus Reliable BGP Consensus Routing

Prior Work Consensus Consensus Reliable BGP Consensus Reliable BGP Consensus Routing

San Diego Craft Brewing Industry Marc M. Martin Vice President of Beer Karl Strauss Brewing Co.

Craft Beer and Beyond Moving from Craft Beer to Craft Beverage Production Overview About

Interim report January March 2014 Photo by Craft Great succes in Sochi Craft and Auclair

CONSENSUS Fall 2012 Ken Birman Consensus a classic problem Consensus abstraction underlies

FLP Impossibility & Weakest Failure Detector Consensus Protocols in Theory Philip Daian -

BC CRAFT BREWERS GUILD ORGANIZATIONAL OVERVIEW FEBRUARY 2019 ABOUT US The BC Craft Brewers

Teaching: Art, Craft or Science Shanker Institute Symposium hargreaves@bc.edu April 2019 .. .

Membership of the consensus group Membership of the consensus group Members of the group were

Distributed Algorithms (PhD course) Consensus SARDAR MUHAMMAD SULAMAN Consensus The

When Aeron Met Raft Martin Thompson - @mjpt777 What does Consensus mean? consensus

Distributed Systems CS425/ECE428 03/06/2020 Todays agenda Consensus Consensus in

Reasoning about Consensus Protocols Ilya Sergey ilyasergey.net Consensus Common meaning :

Clock Arithmetic 7 January 2019 OSU CSE 1 Mathematical Modulo (mod) The value of a

HOTRG study on partition function zeros in the p-state clock model Dong-Hee Kim Dept. Physics

Rickard Ewetz Cheng-Kok Koh ECE Department, Purdue University Introduction Clock tree Source

An object oriented model for the representation of temporal data in the Integra framework James

Clock-Driven Scheduling (in-depth) Precompute static schedule off-line Task Scheduler: (e.g.

High Performance and SoC Timing verification challenges Tau 2015 KS Ramesh Andalib Khan Fritz

Design for Testability 1 Basic Concept Design for testability (DFT) Design techniques

STAR CLUSTERS Lecture 1 Introduction Nora Ltzgendorf (ESA) I ts a school, so ? ? ? ? ?

Sambuz

Useful Links

Newsletter

Mail Us

CRaft: Building High-Performance Consensus Protocols with Accurate - PowerPoint PPT Presentation

CRaft: Building High-Performance Consensus Protocols with Accurate Clocks Feiran Wang*, Balaji Prabhakar*, Mendel Rosenblum*, Gene Zhang *Stanford University, eBay Inc. Overview CRaft: a multi-leader extension to Raft enabled by

Consensus Building Consensus is Consensus is finding an acceptable proposal that all members

Consensus and Dissent or: Meta - Consensus Consensus about what we have consensus

Prior Work Consensus Consensus Reliable BGP Consensus Reliable BGP Consensus Routing

Prior Work Consensus Consensus Reliable BGP Consensus Reliable BGP Consensus Routing

San Diego Craft Brewing Industry Marc M. Martin Vice President of Beer Karl Strauss Brewing Co.

Craft Beer and Beyond Moving from Craft Beer to Craft Beverage Production Overview About

Interim report January March 2014 Photo by Craft Great succes in Sochi Craft and Auclair

CONSENSUS Fall 2012 Ken Birman Consensus a classic problem Consensus abstraction underlies

FLP Impossibility &amp; Weakest Failure Detector Consensus Protocols in Theory Philip Daian -

BC CRAFT BREWERS GUILD ORGANIZATIONAL OVERVIEW FEBRUARY 2019 ABOUT US The BC Craft Brewers

Teaching: Art, Craft or Science Shanker Institute Symposium hargreaves@bc.edu April 2019 .. .

Membership of the consensus group Membership of the consensus group Members of the group were

Distributed Algorithms (PhD course) Consensus SARDAR MUHAMMAD SULAMAN Consensus The

When Aeron Met Raft Martin Thompson - @mjpt777 What does Consensus mean? consensus

Distributed Systems CS425/ECE428 03/06/2020 Todays agenda Consensus Consensus in

Reasoning about Consensus Protocols Ilya Sergey ilyasergey.net Consensus Common meaning :

Clock Arithmetic 7 January 2019 OSU CSE 1 Mathematical Modulo (mod) The value of a

HOTRG study on partition function zeros in the p-state clock model Dong-Hee Kim Dept. Physics

Rickard Ewetz Cheng-Kok Koh ECE Department, Purdue University Introduction Clock tree Source

An object oriented model for the representation of temporal data in the Integra framework James

Clock-Driven Scheduling (in-depth) Precompute static schedule off-line Task Scheduler: (e.g.

High Performance and SoC Timing verification challenges Tau 2015 KS Ramesh Andalib Khan Fritz

Design for Testability 1 Basic Concept Design for testability (DFT) Design techniques

STAR CLUSTERS Lecture 1 Introduction Nora Ltzgendorf (ESA) I ts a school, so ? ? ? ? ?

Sambuz

Useful Links

Newsletter

Mail Us

CRaft: Building High-Performance Consensus Protocols with Accurate Clocks Feiran Wang, Balaji Prabhakar, Mendel Rosenblum, Gene Zhang Stanford University, eBay Inc. Overview CRaft: a multi-leader extension to Raft enabled by

FLP Impossibility & Weakest Failure Detector Consensus Protocols in Theory Philip Daian -