From Distributed Logs to Database Replication Dr. Samuel Benz How - PowerPoint PPT Presentation

From Distributed Logs to Database Replication Dr. Samuel Benz

How to achieve scalability, fault tolerance and consistency in distributed systems?

Distributed applications in theory. . .

. . . in practice

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Why do we see such architectures? Distributed Stateful Components state (vs. stateless) shared > 1 client (isolation) mutable > 0 writer (concurrency) distributed > 1 DB (consistency) geographically > 50 km (latency) 5

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Reliable and Scalable Stateful Services Problem 1 Scalability: • Size: Internet scale services • Location: Access latency • Administration: Multiple organizational units 2 Fault-Tolerance Solution 1 Distributed Data: Replication 2 Distributed Computing: Coordination 6

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Different Types of Replication 7

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion State Machine Replication → Fault-tolerance 8

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion State Machine Replication → Consistency 9

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Partitioning → Scalability 10

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Consistent Partitioning 1 The system ensures strong consistency within partitions and ”best-effort” across partitions. 2 The system ensures strong consistency using 2PC across partitions. 3 The system orders commands before executing them or checks their order after executing the commands ( Atomic Multicast ). 11

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Simple Coordination Problem A B 12

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Consensus Problem Fundamental Result No algorithm can solve consensus in an asynchronous system despite a single crash. FLP impossibility result (after Fischer, Lynch, and Paterson, 1985) 13

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Consensus and Atomic Broadcast In a crash-stop failure model consensus is defined as follows: 1 Termination: Every correct process eventually decides. 2 Agreement: No two correct processes decide differently. 3 Uniform integrity: Every process decides at most once. 4 Uniform validity: If a process decides v , then v was proposed by some process. Additionally Atomic Broadcast : 5 Total order: If two correct processes p and q deliver two messages m and m ′ , then p delivers m before m ′ if and only if q delivers m before m ′ . [Chandra et al . Unreliable failure detectors for reliable distributed systems. 1996.] 14

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion 15

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Distributed Log 16

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Distributed Transactions 17

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Distributed Data Structures 18

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Kafka Consistency 19

Introduction Reliable and Scalable Services Distributed Consensus Distributed Log Conclusion Kafka Scalability 20

From Distributed Logs to Database Replication Dr. Samuel Benz How - PowerPoint PPT Presentation

From Distributed Logs to Database Replication Dr. Samuel Benz How to achieve scalability, fault tolerance and consistency in distributed systems? Distributed applications in theory. . . . . . in practice Introduction Reliable and Scalable

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

Logs on Logs on Logs No More Append Atomic & Remap Eric Mackay Venkatesh Srinivas Basics

MySQL Replication Tutorial Mats Kindahl Senior Software Engineer Replication Technology Lars

August 23, 2012 Data Replication/ETL: Terms Data Replication : Data Replication is the process of

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

I Logs Apache Kafka, Stream Processing, and Real-time Data Jay Kreps The Plan 1. What is Data

Todays Topics - Chapter 15 Slide 1 performance enhancement Replication Replication of

Distributed Databases Distributed database management system A distributed database (DDB) is

New features in MySQL Replication Lars Thalmann, Development Manager, Replication & Backup

THE UNBUNDLED DATABASE Leveraging the unbundled database via distributed logs and stream

Consistency and Replication Chi Zhang czhang@cs.fiu.edu Object Replication (1) Organization of

DISTRIBUTED SYSTEMS II REPLICATION CNT. II The Quorum consensus method for Replication To

Distributed Databases 1 19.1 Distributed Database System A distributed database system

CS4224/CS5424 Lecture 1 Introduction Distributed Database Systems A distributed database is a

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Computer Networks M Group issues and policies Antonio Corradi Academic year 2015/2016 Groups

CS5412 / LECTURE 10 Ken Birman REPLICATION AND CONSISTENCY Spring, 2019

Distributed Systems: Group Communication Julia Proft and Utkarsh Mall The Process Group Approach

Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer

Solving Atomic Broadcast Eden : a Consensus Based Group Communication System p.1/ ?? Solving

Verteilte Systeme (Distributed Systems) Karl M. Gschka Karl.Goeschka@tuwien.ac.at

Distributed Systems (ICE 601) Replication & Consistency - Part 1 Dongman Lee ICU Class

Important Lessons Lamport & vector clocks both give a logical timestamps Total

From Distributed Logs to Database Replication Dr. Samuel Benz How - PowerPoint PPT Presentation

From Distributed Logs to Database Replication Dr. Samuel Benz How to achieve scalability, fault tolerance and consistency in distributed systems? Distributed applications in theory. . . . . . in practice Introduction Reliable and Scalable

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

Logs on Logs on Logs No More Append Atomic &amp; Remap Eric Mackay Venkatesh Srinivas Basics

MySQL Replication Tutorial Mats Kindahl Senior Software Engineer Replication Technology Lars

August 23, 2012 Data Replication/ETL: Terms Data Replication : Data Replication is the process of

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

I Logs Apache Kafka, Stream Processing, and Real-time Data Jay Kreps The Plan 1. What is Data

Todays Topics - Chapter 15 Slide 1 performance enhancement Replication Replication of

Distributed Databases Distributed database management system A distributed database (DDB) is

New features in MySQL Replication Lars Thalmann, Development Manager, Replication &amp; Backup

THE UNBUNDLED DATABASE Leveraging the unbundled database via distributed logs and stream

Consistency and Replication Chi Zhang czhang@cs.fiu.edu Object Replication (1) Organization of

DISTRIBUTED SYSTEMS II REPLICATION CNT. II The Quorum consensus method for Replication To

Distributed Databases 1 19.1 Distributed Database System A distributed database system

CS4224/CS5424 Lecture 1 Introduction Distributed Database Systems A distributed database is a

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Computer Networks M Group issues and policies Antonio Corradi Academic year 2015/2016 Groups

CS5412 / LECTURE 10 Ken Birman REPLICATION AND CONSISTENCY Spring, 2019

Distributed Systems: Group Communication Julia Proft and Utkarsh Mall The Process Group Approach

Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer

Solving Atomic Broadcast Eden : a Consensus Based Group Communication System p.1/ ?? Solving

Verteilte Systeme (Distributed Systems) Karl M. Gschka Karl.Goeschka@tuwien.ac.at

Distributed Systems (ICE 601) Replication &amp; Consistency - Part 1 Dongman Lee ICU Class

Important Lessons Lamport &amp; vector clocks both give a logical timestamps Total

Logs on Logs on Logs No More Append Atomic & Remap Eric Mackay Venkatesh Srinivas Basics

New features in MySQL Replication Lars Thalmann, Development Manager, Replication & Backup

Distributed Systems (ICE 601) Replication & Consistency - Part 1 Dongman Lee ICU Class

Important Lessons Lamport & vector clocks both give a logical timestamps Total