SLIDE 1

Intro to Database Systems (15-445/15-645), Fall 2019
Andy Pavlo, Computer Science, Carnegie Mellon University

Lecture #23: Distributed OLTP Databases

SLIDE 2

ADMINISTRIVIA

Homework #5: Monday Dec 3rd @ 11:59pm
Project #4: Monday Dec 10th @ 11:59pm
Extra Credit: Wednesday Dec 10th @ 11:59pm
Final Exam: Monday Dec 9th @ 5:30pm

SLIDE 3

LAST CLASS

System Architectures
→ Shared-Memory, Shared-Disk, Shared-Nothing

Partitioning/Sharding
→ Hash, Range, Round Robin

Transaction Coordination
→ Centralized vs. Decentralized

SLIDE 4

OLTP VS. OLAP

On-line Transaction Processing (OLTP):
→ Short-lived read/write txns.
→ Small footprint.
→ Repetitive operations.

On-line Analytical Processing (OLAP):
→ Long-running, read-only queries.
→ Complex joins.
→ Exploratory queries.

SLIDES 5-7

DECENTRALIZED COORDINATOR

[Diagram: an application server executes a txn against partitions P1-P4. It sends a Begin Request, issues queries to the partitions holding the data it needs, and finally sends a Commit Request asking whether it is safe to commit.]

SLIDE 8

OBSERVATION

We have not discussed how to ensure that all nodes agree to commit a txn, and then how to make sure the txn actually commits once we decide that it should.
→ What happens if a node fails?
→ What happens if our messages show up late?
→ What happens if we don't wait for every node to agree?

SLIDE 9

IMPORTANT ASSUMPTION

We can assume that all nodes in a distributed DBMS are well-behaved and under the same administrative domain.
→ If we tell a node to commit a txn, then it will commit the txn (if there is no failure).

If you do not trust the other nodes in a distributed DBMS, then you need to use a Byzantine Fault Tolerant protocol for txns (e.g., blockchain).

SLIDE 10

TODAY'S AGENDA

→ Atomic Commit Protocols
→ Replication
→ Consistency Issues (CAP)
→ Federated Databases

SLIDE 11

ATOMIC COMMIT PROTOCOL

When a multi-node txn finishes, the DBMS needs to ask all the nodes involved whether it is safe to commit. Examples:
→ Two-Phase Commit
→ Three-Phase Commit (not used)
→ Paxos
→ Raft
→ ZAB (Apache Zookeeper)
→ Viewstamped Replication

SLIDES 12-17

TWO-PHASE COMMIT (SUCCESS)

[Diagram: the application server sends a Commit Request to the coordinator (Node 1). Phase 1: the coordinator sends Prepare to the participants (Node 2, Node 3), and each votes OK. Phase 2: the coordinator sends Commit, each participant replies OK, and the coordinator returns Success! to the application server.]
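To make the two phases concrete, here is a minimal sketch of the coordinator's side of the protocol in Python. The participant stubs (prepare/commit/abort) are illustrative assumptions, not an API from the lecture, and a real coordinator would also log its decision to non-volatile storage before starting Phase 2 (see Slide 31).

    # Minimal 2PC coordinator sketch (hypothetical RPC stubs, not a real API).
    # Each participant exposes prepare()/commit()/abort(); prepare() returns
    # True for a YES vote and raises (or returns False) on failure/timeout.

    def two_phase_commit(txn_id, participants):
        # Phase 1: ask every participant whether it can commit.
        try:
            votes = [p.prepare(txn_id) for p in participants]
        except Exception:
            votes = [False]              # a crash or timeout counts as a NO vote

        if all(votes):
            # A real DBMS force-logs the COMMIT decision here before Phase 2.
            for p in participants:
                p.commit(txn_id)         # Phase 2: everyone voted YES
            return "COMMIT"
        for p in participants:
            p.abort(txn_id)              # Phase 2: at least one NO vote
        return "ABORT"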

SLIDES 18-23

TWO-PHASE COMMIT (ABORT)

[Diagram: the application server sends a Commit Request to the coordinator (Node 1). Phase 1: the coordinator sends Prepare, but one participant votes ABORT!. The coordinator immediately reports Aborted to the application server, then runs Phase 2: it sends Abort to the participants, which acknowledge with OK.]

SLIDE 24

2PC OPTIMIZATIONS

Early Prepare Voting
→ If you send a query to a remote node that you know will be the last one you execute there, then that node will also return its vote for the prepare phase with the query result.

Early Acknowledgement After Prepare
→ If all nodes vote to commit a txn, the coordinator can send the client an acknowledgement that the txn was successful before the commit phase finishes.

SLIDES 25-30

EARLY ACKNOWLEDGEMENT

[Diagram: the application server sends a Commit Request to the coordinator (Node 1). Phase 1: the coordinator sends Prepare to the participants (Node 2, Node 3). As soon as both vote OK, the coordinator returns Success! to the application server, and only then runs Phase 2 (Commit), collecting the participants' OK acknowledgements.]
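Under the same hypothetical stubs as the earlier 2PC sketch, the only change is where the client acknowledgement happens: once every participant has voted YES, the outcome can no longer be an abort, so it is safe to answer the client before Phase 2 finishes. The client.ack()/client.fail() hooks are assumed notification callbacks.

    # Early-acknowledgement variant of the 2PC coordinator sketch above.

    def two_phase_commit_early_ack(txn_id, participants, client):
        if all(p.prepare(txn_id) for p in participants):   # Phase 1
            client.ack(txn_id)        # safe: with all YES votes in, the
                                      # outcome is guaranteed to be COMMIT
            for p in participants:
                p.commit(txn_id)      # Phase 2 can finish in the background
        else:
            for p in participants:
                p.abort(txn_id)
            client.fail(txn_id)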

SLIDE 31

TWO-PHASE COMMIT

Each node records the outcome of each phase in a non-volatile storage log.

What happens if the coordinator crashes?
→ Participants must decide what to do.

What happens if a participant crashes?
→ The coordinator assumes that it responded with an abort if it has not sent an acknowledgement yet.

SLIDE 32

PAXOS

Consensus protocol where a coordinator proposes an outcome (e.g., commit or abort) and then the participants vote on whether that outcome should succeed.

Does not block if a majority of participants are available, and has provably minimal message delays in the best case.

SLIDES 33-39

PAXOS

[Diagram: the application server sends a Commit Request to the proposer (Node 1). The proposer sends Propose to the acceptors (Nodes 2-4). Node 3 has failed (X), but the other two acceptors reply Agree, which is a majority. The proposer then sends Commit, the live acceptors reply Accept, and the proposer returns Success! to the application server.]

SLIDES 40-48

PAXOS

[Diagram: timeline with two competing proposers and a set of acceptors. Proposer 1 sends Propose(n) and the acceptors reply Agree(n). Proposer 2 then sends Propose(n+1), so when Proposer 1 later sends Commit(n) the acceptors reply Reject(n, n+1). The acceptors Agree(n+1), Proposer 2 sends Commit(n+1), and the acceptors reply Accept(n+1).]
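A minimal sketch of an acceptor's state machine helps explain the Reject above. It uses the slide's message names (Propose/Agree/Commit/Accept/Reject) rather than the standard Prepare/Promise/Accept terminology, and the class and field names are assumptions for illustration, not the lecture's code.

    # Single-decree Paxos acceptor sketch. Ballot numbers n must be
    # globally unique and totally ordered across proposers.

    class Acceptor:
        def __init__(self):
            self.promised_n = -1        # highest ballot we agreed to honor
            self.accepted_n = -1        # ballot of the last value we accepted
            self.accepted_value = None

        def on_propose(self, n):
            # Agree only to ballots newer than anything already promised;
            # this is why Commit(n) is rejected once Propose(n+1) arrives.
            if n > self.promised_n:
                self.promised_n = n
                # Report any previously accepted value so the new proposer
                # must carry it forward instead of choosing its own.
                return ("AGREE", self.accepted_n, self.accepted_value)
            return ("REJECT", self.promised_n, None)

        def on_commit(self, n, value):
            if n >= self.promised_n:    # still the newest ballot we know of
                self.promised_n = n
                self.accepted_n = n
                self.accepted_value = value
                return "ACCEPT"
            return "REJECT"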

SLIDE 49

MULTI-PAXOS

If the system elects a single leader that is in charge of proposing changes for some period of time, then it can skip the Propose phase.
→ Fall back to full Paxos whenever there is a failure.

The system periodically renews who the leader is using another Paxos round.

SLIDE 50

2PC VS. PAXOS

Two-Phase Commit
→ Blocks if the coordinator fails after the prepare message is sent, until the coordinator recovers.

Paxos
→ Non-blocking if a majority of participants are alive, provided there is a sufficiently long period without further failures.

SLIDE 51

REPLICATION

The DBMS can replicate data across redundant nodes to increase availability.

Design Decisions:
→ Replica Configuration
→ Propagation Scheme
→ Propagation Timing
→ Update Method

SLIDE 52

REPLICA CONFIGURATIONS

Approach #1: Master-Replica
→ All updates go to a designated master for each object.
→ The master propagates updates to its replicas without an atomic commit protocol.
→ Read-only txns may be allowed to access replicas.
→ If the master goes down, then hold an election to select a new master.

Approach #2: Multi-Master
→ Txns can update data objects at any replica.
→ Replicas must synchronize with each other using an atomic commit protocol.

SLIDE 53

REPLICA CONFIGURATIONS

[Diagram: Master-Replica: writes go to the master copy of P1, which propagates updates to its replicas; reads can be served by the master or the replicas. Multi-Master: Node 1 and Node 2 each hold a writable copy of P1, and each node accepts both writes and reads.]

SLIDE 54

K-SAFETY

K-safety is a threshold for determining the fault tolerance of the replicated database.

The value K represents the number of replicas per data object that must always be available.

If the number of replicas goes below this threshold, then the DBMS halts execution and takes itself offline.
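As a toy illustration of the rule (the names and structure are assumed for this sketch, not taken from the lecture):

    # Toy K-safety check: if any object has fewer than K live replicas,
    # the DBMS refuses to keep running.
    K = 2

    def enforce_k_safety(live_replicas):     # {object_id: live replica count}
        if any(count < K for count in live_replicas.values()):
            raise SystemExit("K-safety violated: DBMS halting, going offline")

    enforce_k_safety({"P1": 3, "P2": 2})     # OK: both objects have >= K replicas
    # enforce_k_safety({"P1": 1, "P2": 2})   # would halt: P1 is below K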

SLIDE 55

PROPAGATION SCHEME

When a txn commits on a replicated database, the DBMS decides whether it must wait for that txn's changes to propagate to other nodes before it can send the acknowledgement to the application.

Propagation levels:
→ Synchronous (Strong Consistency)
→ Asynchronous (Eventual Consistency)

SLIDES 56-58

PROPAGATION SCHEME

Approach #1: Synchronous
→ The master sends updates to replicas and then waits for them to acknowledge that they fully applied (i.e., logged) the changes.

Approach #2: Asynchronous
→ The master immediately returns the acknowledgement to the client without waiting for replicas to apply the changes.

[Diagram: synchronous: the master receives Commit?, sends Flush? to the replicas, waits for their Acks, then flushes and acknowledges the client. Asynchronous: the master acknowledges the client's Commit? immediately and sends Flush? to the replicas in the background.]
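A sketch of the difference in commit paths, with assumed replica.flush()/client.ack() hooks standing in for the real log-shipping machinery:

    # Synchronous vs. asynchronous propagation (illustrative hooks only).

    def commit_synchronous(txn, replicas, client):
        # Strong consistency: wait until every replica has logged the changes.
        acks = [r.flush(txn.log_records) for r in replicas]
        if all(acks):
            client.ack(txn.id)

    def commit_asynchronous(txn, replicas, client):
        # Eventual consistency: acknowledge first, propagate in the background.
        client.ack(txn.id)
        for r in replicas:
            r.flush_async(txn.log_records)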

SLIDE 59

PROPAGATION TIMING

Approach #1: Continuous
→ The DBMS sends log messages immediately as it generates them.
→ Also needs to send a commit/abort message.

Approach #2: On Commit
→ The DBMS only sends the log messages for a txn to the replicas once the txn commits.
→ Does not waste time sending log records for aborted txns.
→ Assumes that a txn's log records fit entirely in memory.

SLIDE 60

ACTIVE VS. PASSIVE

Approach #1: Active-Active
→ A txn executes at each replica independently.
→ Need to check at the end whether the txn ends up with the same result at each replica.

Approach #2: Active-Passive
→ Each txn executes at a single location and propagates the changes to the replicas.
→ Can either do physical or logical replication.
→ Not the same as master-replica vs. multi-master.

SLIDE 61

CAP THEOREM

Proposed by Eric Brewer: it is impossible for a distributed system to always be:
→ Consistent
→ Always Available
→ Network Partition Tolerant

Proved in 2002. Pick two! Sort of…

SLIDE 62

CAP THEOREM

C (Consistency): Linearizability.
A (Availability): All up nodes can satisfy all requests.
P (Partition Tolerant): Still operate correctly despite message loss.

All three at once: impossible.

SLIDES 63-68

CAP CONSISTENCY

[Diagram: an application server sends Set A=2 to the master, which holds A=1, B=8. The master applies A=2, propagates it over the network to the replica, and ACKs the client only once the replica also has A=2. A second application server then issues Read A against the replica and gets A=2.]

If the master says the txn committed, then it should be immediately visible on replicas.

SLIDES 69-73

CAP AVAILABILITY

[Diagram: the master becomes unreachable (X) while both nodes hold A=1, B=8. The application servers keep reading from the replica: Read B returns B=8 and Read A returns A=1. All up nodes continue to satisfy requests.]

SLIDES 74-79

CAP PARTITION TOLERANCE

[Diagram: a network partition splits the two nodes, and each side operates as a master. One application server executes Set A=2 on the first master while the other executes Set A=3 on the second; both receive an ACK. The copies have now diverged (A=2 vs. A=3) and must somehow be reconciled when the network heals.]

SLIDE 80

CAP FOR OLTP DBMSs

How a DBMS handles failures determines which elements of the CAP theorem it supports.

Traditional/NewSQL DBMSs
→ Stop allowing updates until a majority of nodes are reconnected.

NoSQL DBMSs
→ Provide mechanisms to resolve conflicts after nodes are reconnected.

SLIDE 81

OBSERVATION

We have assumed that the nodes in our distributed systems are running the same DBMS software. But organizations often run many different DBMSs in their applications. It would be nice if we could have a single interface for all our data.

SLIDE 82

FEDERATED DATABASES

Distributed architecture that connects together multiple DBMSs into a single logical system. A query can access data at any location.

This is hard, and nobody does it well:
→ Different data models, query languages, limitations.
→ No easy way to optimize queries.
→ Lots of data copying (bad).

SLIDES 83-84

FEDERATED DATABASE EXAMPLE

[Diagram: two designs. (1) Middleware: the application server sends query requests to a middleware layer, which uses connectors to dispatch sub-queries to the back-end DBMSs. (2) Foreign Data Wrappers: the application server queries a single DBMS directly, and that DBMS uses foreign data wrappers (connectors) to pull data from the other back-end DBMSs.]
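As a toy illustration of the middleware design, here is a sketch where two sqlite files stand in for two independent back-end DBMSs. The routing-by-table scheme and all names are assumptions; a real federation layer would parse queries, consult a global catalog, and handle cross-backend joins.

    import sqlite3

    # Toy federated "middleware": route each query to the back-end that
    # owns the table it touches.

    class FederatedMiddleware:
        def __init__(self, backends):
            # backends: table name -> location of the DBMS that owns it
            self.conns = {t: sqlite3.connect(db) for t, db in backends.items()}

        def query(self, table, sql, params=()):
            # The caller names the owning table; a real system would infer
            # this by parsing the SQL.
            return self.conns[table].execute(sql, params).fetchall()

    mw = FederatedMiddleware({"orders": "sales.db", "users": "accounts.db"})
    # mw.query("orders", "SELECT * FROM orders WHERE total > ?", (100,))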

SLIDE 85

CONCLUSION

We assumed that the nodes in our distributed DBMS are friendly. Blockchain databases assume that the nodes are adversarial, which means you must use different protocols to commit transactions.

SLIDE 86

NEXT CLASS

Distributed OLAP Systems