two-phase commit / security (start) 1 Changelog Changes made in - PowerPoint PPT Presentation

two-phase commit / security (start) 1

Changelog Changes made in this version not seen in fjrst lecture: quorum: add note that part of voting is updating other nodes to latest version 1

last time (1) RPC: remote function calls like local interface description language compiled into stubs (wrapper functions) marshalling (AKA serialization) of arguments/return value into bytes NFS: fjle operations into remote procedure calls NFS is stateless operation server uses fjle IDs — give inode number client remembers fd to fjle ID mapping nothing to recover on server failure nothing for server to forget on client failure 2

last time (2) close-to-open consistency check for updates on open, write fjle on close idea: inconsistent behavior if two processes open fjle at once okay AFS: callbacks on write rather than proactive checks …but server still needs to know about write to callback 3

fjle locking so, your program doesn’t like confmicting writes what can you do? if offmine operation, probably not much… otherwise fjle locking except it often doesn’t work on NFS, etc. 4

advisory fjle locking with fcntl int fd = open(...); struct flock lock_info = { .l_type = F_WRLCK, // write lock; RDLOCK also available // range of bytes to lock: .l_whence = SEEK_SET, l_start = 0, l_len = ... }; int rv = fcntl(fd, F_SETLKW, &lock_info); lock_info.l_type = F_UNLCK; fcntl(fd, F_SETLK, &lock_info); 5 /* set lock, waiting if needed */ if (rv == − 1) { /* handle error */ } /* now have a lock on the file */ /* unlock --- could also close() */

advisory locks fcntl is an advisory lock doesn’t stop others from accessing the fjle… unless they always try to get a lock fjrst 6

POSIX fjle locks are horrible actually two locking APIs: fcntl() and fmock() fcntl: not inherited by fork fcntl: closing any fd for fjle release lock even if you dup2’d it! fcntl: maybe sometimes works over NFS? fmock: less likely to work over NFS, etc. 7

fcntl and NFS seems to require extra state at the server typical implementation: separate lock server not a stateless protocol 8

lockfjles use a separate lockfjle instead of “real” locks e.g. convention: use NOTES.txt.lock as lock fjle lock: create a lockfjle with link() or open() with O_EXCL can’t lock: link()/open() will fail “fjle already exists” for current NFSv3: should be single RPC calls that always contact server some (old, I hope?) systems: link() atomic, open() O_EXCL not unlock: remove the lockfjle annoyance: what if program crashes, fjle not removed? 9

failure models how do machines fail?… well, lots of ways 10

two models of machine failure fail-stop failing machines stop responding or one always detects they’re broken and can ignore them Byzantine failures failing machiens do the worst possible thing 11

dealing with machine failure recover when machine comes back up does not work for Byzantine failures rely on a quorum of machines working requires 1 extra machine for fail-stop 12 requires 3 F + 1 to handle F failures with Byzantine failures

distributed transaction problem distributed transaction two machines both agree to do something or not do something even if a machine fails 13

distributed transaction example course database across many machines machine A and B: student records machine C: course records want to make sure machines agree to add students to course …even if one machine fails no confusion about student is in course 14

the centralized solution one solution: a new machine D decides what to do for machines A-C which store records machine D maintains a redo log for all machines treats them as just data storage problem: we’d like machines to work indepdently not really taking advantage of distributed why did we split student records across two machines anyways? 15

decentralized solution sketch want each machine to be responsible just for their own data only coordinate when transaction crosses machine e.g. changing course + student records only coordinate with involved machines hopefully, scales to tens or hundreds of machines typical transaction would involve 1 to 3 machines? 16

distributed transactions and failures extra tool: persistent log idea: machine remembers what happen on failure same idea as redo log: record what to do in log preview: whether trying to do/not do action …but need to handle if machine stopped while writing log 17

two-phase commit: setup every machine votes on transaction commit — do the operation (add student A to class) abort — don’t do it (something went wrong require unanimity to commit otherwise, default=abort 18

two-phase commit: phases phase 1: preparing each machine states their intention: commit/abort phase 2: fjnishing gather intentions, fjgure out whether to do/not do it 19

preparing agree to commit promise: “I will accept this transaction” promise recorded in the machine log in case it crashes agree to abort promise: “I will not accept this transaction” promise recorded in the machine log in case it crashes never ever take back agreement! to keep promise: can’t allow interfering operations e.g. agree to add student to class reserve seat in class (even though student might not be added) 20

preparing agree to commit promise: “I will accept this transaction” promise recorded in the machine log in case it crashes agree to abort promise: “I will not accept this transaction” promise recorded in the machine log in case it crashes never ever take back agreement! to keep promise: can’t allow interfering operations (even though student might not be added) 20 e.g. agree to add student to class → reserve seat in class

they can’t change their mind once they tell you fjnishing learn all machines agree to commit: commit transaction actually apply transaction (e.g. record student is in class) record decision in local log learn any machine agreed to abort: abort transaction don’t ever try to apply transaction record decision in local log unsure which? just ask everyone what they agreed to do 21

fjnishing learn all machines agree to commit: commit transaction actually apply transaction (e.g. record student is in class) record decision in local log learn any machine agreed to abort: abort transaction don’t ever try to apply transaction record decision in local log unsure which? just ask everyone what they agreed to do 21 they can’t change their mind once they tell you

two-phase commit: blocking agree to commit “add student to class”? can’t allow confmicting actions… adding student to confmicting class? removing student from the class? not leaving seat in class? …until know transaction globally committed/aborted 22

waiting forever? machine goes away, two-phase commit state is uncertain never resolve what happens solution in practice: manual intervention 23

two-phase commit: roles typical two-phase commit implementation several workers one coordinator might be same machine as a worker 24

two-phase-commit messages “will you agree to do this action?” on failure: can ask multiple times! I agree to commit/abort transaction worker records decision in log, returns same result each time I counted the votes and the result is commit/abort only commit if all votes were commit 25 coordiantor → worker: PREPARE worker → coordinator: VOTE-COMMIT or VOTE-ABORT coordinator → worker: GLOBAL-COMMMIT or GLOBAL-ABORT

reasoning about protocols: state machines very hard to reason about dist. protocol correctness each machine is in some state know what every message does in this state avoids common problem: don’t know what message does 26 typical tool: state machine

coordinator state machine (simplifjed) accumulate votes gets COMMIT workers resends vote? gets ABORT worker resends vote? after timeout resend PREPARE send COMMIT INIT receive AGREE-TO-COMMIT from all send ABORT receive any AGREE-TO-ABORT send PREPARE (ask for votes) COMMITTED ABORTED WAITING 27

two-phase commit / security (start) 1 Changelog Changes made in - PowerPoint PPT Presentation

two-phase commit / security (start) 1 Changelog Changes made in this version not seen in fjrst lecture: quorum: add note that part of voting is updating other nodes to latest version 1 last time (1) RPC: remote function calls like local

How To Make Your Commit Seen? Marta Rybczyska Akademy 2012, Tallin Commit message? What for?

DATABSE SYSTEMS CONSENSUS ON TRANSACTION COMMIT. TODS06 MADE BY- ARCHIT GARG 1 Agenda

COMMUNITY GAME RETURN TO PLAY ROADMAP Phase 1 Phase 2A Phase 2B Phase 3 Phase 4 Phase 5 WRU &

Easy Commit: A Non-blocking Two-phase Commit Protocol Suyash Gupta, Mohammad Sadoghi Dept. of

Byron Nelson High School Phase 2 GMP January 14, 2019 BNHS Phase 2 GMP Bid Date: December 11,

observability for developers How to Get from Here to There @cyen @honeycombio Christine DEV

Security / VM (start) 1 last time two-phase commit: doing operation together data is split

DNS and Security DNS and Security DNS and Security DNS and Security DNS and Security DNS and

Phase IB Supplement Phase II Submission Progressing Towards a Phase II Submission Phase IB

Two-Phase Commit (2PC) Y Site at which Xact originates is coordinator; other sites at which it

Network FS / Access Control 1 last time two-phase commit consensus: workers + coordinator agree

Phase Transition in 3SAT Yi Zhou Phase Transition in 3SAT Phase Transition in 3SAT Fine Grained

Growing Green Recyclean Program Introduction Two-Phase Project Phase One Phase Two

COMMIT: Jon Meredith March 15, 2017 COMMIT is Live! We have had some WINS : Very minimal

Assuming rational parties and watchtowers - Will a party commit fraud? - Will a watchtower get

getting comfortable in prod to improve your life in dev @cyen @honeycombio first, some

Special Purpose Hardware for Factoring: Special Purpose Hardware for Factoring: the NFS Sieving

Filtering and the matrix step in NFS Thorsten Kleinjung Laboratory for Cryptologic Algorithms

NFS over RDMA Brent Callaghan, Theresa Lingutla-Raj, Alex Chiu, Peter Staubach, Omer Asad Sun

Network File System - NFS NFS Specification NFS is a distributed file system (DFS) originally

April 3: Access Control Matrix Overview Access Control Matrix Model Boolean Expression

The problem Given an integer N that we want to factor with the number field sieve, find two

Module 17: Distributed-File Systems Background Naming and Transparency Remote File

CS 135: File Systems General Filesystem Design 1 / 22 Promises Promises Made by Disks (etc.)

two-phase commit / security (start) 1 Changelog Changes made in - PowerPoint PPT Presentation

two-phase commit / security (start) 1 Changelog Changes made in this version not seen in fjrst lecture: quorum: add note that part of voting is updating other nodes to latest version 1 last time (1) RPC: remote function calls like local

How To Make Your Commit Seen? Marta Rybczyska Akademy 2012, Tallin Commit message? What for?

DATABSE SYSTEMS CONSENSUS ON TRANSACTION COMMIT. TODS06 MADE BY- ARCHIT GARG 1 Agenda

COMMUNITY GAME RETURN TO PLAY ROADMAP Phase 1 Phase 2A Phase 2B Phase 3 Phase 4 Phase 5 WRU &amp;

Easy Commit: A Non-blocking Two-phase Commit Protocol Suyash Gupta, Mohammad Sadoghi Dept. of

Byron Nelson High School Phase 2 GMP January 14, 2019 BNHS Phase 2 GMP Bid Date: December 11,

observability for developers How to Get from Here to There @cyen @honeycombio Christine DEV

Security / VM (start) 1 last time two-phase commit: doing operation together data is split

DNS and Security DNS and Security DNS and Security DNS and Security DNS and Security DNS and

Phase IB Supplement Phase II Submission Progressing Towards a Phase II Submission Phase IB

Two-Phase Commit (2PC) Y Site at which Xact originates is coordinator; other sites at which it

Network FS / Access Control 1 last time two-phase commit consensus: workers + coordinator agree

Phase Transition in 3SAT Yi Zhou Phase Transition in 3SAT Phase Transition in 3SAT Fine Grained

Growing Green Recyclean Program Introduction Two-Phase Project Phase One Phase Two

COMMIT: Jon Meredith March 15, 2017 COMMIT is Live! We have had some WINS : Very minimal

Assuming rational parties and watchtowers - Will a party commit fraud? - Will a watchtower get

getting comfortable in prod to improve your life in dev @cyen @honeycombio first, some

Special Purpose Hardware for Factoring: Special Purpose Hardware for Factoring: the NFS Sieving

Filtering and the matrix step in NFS Thorsten Kleinjung Laboratory for Cryptologic Algorithms

NFS over RDMA Brent Callaghan, Theresa Lingutla-Raj, Alex Chiu, Peter Staubach, Omer Asad Sun

Network File System - NFS NFS Specification NFS is a distributed file system (DFS) originally

April 3: Access Control Matrix Overview Access Control Matrix Model Boolean Expression

The problem Given an integer N that we want to factor with the number field sieve, find two

Module 17: Distributed-File Systems Background Naming and Transparency Remote File

CS 135: File Systems General Filesystem Design 1 / 22 Promises Promises Made by Disks (etc.)

COMMUNITY GAME RETURN TO PLAY ROADMAP Phase 1 Phase 2A Phase 2B Phase 3 Phase 4 Phase 5 WRU &