CS411 Database Systems Concurrency Control 16: Final Review - PDF document

CS411 Database Systems Concurrency Control 16: Final Review Session Kazuhiro Minami Concurrency Control – Basic concepts Basic Concepts on Locks • What is a lock? • What is a transaction? • What is a lock table? What kind of information is stored • Which actions do we consider in a transaction? there? • How to represent a transaction? • What is the consistency of transactions? • What is a schedule? • What is the legality of transactions? • What is the goal of concurrency control? • What is the notations for actions of locking and • What is a serial schedule? unlocking? • What is a serializable schedule? • What is the job of the locking scheduler? • What is a conflict-serializable schedule? • What is the two-phase locking (2PL) condition? What • What are conflicting swaps? type of serializable schedules are produced with this • How to determine whether a schedule is conflict- approach? serializable? 4

Concepts on Timestamp-based Two-phase locked schedule Concurrency Control T 1 : r 1 (A);w 1 (B); T 2 : r 2 (A);w 2 (A);w 2 (B); • What is a timestamp? 1. Make T 1 and T 2 consistent, but not 2PL by adding lock and unlock actions 2. Give a legal, but not serializable schedule of T 1 and T 2 in question 1 • When is a timestamp assigned to a transaction? 3. Make T 1 and T 2 consistent and 2PL by adding lock and unlock actions • What’s the notation for transaction T’s timestamp? 4. Give a legal schedule of T 1 and T 2 in question 3 • What information do we maintain for each database element X? • What type of serializable schedules are produced with this approach? • How the timestamp-based approach solve the problem of “dirty” read? 5 6 How to detect T’s reading X too late? Assumed Serial Schedule • Conflict serializable schedule that is equivalent to a serial schedule in which the timestamp order of U write X transactions is the order to execute them T read X Actual schedule T U V T start U start starts starts starts Serial schedule T U V starts starts starts

How can you detect T’s writing to late? Prevention of Dirty Read • How to prevent transaction T from reading data written by uncommitted transaction U? • We want T to wait until U commits U read X T write X U write X T read X T start U start U T U start start aborts 9 10 Thomas Write Rule Another Problem with Dirty Data • Why can we skip transaction T’s write on element X, • Thomas write rule: T’s write can be skipped if which is already modified by transaction U? TS(T) < WT(X) • What if there is another transaction V starting after T • But, we want T to wait until U commits but before U? • What if V starts after U? U writes X U writes X T writes X T writes X We need to restore the T U T previous value T U U aborts for X, but… 11 start start commits start start

Concurrency Control by Timestamps • Tell me what happens as each executes – st 1 ; r 1 (A); st 2 ; w 2 (B); r 2 (A); w 1 (B) Concurrency Control T2 B T1 A 100 200 RT=0 RT=0 by Validation WT=0 WT=0 r 1 (A) RT=100 w 2 (B) WT=200 r 2 (A) RT=200 w 1 (B) OK, but needs to wait until T2 commits Validation-based Scheduler Concurrency Control by Validation • Another type of optimistic concurrency control • Keep track of each transaction T’s • Maintain a record of what active transactions are – Read set RS(T): the set of elements T read doing – Write set WS(T): the set of elements T write • Just before a transaction starts to write, it goes • Execute transactions in three phases: through a “validation phase” 1. Read . T reads all the elements in RS(T) • If a there is a risk of physically unrealizable 2. Validate . Validate T by comparing its RS(T) an behavior, the transaction is rolled back WS(T) with those in other transactions. If the validation fails, T is rolled back Read actions Write actions 3. Write . T writes its values for the elements in WS(T) Validation Transaction T

Assumed Serial Schedule for Validation Potential Violation: Read too Early • We may think of each transaction that • Transactions T and U such that successfully validates as executing at the moment 1. U has validated that it validates 2. START(T) < FIN(U) 3. RS(T) ∩ WS(U) is not empty Actual schedule T U V validates T reads X validates validates U writes X Serial schedule T U V validates validates validates U start T start U validated T validating Another Potential Violation: Write too Early Validation Rules • Two transactions T and U such that To validate a transaction T, 1. Check that RS(T) ∩ WS(U) is an empty set for any – U is in VAL v alidated U and START(T) < FIN(U) – VAL(T) < FIN(U) Check that WS(T) ∩ WS(U) is an empty set for any 2. – WS(T) ∩ WS(U) is not empty validated U that did not finish before T validated, i.e., if VAL(T) < FIN(U) T writes X U writes X T validating U finish U validated

Example Problem Storage In the following sequence of events, tell what happens when • How to store records in a block? each sequence is processed by a validation-based scheduler. – Fixed-length record R1(A,B); R2(B,C); V1; R3(C,D); V3; W1(A); V2; W2(A); W3(B); – Variable-length record RS={A,B} RS={B,C} • What extra information do we need to store a record in a WS={A} WS={A} T1 T2 block? • What information is stored in a block header? • What information is stored in a record header? • In which situations do we create an overflow block? T3 RS={C,D} WS={B} Indexing Query execution • Do you know which types of index can we use in different situations? • What are cost parameters? – Dense/Sparse – Secondary (unclustered) • Can you describe how different join – Multi-level index algorithms work? • B-tree – Do you understand the structure of a B-tree? – Nested loop join; sort-based join; hash-based join • Root node; internal node; leaf node • Do you know the memory requirement of – How to find/insert/delete a record? each algorithm? • Dynamic hash table – Do you understand the structure of a dynamic hash table? • Do you know in which situation we can use a • Pointer array, data bucket, nub, etc. simple-sort join or sort-merge-join? – How to find/insert/delete a record? – What are differences from static hash tables? – What are different between extensible and linear hash tables?

Query Optimization Logging & Recovery • What’s the correctness principle? • Do you know how to apply algebraic laws to • What are primitive four operations of transactions? get a better logical plan? • How does undo logging work? – Log records • Do you know how to perform a cost-based – Undo-logging rules optimization on a logical plan? – Recovery procedure • How does redo logging work? – Dynamic programming • How does undo/redo logging work? – Left-deep or Right-deep join trees • What is a checkpoint? • Do you know how to estimate the size of an • What is a nonquiescent checkpoint? intermediate operation? • Do you know when <END CKPT> is added in each method? – Selection • Do you know recovery procedures with a checkpointed log? – Join • What are advantages and disadvantages of each method?

CS411 Database Systems Concurrency Control 16: Final Review - PDF document

CS411 Database Systems Concurrency Control 16: Final Review Session Kazuhiro Minami Concurrency Control Basic concepts Basic Concepts on Locks What is a lock? What is a transaction? What is a lock table? What kind of

CS411: Two Perspectives on DBMS User perspective CS411 how to use a database system

01: Introduction Please read the class syllabus, policies, and lecture schedule; ask if you

CS411 than $50,000 and pay taxes in Champaign-Urbana. Database Systems Could write in

CS411 data Has similar capabilities for queries to those in Database Systems relational

CS411 Database Systems Foreign Keys Local and Global Constraints 06: SQL Triggers Kazuhiro

CS411 Database Systems Join Expressions 06: SQL Kazuhiro Minami Join Expressions Products and

CS411 balanced search tree Database Systems hash table R etc Cost: B(R) 12:

CS411 Concurrency control Database Systems Recovery Logging Redo 13: Logging

CS411 Pick application Conceptual design domain Database Systems What data do I What data do

CS411 Whats the purpose of main memory buffer? Database Systems Why do we need a

CS411 Database Systems 04: Relational Schema Design Kazuhiro Minami Primary Goal: Minimize

System Aspects of SQL (Chapter 9: Four more ways to make SQL calls CS411 from outside the DBMS)

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

NEBC Database Course 2008 Database Servers Database Interfaces Tim Booth : tbooth@ceh.ac.uk

DATABASE SECURITY CS4750 Database Systems Prof. Nada Basit Email: basit@virginia.edu Fall

DATABASE SECURITY CS4750 Database Systems Prof. Nada Basit Email: basit@virginia.edu Fall

SIMON PLANZER LECTURER IN LAW, UNIVERSITY OF ST.GALLEN ATTORNEY AT LAW, ZURICH HEC Workshop:

Weve been here (or nearby) before IETF Technical Plenary, November 2013 Brian Carpenter 1 1

Epidemiology of Bronchiectasis Anne E. ODonnell MD May 17, 2018 Disclosures Principal

Hairy black holes in scalar tensor theories E Babichev and CC gr-qc/1312.3204 CC, T Kolyvaris, E

Vertex Coloring Stefan Schmid @ T-Labs, 2011 Graph Coloring Stefan Schmid @ T-Labs Berlin, 2012

Announcement Real-Time Scheduling Mid-term course evaluation will be open till Whatre

1 Last week we talked about implicit graph representations. This week we're going to be

Paper Presentation Amo Guangmo Tong University of Taxes at Dallas gxt140030@utdallas.edu

Sambuz

Useful Links

Newsletter

Mail Us