Locking.tdb without locks? SambaXP 2016 Berlin Volker Lendecke - PowerPoint PPT Presentation

Locking.tdb without locks? SambaXP 2016 Berlin Volker Lendecke Samba Team / SerNet 2016-05-12

Small tdb intro ◮ tdb (Trivial (Tridge) Data Base) is a shared writer key-value store ◮ API similar to dbm ◮ tdb is implemented as a hash table with a linked list overflow ◮ Shared mmap with locks per hash list ◮ Optimized for heavy small read/write traffic ◮ Lots of tuning done in recent years ◮ Freelist traffic reduced by dead records ◮ Freelist fragmentation reduced ◮ You knew all this, right? Locking.tdb without locks? (2 vl / 10)

Locking.tdb in a nutshell ◮ Locking.tdb is (still?) our central open-file database ◮ It is very heavily contended ◮ Locking.tdb protects atomic opens/closes ◮ create/setattr/setacl/unlink ◮ For open and close, a tdb record is locked ◮ brlock.tdb is locked while locking.tdb is locked ◮ Two records locked simultaneously – deadlock? ◮ DBWRAP LOCK ORDER maintains lock ordering ◮ Metadata operations are done while holding the lock ◮ Unlink can take ages Locking.tdb without locks? (3 vl / 10)

dbwrap ◮ tdb is a low-level API ◮ Exposes the hash chain structure (”tdb chainlock”) ◮ Really, really tricky semantics around locking ◮ Not aware of talloc ◮ We wanted clustering, tdb does not cluster, so: ◮ All problems in computer science can be solved by another level of indirection, except of course for the problem of too many indirections. ◮ Implement a wrapper around tdb with the really needed features ◮ dbwrap fetch locked() being the heart of it Locking.tdb without locks? (4 vl / 10)

g lock ◮ ctdb can not provide clusterwide locks ◮ For persistent databases, we need to protect replication ◮ Simulate fcntl locks in user space ◮ g lock lock creates a record with the locker’s PID as the only content ◮ There’s code for shared locks, but that was never used ◮ First implementation: lock waiters were added in an array ◮ Unlock sent messages to all waiters for retry Locking.tdb without locks? (5 vl / 10)

dbwrap watch ◮ g lock was the third place where someone waits for record changes ◮ Oplock breakers waited for break or close ◮ SHARING VIOLATION 1-sec delay (or 5x 200msec: Hi, Chris :-)) ◮ dbwrap record watch send abstracts that ◮ dbwrap watchers.tdb holds all waiters for any record in any db ◮ With dbwrap watch db(), every store to a database will trigger watchers ◮ Watchers typically wait for: ◮ Lease break ack by client’s smbd ◮ g lock unlocked by lock holder Locking.tdb without locks? (6 vl / 10)

Monitoring processes ◮ Watching a record ist mostly waiting for someone to do something ◮ What happens if that ”someone” dies hard? ◮ Arbitrary processes need to monitor each other ◮ SIGCHLD only works for direct children ◮ With unix datagram messaging every process holds a lockfile ◮ fcntl wait for the lockfile to be given up? ◮ tmond and stream based messaging solves monitoring local processes ◮ g lock in current master just polls ◮ dbwrap record watch send grew a ”blocker” argument ◮ dbwrap record watch recv indicates blocker crash: EOWNERDEAD Locking.tdb without locks? (7 vl / 10)

Finally, dbwrap nolock ◮ Double locks (locking.tdb and brlock.tdb) are bad ◮ Gave Amitay a bad time for parallel database recoveries ◮ Cluster file systems can block smbd completely in D for a looong time ◮ The file is dead, the others on the hash chain too :-( ◮ With mutexes, we lost /proc/locks ◮ Diagnosis for contended locks more difficult ◮ dbwrap backend based on g lock ◮ A locked record holds the lock owner in the data field ◮ Lock waiters use dbwrap record watch send ◮ With mutexes, the noncontended case should not be much slower ◮ Lock contention is worse, but that’s bad already Locking.tdb without locks? (8 vl / 10)

Implementation details ◮ dbwrap nolock is not exactly lockless ◮ Critical region under the lock is very small and confined ◮ No file system operations under the lock ◮ Always locks two tdbs very briefly: Locking.tdb and dbwrap watch.tdb ◮ The critical region ops could be delegated to a finite state machine ◮ Persistent file handles anyone? ◮ Open issues: ◮ Performance of course ◮ Scalability with thousands of waiters – watchersd (like notifyd?) ◮ Watching processes on remote nodes ◮ Demo time :-) Locking.tdb without locks? (9 vl / 10)

Questions? vl@samba.org / vl@sernet.de Locking.tdb without locks? vl (10 / 10)

Locking.tdb without locks? SambaXP 2016 Berlin Volker Lendecke - PowerPoint PPT Presentation

Locking.tdb without locks? SambaXP 2016 Berlin Volker Lendecke Samba Team / SerNet 2016-05-12 Small tdb intro tdb (Trivial (Tridge) Data Base) is a shared writer key-value store API similar to dbm tdb is implemented as a hash table

Lets talk locks! @kavya719 kavya locks. locks are slow locks are slow latency

Module 15: Managing Transactions and Locks Overview Introduction to Transactions and Locks

Threads & Locks Threads & Locks Srinidhi Varadarajan Topics Topics Thread

NUMA-aware Reader-Writer Locks Tom Herold, Marco Lamina 04.02.2015 NUMA Seminar Agenda 1.

Topics Topics Thread Programming (Chapter 12) Threads & Locks

Unit testing and mocking with cmocka Unit testing and mocking with cmocka SambaXP 2018 SambaXP

Samba in love with GnuTLS Samba in love with GnuTLS SambaXP 2019 SambaXP 2019 Andreas Schneider

LOCKING CS 2550 / Spring 2006 Principles of Database Systems 10 Locking Alexandros

CS533 Concepts of Operating Systems Linux Kernel Locking Techniques Intro to kernel locking

Database concurrency control with precision: orthogonal key-value locking Goetz Graefe Locks vs

CS 134: Operating Systems Locks and Low-Level Synchronization 1 / 30 Overview CS34 Overview

POSIX Thread Synchronization Mutex Locks Condition Variables Read-Write Locks

synchronization 2: locks / memory ordering 1 last time pthread create/join racing where data

Last time Need for synchronization primitives 7: Synchronization Locks and building locks

CPL 2016, week 2 Inter-thread synchronization: locks and monitors Oleg Batrashev Institute of

Orthogonal key-value locking Goetz Graefe, Hideaki Kimura Hewlett-Packard Laboratories Palo Alto,

Lets Talk! Uniting Dev and UX to design for Voice Pascal Heynol @ UX Cambridge 2018 Pascal

Janus: back to the future of WebRTC History IETF WebRTC Janus Gateways Lorenzo Miniero

Corporate Lab or Academic Department, Which Fits? Bill Aiello University of British Columbia

How the interest in claw-free graphs travelled from Pittsburgh via Berlin and New York to Rome

LIFE AFTER CARBON IDEAS THAT ARE CHANGING THE EVOLUTION OF CITIES PLUS THE CLIMATE EMERGENCY

A I rtific ia l nte llig e nc e is already impacting many parts of our lives In order to

Using Zeek for SSL Research Hochschule Darmstadt FH Neu-Ulm CA - G01 Beuth Hochschule Berlin CA

POLI 120N: Contention and Conflict in Africa Professor Adida European imprint First, a few

Sambuz

Useful Links

Newsletter

Mail Us

Locking.tdb without locks? SambaXP 2016 Berlin Volker Lendecke - PowerPoint PPT Presentation

Locking.tdb without locks? SambaXP 2016 Berlin Volker Lendecke Samba Team / SerNet 2016-05-12 Small tdb intro tdb (Trivial (Tridge) Data Base) is a shared writer key-value store API similar to dbm tdb is implemented as a hash table

Lets talk locks! @kavya719 kavya locks. locks are slow locks are slow latency

Module 15: Managing Transactions and Locks Overview Introduction to Transactions and Locks

Threads &amp; Locks Threads &amp; Locks Srinidhi Varadarajan Topics Topics Thread

NUMA-aware Reader-Writer Locks Tom Herold, Marco Lamina 04.02.2015 NUMA Seminar Agenda 1.

Topics Topics Thread Programming (Chapter 12) Threads &amp; Locks

Unit testing and mocking with cmocka Unit testing and mocking with cmocka SambaXP 2018 SambaXP

Samba in love with GnuTLS Samba in love with GnuTLS SambaXP 2019 SambaXP 2019 Andreas Schneider

LOCKING CS 2550 / Spring 2006 Principles of Database Systems 10 Locking Alexandros

CS533 Concepts of Operating Systems Linux Kernel Locking Techniques Intro to kernel locking

Database concurrency control with precision: orthogonal key-value locking Goetz Graefe Locks vs

CS 134: Operating Systems Locks and Low-Level Synchronization 1 / 30 Overview CS34 Overview

POSIX Thread Synchronization Mutex Locks Condition Variables Read-Write Locks

synchronization 2: locks / memory ordering 1 last time pthread create/join racing where data

Last time Need for synchronization primitives 7: Synchronization Locks and building locks

CPL 2016, week 2 Inter-thread synchronization: locks and monitors Oleg Batrashev Institute of

Orthogonal key-value locking Goetz Graefe, Hideaki Kimura Hewlett-Packard Laboratories Palo Alto,

Lets Talk! Uniting Dev and UX to design for Voice Pascal Heynol @ UX Cambridge 2018 Pascal

Janus: back to the future of WebRTC History IETF WebRTC Janus Gateways Lorenzo Miniero

Corporate Lab or Academic Department, Which Fits? Bill Aiello University of British Columbia

How the interest in claw-free graphs travelled from Pittsburgh via Berlin and New York to Rome

LIFE AFTER CARBON IDEAS THAT ARE CHANGING THE EVOLUTION OF CITIES PLUS THE CLIMATE EMERGENCY

A I rtific ia l nte llig e nc e is already impacting many parts of our lives In order to

Using Zeek for SSL Research Hochschule Darmstadt FH Neu-Ulm CA - G01 Beuth Hochschule Berlin CA

POLI 120N: Contention and Conflict in Africa Professor Adida European imprint First, a few

Sambuz

Useful Links

Newsletter

Mail Us

Threads & Locks Threads & Locks Srinidhi Varadarajan Topics Topics Thread

Topics Topics Thread Programming (Chapter 12) Threads & Locks