Read-Copy Update (RCU) - Don Porter, CSE 506



SLIDE 1

Read-Copy Update (RCU)

Don Porter CSE 506

SLIDE 2

RCU in a nutshell

ò Think about data structures that are mostly read,

  • ccasionally written

ò Like the Linux dcache

ò RW locks allow concurrent reads

ò Still require an atomic decrement of a lock counter ò Atomic ops are expensive

ò Idea: Only require locks for writers; carefully update data structure so readers see consistent views of data

SLIDE 3

Motivation

(from Paul McKenney’s Thesis)

[Figure: hash table searches per microsecond vs. number of CPUs (1-4), comparing "ideal", "global", and "globalrw" locking]

Performance of RW lock only marginally better than mutex lock

SLIDE 4

Principle (1/2)

ò Locks have an acquire and release cost

ò Substantial, since atomic ops are expensive

ò For short critical regions, this cost dominates performance

SLIDE 5

Principle (2/2)

ò Reader/writer locks may allow critical regions to execute in parallel ò But they still serialize the increment and decrement of the read count with atomic instructions

ò Atomic instructions performance decreases as more CPUs try to do them at the same time

ò The read lock itself becomes a scalability bottleneck, even if the data it protects is read 99% of the time

SLIDE 6

Lock-free data structures

ò Some concurrent data structures have been proposed that don’t require locks ò They are difficult to create if one doesn’t already suit your needs; highly error prone ò Can eliminate these problems

SLIDE 7

RCU: Split the difference

ò One of the hardest parts of lock-free algorithms is concurrent changes to pointers

ò So just use locks and make writers go one-at-a-time

ò But, make writers be a bit careful so readers see a consistent view of the data structures ò If 99% of accesses are readers, avoid performance-killing read lock in the common case

SLIDE 8

Example: Linked lists

[Diagram: list A -> C -> E, with a new node B being inserted between A and C]

If A's pointer is redirected to B before B's next pointer is initialized, a reader that goes to B follows an uninitialized pointer and gets a page fault. This implementation needs a lock.

SLIDE 9

Example: Linked lists

[Diagram: list A -> C -> E; new node B, with its next pointer already set to E, replaces C by swinging A's pointer from C to B]

Reader goes to C or B; either is OK. Garbage collect C after all readers have finished.

SLIDE 10

Example recap

ò Notice that we first created node B, and set up all

  • utgoing pointers

ò Then we overwrite the pointer from A

ò No atomic instruction needed ò Either traversal is safe ò In some cases, we may need a memory barrier

ò Key idea: Carefully update the data structure so that a reader can never follow a bad pointer

SLIDE 11

Garbage collection

ò Part of what makes this safe is that we don’t immediately free node C

ò A reader could be looking at this node ò If we free/overwrite the node, the reader tries to follow the ‘next’ pointer ò Uh-oh

ò How do we know when all readers are finished using it?

ò Hint: No new readers can access this node: it is now unreachable

SLIDE 12

Quiescence

ò Trick: Linux doesn’t allow a process to sleep while traversing an RCU-protected data structure

ò Includes kernel preemption, I/O waiting, etc.

ò Idea: If every CPU has called schedule() (quiesced), then it is safe to free the node

ò Each CPU counts the number of times it has called schedule() ò Put a to-be-freed item on a list of pending frees ò Record timestamp on each CPU ò Once each CPU has called schedule, do the free

SLIDE 13

Quiescence, cont

ò There are some optimizations that keep the per-CPU counter to just a bit

ò Intuition: All you really need to know is if each CPU has called schedule() once since this list became non-empty ò Details left to the reader

SLIDE 14

Limitations

ò No doubly-linked lists ò Can’t immediately reuse embedded list nodes

ò Must wait for quiescence first ò So only useful for lists where an item’s position doesn’t change frequently

ò Only a few RCU data structures in existence

SLIDE 15

Nonetheless

ò Linked lists are the workhorse of the Linux kernel ò RCU lists are increasingly used where appropriate ò Improved performance!

SLIDE 16

API

ò Drop in replacement for read_lock:

ò rcu_read_lock()

ò Wrappers such as rcu_assign_pointer() and rcu_dereference_pointer() include memory barriers ò Rather than immediately free an object, use call_rcu(object, delete_fn) to do a deferred deletion

SLIDE 17

From McKenney and Walpole, Introducing Technology into the Linux Kernel: A Case Study

[Plot: # RCU API uses per year in the Linux kernel, 2002-2009]

Figure 2: RCU API Usage in the Linux Kernel

SLIDE 18

Summary

ò Understand intuition of RCU ò Understand how to add/delete a list node in RCU ò Pros/cons of RCU