SLIDE 1
Scalable Concurrent Hash Tables via Relativistic Programming
Josh Triplett
April 29, 2010
SLIDE 2
Speed of data < Speed of light
- Speed of light: 3e8 meters/second
- Processor speed: 3 GHz, 3e9 cycles/second
- 0.1 meters/cycle (4 inches/cycle)
- Ignores propagation delay, ramp time, speed of signals
SLIDE 3
Speed of data < Speed of light
- Speed of light: 3e8 meters/second
- Processor speed: 3 GHz, 3e9 cycles/second
- 0.1 meters/cycle (4 inches/cycle)
- Ignores propagation delay, ramp time, speed of signals
- One of the reasons CPUs stopped getting faster
- Physical limit on memory, CPU–CPU communication
SLIDE 4
Throughput vs Latency
- CPUs can do a lot of independent work in 1 cycle
- CPUs can work out of their own cache in 1 cycle
- CPUs can’t communicate and agree in 1 cycle
SLIDE 5
How to scale?
- To improve scalability, work independently
- Agreement represents the bottleneck
- Scale by reducing the need to agree
SLIDE 6
Classic concurrent programming
- Every CPU agrees on the order of instructions
- No tolerance for conflicts
- Implicit communication and agreement required
- Does not scale
- Example: mutual exclusion
SLIDE 7
Relativistic programming
- By analogy with physics: no global reference frame
- Allow each thread to work with its observed “relative” view of
memory
- Minimal constraints on instruction ordering
- Tolerance for conflicts: allow concurrent threads to access
shared data at the same time, even when doing modifications.
SLIDE 8
Why relativistic programming?
- Wait-free
- Very low overhead
- Linear scalability
SLIDE 9
Concrete examples
- Per-CPU variables
SLIDE 10
Concrete examples
- Per-CPU variables
- Deferred destruction — Read-Copy Update (RCU)
SLIDE 11
What does RCU provide?
- Delimited readers with near-zero overhead
- “Wait for all current readers to finish” operation
- Primitives for conflict-tolerant operations:
rcu_assign_pointer, rcu_dereference
SLIDE 12
What does RCU provide?
- Delimited readers with near-zero overhead
- “Wait for all current readers to finish” operation
- Primitives for conflict-tolerant operations:
rcu_assign_pointer, rcu_dereference
- Working data structures you don’t have to think hard about
SLIDE 13
RCU data structures
- Linked lists
- Radix trees
- Hash tables, sort of
SLIDE 14
Hash tables, sort of
- RCU linked lists for buckets
- Insertion and removal
- No other operations
SLIDE 15
New RCU hash table operations
- Move element
- Resize table
SLIDE 16
Move operation
[Diagram: buckets a … b; chain n1–n5; target node in the old bucket with key “old”]
SLIDE 17
Move operation
[Diagram: buckets a … b; chain n1–n5; target node now reachable with key “new”]
SLIDE 18
Move operation semantics
- If a reader doesn’t see the old item, subsequent lookups of the
new item must succeed.
- If a reader sees the new item, subsequent lookups of the old
item must fail.
- The move operation must not cause concurrent lookups for
other items to fail
- Semantics based roughly on filesystems
SLIDE 19
Move operation challenge
- Trivial to implement with mutual exclusion
- Insert then remove, or remove then insert
- Intermediate states don’t matter
- Hash table buckets use linked lists
- RCU linked list implementations provide insert and remove
- Move semantics not possible using just insert and remove
SLIDE 20
Current approach in Linux
- Sequence lock
- Readers retry if they race with a rename
- Any rename
SLIDE 21
Solution characteristics
- Principles:
- One semantically significant change at a time
- Intermediate states must not violate semantics
- Need a new move operation specific to relativistic hash tables,
making moves a single semantically significant change with no broken intermediate state
- Must appear to simultaneously move item to new bucket and
change key
SLIDE 23
Key idea
[Diagram: buckets a … b; chain n1–n5; target node’s key still “old”]
- Cross-link end of new bucket to node in old bucket
SLIDE 24
Key idea
[Diagram: buckets a … b; chain n1–n5; target node’s key changed to “new”]
- Cross-link end of new bucket to node in old bucket
- While target node appears in both buckets, change the key
SLIDE 25
Key idea
[Diagram: buckets a … b; chain n1–n5; target node’s key “new”]
- Cross-link end of new bucket to node in old bucket
- While target node appears in both buckets, change the key
- Need to resolve cross-linking safely, even for readers looking at
the target node
- First copy target node to the end of its bucket, so readers
can’t miss later nodes
- Memory barriers
SLIDE 26
Benchmarking with rcuhashbash
- Run one thread per CPU.
- Continuous loop: randomly perform a lookup or a move
- Configurable algorithm and lookup:move ratio
- Run for 30 seconds, count reads and writes
- Average of 10 runs
- Tested on 64 CPUs
SLIDE 27
Results, 999:1 lookup:move ratio, reads
[Graph: millions of hash lookups per second vs. CPUs (1–64), comparing the proposed algorithm, current Linux (RCU+seqlock), per-bucket spinlocks, and per-bucket reader-writer locks]
SLIDE 28
Results, 1:1 lookup:move ratio, reads
[Graph: millions of hash lookups per second vs. CPUs (1–64), comparing per-bucket spinlocks, per-bucket reader-writer locks, the proposed algorithm, and current Linux (RCU+seqlock)]
SLIDE 29
Resizing RCU-protected hash tables
- Disclaimer: work in progress
- Working on implementation and test framework in rcuhashbash
- No benchmark numbers yet
- Expect code and announcement soon
SLIDE 30
Resizing algorithm
- Keep a secondary table pointer, usually NULL
- Lookups use secondary table if primary table lookup fails
SLIDE 31
Resizing algorithm
- Keep a secondary table pointer, usually NULL
- Lookups use secondary table if primary table lookup fails
- Cross-link tails of chains to second table in appropriate bucket
SLIDE 32
Resizing algorithm
- Keep a secondary table pointer, usually NULL
- Lookups use secondary table if primary table lookup fails
- Cross-link tails of chains to second table in appropriate bucket
- Wait for current readers to finish before removing cross-links
from primary table
SLIDE 33
Resizing algorithm
- Keep a secondary table pointer, usually NULL
- Lookups use secondary table if primary table lookup fails
- Cross-link tails of chains to second table in appropriate bucket
- Wait for current readers to finish before removing cross-links
from primary table
- Repeat until primary table empty
- Make the secondary table primary
- Free the old primary table after a grace period
SLIDE 34
For more information
- Code: git://git.kernel.org/pub/scm/linux/kernel/git/josh/rcuhashbash (Resize coming soon!)
- Relativistic programming: http://wiki.cs.pdx.edu/rp/
- Email: josh@joshtriplett.org