SLIDE 1

Cheap and Large CAMs for High Performance Data-Intensive Networked Systems

Ashok Anand, Chitra Muthukrishnan, Steven Kappes, and Aditya Akella (University of Wisconsin-Madison); Suman Nath (Microsoft Research)

SLIDE 2

New data-intensive networked systems

Large hash tables (10s to 100s of GBs)

SLIDE 3

New data-intensive networked systems

(Diagram: WAN optimizers deployed on both sides of the WAN link between a data center and a branch office.)

Each WAN optimizer keeps an object store (~4 TB) and a hash table (~32 GB). Objects are split into 4 KB chunks; each chunk is indexed by a 20 B key that maps to a chunk pointer, and every chunk triggers a hash table lookup (sketched below).

This requires:
  • Large hash tables (32 GB)
  • High-speed (~10 K/sec) inserts and evictions
  • High-speed (~10 K/sec) lookups for a 500 Mbps link
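For concreteness, a minimal, hypothetical sketch of the chunk-indexing step described above (the `chunk_index` dict, the SHA-1 choice, and the function names are illustrative assumptions, not the WAN optimizer's actual code):

```python
import hashlib

CHUNK_SIZE = 4096  # 4 KB chunks, as on the slide

# Hypothetical in-memory index: 20 B fingerprint -> offset of the chunk in the
# object store. In the real system this index is the ~32 GB hash table that a
# CLAM is meant to provide cheaply.
chunk_index = {}

def fingerprint(chunk: bytes) -> bytes:
    # SHA-1 digests are exactly 20 bytes, matching the 20 B key on the slide.
    return hashlib.sha1(chunk).digest()

def index_object(data: bytes, base_offset: int) -> None:
    # Split an object into 4 KB chunks and record where each chunk lives.
    for i in range(0, len(data), CHUNK_SIZE):
        chunk_index[fingerprint(data[i:i + CHUNK_SIZE])] = base_offset + i

def lookup_chunk(chunk: bytes):
    # Returns the stored chunk pointer (offset) if the chunk was seen before.
    return chunk_index.get(fingerprint(chunk))
```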

SLIDE 4

New data-intensive networked systems

  • Other systems

– De-duplication in storage systems (e.g., Data Domain)
– CCN cache (Jacobson et al., CoNEXT 2009)
– DONA directory lookup (Koponen et al., SIGCOMM 2007)

Common need: cost-effective large hash tables, i.e., Cheap and Large CAMs (CLAMs)

SLIDE 5

Candidate options

                     DRAM       Disk      Flash-SSD
Random reads/sec     300K       250       10K*
Random writes/sec    300K       250       5K*
Cost (128 GB)        $120K+     $30+      $225+

DRAM: too expensive (300K reads/sec over $120K is only 2.5 ops/sec/$).
Disk: too slow.
Flash-SSD: slow writes. How to deal with the slow writes of Flash SSDs?

* Derived from latencies on Intel M-18 SSD in experiments
+ Price statistics from 2008-09

SLIDE 6

Our CLAM design

  • New data structure “BufferHash” + Flash
  • Key features

– Avoid random writes; perform sequential writes in a batch
  • Sequential writes are 2X faster than random writes (Intel SSD)
  • Batched writes reduce the number of writes going to Flash
– Bloom filters for optimizing lookups

BufferHash performs orders of magnitude better than DRAM-based traditional hash tables in ops/sec/$.

SLIDE 7

Outline

  • Background and motivation
  • CLAM design

– Key operations (insert, lookup, update)
– Eviction
– Latency analysis and performance tuning

  • Evaluation
SLIDE 8

Flash/SSD primer

  • Random writes are expensive
    – Avoid random page writes
  • Reads and writes happen at the granularity of a flash page
    – I/O smaller than a page should be avoided, if possible

SLIDE 9

Conventional hash table on Flash/SSD

Keys are likely to hash to random locations, causing random writes. The SSD's FTL handles random writes to some extent, but the garbage collection overhead is high: only ~200 lookups/sec and ~200 inserts/sec with a WAN optimizer workload, far below the required 10 K/s and 5 K/s.
SLIDE 10

Conventional hash table on Flash/SSD

DRAM Flash

Can’t assume locality in requests – DRAM as cache won’t work

SLIDE 11

Our approach: Buffering insertions

  • Control the impact of random writes
  • Maintain small hash table (buffer) in memory
  • As in-memory buffer gets full, write it to flash

– We call the in-flash copy of a buffer an incarnation (sketched below)

Buffer: in-memory hash table (DRAM)
Incarnation: in-flash hash table (Flash SSD)
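A rough sketch of this buffering idea, under the assumption that a buffer is a plain dict and an incarnation is a file written in one sequential pass (names and layout are illustrative, not the paper's implementation):

```python
import os

BUFFER_CAPACITY = 1024  # entries per in-memory buffer (illustrative)

class BufferedHash:
    def __init__(self, flash_dir: str):
        self.buffer = {}            # in-memory hash table (the "buffer")
        self.flash_dir = flash_dir  # directory standing in for the SSD
        self.num_incarnations = 0
        os.makedirs(flash_dir, exist_ok=True)

    def insert(self, key: bytes, value: bytes) -> None:
        self.buffer[key] = value
        if len(self.buffer) >= BUFFER_CAPACITY:
            self._flush()

    def _flush(self) -> None:
        # Write the full buffer to flash in one sequential, batched pass,
        # creating a new "incarnation"; random in-place writes are avoided.
        path = os.path.join(self.flash_dir, f"incarnation_{self.num_incarnations}")
        with open(path, "wb") as f:
            for k, v in self.buffer.items():
                f.write(len(k).to_bytes(4, "big") + k)
                f.write(len(v).to_bytes(4, "big") + v)
        self.num_incarnations += 1
        self.buffer.clear()
```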

SLIDE 12

Two-level memory hierarchy

(Diagram: the buffer lives in DRAM; flash holds the incarnation table with incarnations 1-4, from oldest to latest.)

Net hash table is: buffer + all incarnations

SLIDE 13

Lookups are impacted due to buffers

(Diagram: a lookup key is checked against the DRAM buffer, then against incarnations 4, 3, 2, 1 in the flash incarnation table.)

Multiple in-flash lookups. Can we limit it to only one?

SLIDE 14

Bloom filters for optimizing lookups

(Diagram: one in-memory Bloom filter per incarnation; a lookup key is first checked against the Bloom filters in DRAM, and only matching incarnations (4, 3, 2, 1) are read from flash. False positives are possible, so the filters must be configured carefully; see the lookup sketch below.)

2 GB Bloom filters for 32 GB Flash for false positive rate < 0.01!
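A hedged sketch of the resulting lookup path (the `TinyBloom` class and the dict-like incarnations are stand-ins for illustration; a real CLAM sizes its filters for the < 0.01 false-positive rate stated above):

```python
import hashlib

class TinyBloom:
    # Minimal Bloom filter stand-in: m bits, k hash positions derived from SHA-1.
    def __init__(self, m_bits: int = 8192, k: int = 4):
        self.m, self.k, self.bits = m_bits, k, bytearray(m_bits // 8)

    def _positions(self, key: bytes):
        for i in range(self.k):
            digest = hashlib.sha1(bytes([i]) + key).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, key: bytes) -> None:
        for p in self._positions(key):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, key: bytes) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(key))

def lookup(key, buffer, incarnations, filters):
    # `incarnations` and `filters` are ordered oldest -> newest, with one
    # in-memory Bloom filter per in-flash incarnation.
    if key in buffer:                      # 1. check the in-memory buffer
        return buffer[key]
    for inc, bf in zip(reversed(incarnations), reversed(filters)):
        if bf.might_contain(key):          # 2. consult Bloom filters, newest first
            value = inc.get(key)           # 3. read flash only on a filter hit
            if value is not None:          #    (a miss here was a false positive)
                return value
    return None
```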

SLIDE 15

Update: naïve approach

(Diagram: the naïve update finds the key's old copy in an in-flash incarnation and overwrites it in place, which costs expensive random writes.)

Discard this naïve approach.

SLIDE 16

Lazy updates

(Diagram: the update is handled by inserting the key with its new value into the DRAM buffer; the old value stays behind in an older incarnation on flash.)

Lookups check the latest incarnations first, so they return the new value and the stale copy is simply ignored.
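In the same illustrative terms as the earlier sketches, a lazy update is nothing more than an insert; the newest-first lookup order provides the correct value:

```python
def update(key, new_value, buffer):
    # No random write to flash: the new copy in the buffer (and later in a
    # newer incarnation) shadows the stale copy left in older incarnations,
    # because lookup() probes incarnations newest to oldest.
    buffer[key] = new_value
```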

SLIDE 17

Eviction for streaming apps

  • Eviction policies may depend on the application
    – LRU, FIFO, priority-based eviction, etc.
  • Two BufferHash primitives (sketched below)
    – Full Discard: evict all items
      • Naturally implements FIFO
    – Partial Discard: retain a few items
      • Priority-based eviction by retaining high-priority items
  • BufferHash is best suited for FIFO
    – Incarnations are arranged by age
    – Other useful policies come at some additional cost
  • Details in the paper
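A sketch of the two primitives, continuing the illustrative list-of-dicts layout used above (the `keep` predicate and `rebuild_filter` helper are assumptions):

```python
def full_discard(incarnations, filters):
    # Evict every item of the oldest incarnation: drop it together with its
    # Bloom filter. Applied repeatedly, this gives FIFO eviction.
    incarnations.pop(0)
    filters.pop(0)

def partial_discard(incarnations, filters, keep, rebuild_filter):
    # Evict the oldest incarnation but retain selected (e.g. high-priority)
    # items by writing the survivors back as a rebuilt incarnation.
    oldest = incarnations.pop(0)
    filters.pop(0)
    survivors = {k: v for k, v in oldest.items() if keep(k, v)}
    if survivors:
        incarnations.insert(0, survivors)
        filters.insert(0, rebuild_filter(survivors))
```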
SLIDE 18

Issues with using one buffer

  • A single buffer in DRAM
    – All operations and eviction policies go through one buffer
  • High worst-case insert latency
    – A few seconds to flush a 1 GB buffer
    – New lookups stall during the flush

SLIDE 19

Partitioning buffers

  • Partition buffers
    – Based on the first few bits of the key space (routing sketched below)
    – Per-buffer size > flash page
      • Avoids I/O smaller than a page
    – Per-buffer size >= flash block
      • Avoids random page writes
  • Reduces worst-case insert latency
  • Eviction policies apply per buffer

(Diagram: keys whose first bit is 0 go to one buffer and its incarnations on flash; keys starting with 1 go to another.)
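A sketch of the partition-selection step (the bit-prefix idea is from the slide; hashing with SHA-1 and the partition count are assumptions):

```python
import hashlib

NUM_PARTITIONS = 16                              # power of two (illustrative)
PREFIX_BITS = NUM_PARTITIONS.bit_length() - 1    # = 4

def partition_of(key: bytes) -> int:
    # Use the first few bits of the hashed key to pick a buffer, so each
    # partition receives a roughly equal share of the inserts.
    return hashlib.sha1(key).digest()[0] >> (8 - PREFIX_BITS)

# Example: route an insert to its per-partition buffer.
buffers = [dict() for _ in range(NUM_PARTITIONS)]
buffers[partition_of(b"some-key")][b"some-key"] = b"chunk-pointer"
```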

SLIDE 20

BufferHash: Putting it all together

  • Multiple buffers in memory
  • Multiple incarnations per buffer in flash
  • One in-memory bloom filter per incarnation

(Diagram: buffers 1 through K in DRAM, each backed on flash by its own incarnations and their Bloom filters.)

Net hash table = all buffers + all incarnations

SLIDE 21

Outline

  • Background and motivation
  • Our CLAM design

– Key operations (insert, lookup, update)
– Eviction
– Latency analysis and performance tuning

  • Evaluation
SLIDE 22

Latency analysis

  • Insertion latency
    – Worst case ∝ size of the buffer
    – Average case is constant for buffer size > block size
  • Lookup latency
    – Average case ∝ number of incarnations
    – Average case ∝ false positive rate of the Bloom filters
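Stated as a rough formula (an approximation consistent with the bullets above, not the paper's exact analysis): with k incarnations and per-incarnation false-positive rate p, a lookup triggers roughly k * p unnecessary flash reads, plus one useful read if the key is actually on flash. The example numbers use the #incarnations = flash size / total buffer size relation from the next slide and the evaluation configuration (32 GB SSD, 2 GB of buffers, 0.01 false positive rate):

```python
def avg_flash_reads_per_lookup(num_incarnations: int,
                               false_positive_rate: float,
                               hit: bool) -> float:
    # Each Bloom filter wrongly sends us to flash with probability p, so a
    # lookup wastes roughly k * p flash reads; a true hit adds one useful read.
    wasted = num_incarnations * false_positive_rate
    return wasted + (1.0 if hit else 0.0)

# 32 GB flash / 2 GB of buffers = 16 incarnations, false positive rate 0.01.
print(avg_flash_reads_per_lookup(16, 0.01, hit=True))   # ~1.16 flash reads
```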

SLIDE 23

Parameter tuning: Total size of Buffers

(Diagram: DRAM is split between the buffers B1 … BN and the Bloom filters; the incarnations live on flash.)

Given a fixed amount of DRAM, how much should be allocated to buffers?

Total size of buffers = B1 + B2 + … + BN
Total Bloom filter size = DRAM size - total size of buffers
# Incarnations = Flash size / total buffer size
Lookup cost ∝ #Incarnations * false positive rate, and the false positive rate increases as the Bloom filters shrink.

Too small is not optimal; too large is not optimal either. Optimal total buffer size ≈ 2 * (SSD size / entry size).

SLIDE 24

Parameter tuning: Per-buffer size

What should the size of a partitioned buffer (e.g., B1) be? It affects the worst-case insertion latency.

(Diagram: partitioned buffers B1 … BN in DRAM, each backed by flash.)

Adjusted according to application requirements (128 KB – 1 block).

SLIDE 25

Outline

  • Background and motivation
  • Our CLAM design

– Key operations (insert, lookup, update)
– Eviction
– Latency analysis and performance tuning

  • Evaluation
SLIDE 26

Evaluation

  • Configuration

– 4 GB DRAM, 32 GB Intel SSD, Transcend SSD
– 2 GB buffers, 2 GB Bloom filters, 0.01 false positive rate
– FIFO eviction policy

SLIDE 27

BufferHash performance

  • WAN optimizer workload
    – Random key lookups followed by inserts
    – 40% hit rate
    – Also used a workload from real packet traces
  • Comparison with BerkeleyDB (a traditional hash table) on the Intel SSD

Average latency    BufferHash    BerkeleyDB
Lookup (ms)        0.06          4.6
Insert (ms)        0.006         4.8

Better lookups and better inserts!

SLIDE 28

Insert performance

(Plot: CDF of insert latency in ms on the Intel SSD, 0.001-100 ms, for BufferHash and BerkeleyDB.)

BufferHash: 99% of inserts take < 0.1 ms, thanks to the buffering effect. BerkeleyDB: 40% of inserts take > 5 ms, because random writes are slow.

SLIDE 29

Lookup performance

(Plot: CDF of lookup latency in ms for the 40% hit-rate workload, 0.001-100 ms, for BufferHash and BerkeleyDB.)

BufferHash: 99% of lookups take < 0.2 ms; 60% of lookups never go to flash, and the Intel SSD read latency is ~0.15 ms. BerkeleyDB: 40% of lookups take > 5 ms due to garbage collection overhead caused by the writes.

SLIDE 30

Performance in Ops/sec/$

  • 16K lookups/sec and 160K inserts/sec
  • Overall cost of $400
  • 42 lookups/sec/$ and 420 inserts/sec/$

– Orders of magnitude better than the 2.5 ops/sec/$ of DRAM-based hash tables (arithmetic check below)
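A quick arithmetic check of these numbers against the average latencies reported two slides earlier:

```python
avg_lookup_ms, avg_insert_ms, cost_dollars = 0.06, 0.006, 400

lookups_per_sec = 1000 / avg_lookup_ms       # ~16,700  ("16K lookups/sec")
inserts_per_sec = 1000 / avg_insert_ms       # ~166,700 ("160K inserts/sec")
print(lookups_per_sec / cost_dollars)        # ~42 lookups/sec/$
print(inserts_per_sec / cost_dollars)        # ~417 inserts/sec/$ (quoted as 420)
```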

SLIDE 31

Other workloads

  • Varying fractions of lookups
  • Results on the Transcend SSD

Average latency per operation:

Lookup fraction    BufferHash    BerkeleyDB
0                  0.007 ms      18.4 ms
0.5                0.09 ms       10.3 ms
1                  0.12 ms       0.3 ms

  • BufferHash is ideally suited for write-intensive workloads

SLIDE 32

Evaluation summary

  • BufferHash performs orders of magnitude better in ops/sec/$ than traditional hash tables on DRAM (and disks)
  • BufferHash is best suited for a FIFO eviction policy
    – Other policies can be supported at additional cost; details in the paper
  • A WAN optimizer using BufferHash can operate optimally at 200 Mbps, much better than the 10 Mbps achieved with BerkeleyDB
    – Details in the paper

SLIDE 33

Related Work

  • FAWN (Vasudevan et al., SOSP 2009)
    – Cluster of wimpy nodes with flash storage
    – Each wimpy node has its hash table in DRAM
    – We target:
      • Hash tables much bigger than DRAM
      • Low-latency as well as high-throughput systems
  • HashCache (Badam et al., NSDI 2009)
    – In-memory hash table for objects stored on disk

SLIDE 34

Conclusion

  • We have designed a new data structure, BufferHash, for building CLAMs
  • Our CLAM on the Intel SSD achieves high ops/sec/$ for today's data-intensive systems
  • Our CLAM can support useful eviction policies
  • It dramatically improves the performance of WAN optimizers
SLIDE 35

Thank you