Big and Fast Anti-Caching in OLTP Systems Justin DeBrabant Online - PowerPoint PPT Presentation

Big and Fast   Anti-Caching in OLTP Systems Justin DeBrabant

Online Transaction Processing transaction-oriented small footprint write-intensive 2

A bit of history… 3

OLTP Through the Years relational model rise of the web Ingres/System R “end of an era” OLAP 2015 1972 1993 4

Modern OLTP Requirements 1. web-scale (big) 2. high-throughput (fast) 5

Thesis Motivation ▸ traditional disk-based architectures aren’t fast enough ▸ newer main memory architectures aren’t big enough 6

Can we have main- memory performance for larger-than-memory datasets? 7

Thesis Overview: Contributions 1. anti-caching architecture larger than memory datasets in main ‣ memory DBMS 2. anti-caching + persistent memory exploring next-generation hardware and ‣ OLTP systems 8

Outline ▸ Introduction ▸ Overview and Motivation ▸ Anti-Caching Architecture ▸ Memory Optimizations ▸ Anti-Caching on NVM ▸ Future Work and Conclusions 9

Disk-Oriented Architectures ▸ assumption: data won’t fit in memory ▸ disk-resident data, main memory buffer pool for execution ▸ concurrency is a must ▸ transaction serialization and locks 10

Memory Costs 1E+10 price per GB ($) 1E+05 1E+00 1970 1973 1976 1979 1982 1985 1988 1991 1994 1997 2000 2003 2006 2009 2012 11

Now What? 1. DBMS buffer pool 2. distributed cache 3. in-memory DBMS 12

Buffer Pool ▸ must still… ‣ maintain buffer pool ‣ lock/latch data ‣ maintain ARIES-style recovery logs ▸ question: What is the overhead of all these things? 13

Buffer Pool 31% Locking Recovery Real Work 26% 31% 12% OLTP Through the Looking Glass, and What We Found There   SIGMOD ‘08 14

Cache Layer Persistence Layer 16

Main Memory Cache ▸ fast and scalable, but… ▸ key-value interface ▸ not ACID (AI, not CD) 17

Consistency and Durability ▸ reads are easy, writes are not ▸ multiple copies of data ▸ application’s responsibility ▸ for OLTP, writes are common and consistency is essential 18

H-Store Architecture ▸ partitioned, shared-nothing ▸ single-threaded main memory execution ‣ no need for locks and latches ▸ lightweight recovery ‣ snapshots + command log 21

virtual memory! data > memory? 23

persistent storage 24

Big and Fast big: disk-oriented fast: memory-oriented big and fast: anti-caching 25

OLTP workloads are skewed 26

Design Principles ▸ asynchronous disk fetches ‣ don’t block ▸ maintain ordering of evicted data accesses ‣ ensures transactional consistency ▸ single copy of data ‣ consistency is free ▸ efficient memory use, no swizzling 27

Outline ▸ Introduction ▸ Overview and Motivation ▸ Anti-Caching Architecture ▸ Memory Optimizations ▸ Anti-Caching on NVM ▸ Future Work and Conclusions 28

Architectural Overview ▸ memory is primary storage, cold data is evicted to disk-based anti- cache ▸ reading data from the anti-cache is done in 3 phases ‣ avoids blocking, ensures consistency 29

Anti-Caching Phases ▸ evict ▸ pre-pass ▸ fetch ▸ merge 30

Evict 1. data > anti-cache threshold 2. dynamically construct anti- cache blocks of coldest tuples 3. asynchronously write to disk 31

Pre-Pass 1. a transaction enters pre-pass when evicted data is accessed 2. continues execution, creating list of evicted blocks 3. abort, queue blocks to be fetched 32

Fetch 1. data is fetched asynchronously from disk ‣ avoids blocking 2. moved into merge buffer 33

Merge 1. data is moved from in-memory merge buffer to in-memory table 2. previously aborted transaction is restarted 3. transaction executes normally 34

Anti-Caching Phase: Pre-Pass Anti-Caching Phase: Merge Anti-Caching Phase: Fetch Anti-Caching Phase: Evict anti-cache

Tracking Access Patterns ▸ done online, more responsive to changes in workload ▸ goal is low CPU and memory overhead ▸ approximate ordering is OK 36

Approximate LRU (aLRU) ▸ maintain LRU chain embedded in tuple headers ▸ per-partition ▸ transactions that update LRU chain are sampled randomly ▸ configurable sample rate 37

Anti-Caching vs. Swapping ▸ fine-grained eviction ▸ blocks constructed dynamically ▸ asynchronous batched fetches ▸ possible because of transactions 38

Anti-Caching vs. Caching ▸ data exists in exactly one location ‣ caching architectures have multiple copies, must maintain consistency ‣ data is moved, not copied ▸ goal is increased data size, not throughput 39

Benchmarking ▸ YCSB ▸ Zipfian skew ▸ data > memory ▸ read/write mix ▸ MySQL, MySQL + memcached 40

YCSB, read-only, data 8X memory anti-cache MySQL MySQL + memcached 120000 throughput (txn/s) 90000 60000 30000 0 1.5 1.25 1 0.75 0.5 workload skew (high —> low) 41

YCSB, read-heavy, data 8X memory anti-cache MySQL MySQL + memcached 120000 throughput (txn/s) 90000 60000 30000 0 1.5 1.25 1 0.75 0.5 workload skew (high —> low) 42

Tracking Accesses Revisited ▸ approximate ordering is OK ▸ original implementation ▸ aLRU (linked list) ▸ compute vs. memory Can we reduce the memory overhead? 43

Timestamp-Based Eviction ▸ use relative timestamps to track accesses ▸ to evict, take subset of tuples and evict based on timestamp age ▸ questions: ▸ timestamp granularity ▸ sample size (power of two) 44

Timestamp Granularity ▸ 4 byte timestamps ▸ use instruction counter ▸ 2 byte timestamps ▸ use epochs, set the timestamp to the current epoch 45

YCSB, read-heavy, data 8X aLRU chain timestamp-low timestamp-high 90000 throughput (txn/s) 67500 45000 22500 0 1.5 1.25 1 0.75 0.5 workload skew (high —> low) 46

Key Take-Aways ▸ 8-17X improvement for skewed workloads at larger- than-memory data sizes ▸ disk becomes the bottleneck for lower skew 47

Hardware Assumptions are Key ▸ heavily influence system architectures ▸ many factors ▸ capacity ▸ latency ▸ volatility 48

What’s next for OLTP? 49

Non-Volatile Memory 50

Properties of NVM ▸ non-volatile ▸ random-access ▸ high write endurance ‣ except flash ▸ byte-addressable ‣ except flash 51

The NVM Arms Race ▸ FeRAM ‣ high write endurance ▸ MRAM ‣ DRAM-like latency ▸ PCM (PRAM) ‣ DRAM-like capacity 52

Looking Forward… ▸ OLTP architectures and NVM ‣ anti-cache architecture ‣ disk-based architecture ▸ open questions ‣ Which architecture is best suited for NVM? ‣ What adaptations are needed? 53

NVM Emulation ▸ goal: provide product-independent analysis ▸ test wide range of latency profiles ▸ automatically add specified latency ▸ built by collaborators at Intel 54

Anti-Caching on NVM ▸ replace disk with NVM ▸ several adaptations necessary ▸ lightweight array-based anti-cache ▸ utilizes mmap interface ▸ fine-grained block and tuple eviction interface 55

Disk-Oriented Architectures on NVM ▸ must adapt both storage and log files to be use NVM mmap interface ▸ configure to use fine-grained buffer pool pages 56

YCSB, read-only, data 8X anti-caching MySQL 180000 throughput (txn/s) 135000 90000 45000 0 1.5 1.25 1 0.75 0.5 workload skew (high —> low) 57

YCSB, read-heavy, data 8X anti-caching MySQL 180000 throughput (txn/s) 135000 90000 45000 0 1.5 1.25 1 0.75 0.5 workload skew (high —> low) 58

Future Work 59

Multi-Tier Architectures ▸ DRAM -> NVM -> Disk/SSD ▸ open questions ▸ indexing structures ▸ synchronous/asynchronous fetches 60

Anti-Caching Indexes ▸ index size can be significant ▸ can cold index ranges be evicted to an anti-cache? ▸ open questions ▸ how/what to evict ▸ execution changes 61

Semantic Anti-Caching ▸ current implementation makes no assumption about types of skew ▸ skew typically as semantic meaning ▸ e.g., temporal, spatial ▸ can we leverage these domain semantics? 62

Conclusions ▸ anti-caching architecture outperforms and outscales previous OLTP architectures ▸ well-suited for next-generation NVM- based architectures 63

  Questions?   debrabant@cs.brown.edu   65

Big and Fast Anti-Caching in OLTP Systems Justin DeBrabant Online - PowerPoint PPT Presentation

Big and Fast Anti-Caching in OLTP Systems Justin DeBrabant Online Transaction Processing transaction-oriented small footprint write-intensive 2 A bit of history 3 OLTP Through the Years relational model rise of the web

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

Machine Learning Anders Holst SICS Big Data Analytics Analysis Big Data Big Value Big Data

Fast Food and Your Health www.ddssafety.net Last updated October 2009 What is fast food?

Community Update MST T Fast st Facts cts MST T Fast st Facts cts MST T Fast st Facts

Lurssen 32,9 A classic fast Lurssen 32,9 A classic fast A F T D E C K Lurssen 32,9 A

I Prefer Pi Corey Sinnamon Febuary 3, 2015 Big Day 3/14/15 Big Day 3/14/15 Themes Big

Big Data Algorithms with Medical Applications Yixin Chen Outline Challenges to big data

CS535 Big Data 1/22/2020 Sangmi Lee Pallickara CS535 Big Data | Computer Science Department

COMP9313: Big Data Management Introduction to Big Data Management What is big data? Tweeted by

Redis for Fast Data Ingest Agenda Fast Data Ingest and its challenges Redis for Fast

IOTA/FAST Collaboration Meeting - Intro Vladimir SHILTSEV, AD/APC IOTA/FAST Workshop and

The Education for All The Education for All Fast Track Initiative Fast Track Initiative

Fast-track listing Fast-track listing process Time to market can be essential benefits of

Fast-SCNN: Fast Semantic Segmentation Network Rudra PK Poudel Stephan Liwicki Roberto Cipolla

Michael Stonebraker The Meaning of Big Data - 3 V s Big Volume With simple (SQL)

Big Data Analytics: What is Big Data? Stony Brook University CSE545, Fall 2016 the inaugural

Plan to Finish Nathan Ricke Van Voorhis Group HFB-DMET Research Goals: Write Fock-space

KDI DERA Methodology Fausto Giunchiglia and Mattia Fumagallli University of Trento

J. PROSPERO E. DE VERA III, DPA Commissioner, Commission on Higher Education DISTRIBUT RIBUTION

Loop Invariants Dr. Mattox Beckman University of Illinois at Urbana-Champaign Department of

Introduction to Scintillation Arcs the single-dish version Dan Stinebring Oberlin

Argument Search with Voice Assistants Master's Thesis by Kevin Lang Referees: Advisor: Prof.

Wiera : Towards Flexible Multi-Tiered Geo-Distributed Cloud Storage Instances Kwangsung Oh ,

Today Some logistics Overview lecture on recognition models Visual Recognition and