Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware - PowerPoint PPT Presentation

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware @ Andy_Pavlo // 15- 721 // Spring 2018

2 ADM IN ISTRIVIA Snowflake Guest: May 2 th @ 3:00pm Final Exam Handout: May 2 nd Code Review #2: May 2 nd @ 11:59pm → We will use the same group pairings as before. Final Presentations: May 14 th @ 8:30am → GHC 4303 (ignore schedule!) → 12 minutes per group → Food and prizes for everyone! CMU 15-721 (Spring 2018)

3 ADM IN ISTRIVIA Course Evaluation → Please tell me what you really think of me. → I actually take your feedback in consideration. → Take revenge on next year's students. https://cmu.smartevals.com/ CMU 15-721 (Spring 2018)

4 DATABASE H ARDWARE People have been thinking about using hardware to accelerate DBMSs for decades. 1980s: Database Machines 2000s: FPGAs + Appliances 2010s: FPGAs + GPUs DATABASE MACHINES: AN IDEA WHOSE TIME HAS PASSED? A CRITIQUE O OF THE FUTURE OF DATABASE M MACHINES University of Wisconsin 1983 CMU 15-721 (Spring 2018)

5 Non-Volatile Memory GPU Acceleration Hardware Transactional Memory CMU 15-721 (Spring 2018)

6 N O N- VO LATILE M EM O RY Emerging storage technology that provide low latency read/writes like DRAM, but with persistent writes and large capacities like SSDs. → aka Storage-class Memory, Persistent Memory First devices will be block-addressable (NVMe) Later devices will be byte-addressable. CMU 15-721 (Spring 2018)

7 FUN DAM EN TAL ELEM EN TS O F CIRCUITS Capacitor Resistor Inductor (ca. 1745) (ca. 1827) (ca. 1831) CMU 15-721 (Spring 2018)

8 FUN DAM EN TAL ELEM EN TS O F CIRCUITS In 1971, Leon Chua at Berkeley predicted the existence of a fourth fundamental element. A two-terminal device whose resistance depends on the voltage applied to it, but when that voltage is turned off it permanently remembers its last resistive state. TWO CENTURIES OF MEMRISTORS Nature Materials 2012 CMU 15-721 (Spring 2018)

9 FUN DAM EN TAL ELEM EN TS O F CIRCUITS Capacitor Resistor Inductor Memristor (ca. 1745) (ca. 1827) (ca. 1831) (ca. 1971) CMU 15-721 (Spring 2018)

10 M ERISTO RS A team at HP Labs led by Stanley Williams stumbled upon a nano-device that had weird properties that they could not understand. It wasn’t until they found Chua’s 1971 paper that they realized what they had invented. HOW WE FOUND THE MISSING MEMRISTOR IEEE Spectrum 2008 CMU 15-721 (Spring 2018)

11 TECH N O LO GIES Phase-Change Memory (PRAM) Resistive RAM (ReRAM) Magnetoresistive RAM (MRAM) CMU 15-721 (Spring 2018)

12 PH ASE- CH AN GE M EM O RY Storage cell is comprised of two metal electrodes separated by a resistive heater and the phase change material (chalcogenide). The value of the cell is changed based on Bitline how the material is heated. chalcogenide → A short pulse changes the cell to a ‘0’. → A long, gradual pulse changes the cell to a ‘1’. Heater Access PHASE CHANGE MEMORY ARCHITECTURE AND THE QUEST FOR SCALABILITY Communications of the ACM 2010 CMU 15-721 (Spring 2018)

13 RESISTIVE RAM Two metal layers with two TiO 2 layers in between. Running a current one direction moves electrons from the top TiO 2 layer to the bottom, thereby changing the resistance. May be programmable storage fabric… Platinum → Bertrand Russell’s Material Implication Logic TiO 2-x Layer TiO 2 Layer Platinum HOW WE FOUND THE MISSING MEMRISTOR IEEE Spectrum 2008 CMU 15-721 (Spring 2018)

14 M AGN ETO RESISTIVE RAM Stores data using magnetic storage elements instead of electric charge or current flows. Spin-Transfer Torque (STT-MRAM) is the leading technology for this type of NVM. → Supposedly able to scale to very small Fixed FM Layer→ sizes (10nm) and have SRAM latencies. Oxide Layer Free FM Layer ↔ SPIN MEMORY SHOWS ITS M MIGHT IEEE Spectrum 2014 CMU 15-721 (Spring 2018)

15 WH Y TH IS IS FO R REAL TH IS TIM E Industry has agreed to standard technologies and form factors. Linux and Microsoft have added support for NVM in their kernels (DAX). Intel has added new instructions for flushing cache lines to NVM ( CLFLUSH , CLWB ). CMU 15-721 (Spring 2018)

16 N VM DIM M FO RM FACTO RS NVDIMM-F (2015) → Flash only. Has to be paired with DRAM DIMM. NVDIMM-N (2015) → Flash and DRAM together on the same DIMM. → Appears as volatile memory to the OS. NVDIMM-P (2018) → True persistent memory. No DRAM or flash. CMU 15-721 (Spring 2018)

17 N VM CO N FIGURATIO N S DRAM as Hardware- NVM Next to NVM as Persistent Managed Cache DRAM Memory DBMS DBMS DBMS DBMS Address Space DBMS Address Space DBMS Address Space Virtual Memory Subsystem Virtual Memory Subsystem Buffer Pool DRAM Disk NVM DRAM NVM Filesystem Filesystem NVM Source: Ismail Oukid CMU 15-721 (Spring 2018)

18 N VM FO R DATABASE SYSTEM S Block-addressable NVM is not that interesting. Byte-addressable NVM will be a game changer but will require some work to use correctly. → In-memory DBMSs will be better positioned to use byte- addressable NVM. → Disk-oriented DBMSs will initially treat NVM as just a faster SSD. CMU 15-721 (Spring 2018)

19 STO RAGE & RECOVERY M ETH O DS Understand how a DBMS will behave on a system that only has byte-addressable NVM. Develop NVM-optimized implementations of standard DBMS architectures. Based on the N-Store prototype DBMS. LET'S TALK ABOUT STORAGE & RECOVERY METHODS FOR NON- VOLATILE MEMORY DATABASE SYSTEMS SIGMOD 2015 CMU 15-721 (Spring 2018)

20 SYN CH RO N IZATIO N Existing programming models assume that any write to memory is non-volatile. → CPU decides when to move data from caches to DRAM. The DBMS needs a way to ensure that data is flushed from caches to NVM. STORE Memory L1 Cache Controller L2 Cache CMU 15-721 (Spring 2018)

20 SYN CH RO N IZATIO N Existing programming models assume that any write to memory is non-volatile. → CPU decides when to move data from caches to DRAM. The DBMS needs a way to ensure that data is flushed from caches to NVM. STORE CLWB Memory L1 Cache Controller L2 Cache CMU 15-721 (Spring 2018)

20 SYN CH RO N IZATIO N Existing programming models assume that any write to memory is non-volatile. → CPU decides when to move data from caches to DRAM. The DBMS needs a way to ensure that data is flushed from caches to NVM. STORE CLWB ADR Memory L1 Cache Controller L2 Cache CMU 15-721 (Spring 2018)

21 N AM IN G If the DBMS process restarts, we need to make sure that all of the pointers for in-memory data point to the same data. Index Table Heap Tuple #00 Tuple #01 Tuple #02 Tuple #00 ( v2 ) CMU 15-721 (Spring 2018)

21 N AM IN G If the DBMS process restarts, we need to make sure that all of the pointers for in-memory data X X point to the same data. Index Table Heap Tuple #00 Tuple #01 Tuple #02 Tuple #00 ( v2 ) CMU 15-721 (Spring 2018)

21 N AM IN G If the DBMS process restarts, we need to make sure that all of the pointers for in-memory data point to the same data. Index Table Heap Tuple #00 Tuple #01 Tuple #02 Tuple #00 ( v2 ) CMU 15-721 (Spring 2018)

22 N VM - AWARE M EM O RY ALLO CATO R Feature #1: Synchronization → The allocator writes back CPU cache lines to NVM using the CLFLUSH instruction. → It then issues a SFENCE instruction to wait for the data to become durable on NVM. Feature #2: Naming → The allocator ensures that virtual memory addresses assigned to a memory-mapped region never change even after the OS or DBMS restarts. CMU 15-721 (Spring 2018)

23 DBM S EN GIN E ARCH ITECTURES Choice #1: In-place Updates → Table heap with a write-ahead log + snapshots. → Example: VoltDB Choice #2: Copy-on-Write → Create a shadow copy of the table when updated. → No write-ahead log. → Example: LMDB Choice #3: Log-structured → All writes are appended to log. No table heap. → Example: RocksDB CMU 15-721 (Spring 2018)

24 IN- PLACE UPDATES EN GIN E In-Memory In-Memory Durable Index Table Heap Storage Write-Ahead Log Tuple #00 Tuple #01 Tuple #02 Snapshots CMU 15-721 (Spring 2018)

24 IN- PLACE UPDATES EN GIN E In-Memory In-Memory Durable Index Table Heap Storage Write-Ahead Log Tuple #00 1 Tuple Delta Tuple #01 Tuple #02 Snapshots CMU 15-721 (Spring 2018)

24 IN- PLACE UPDATES EN GIN E In-Memory In-Memory Durable Index Table Heap Storage Write-Ahead Log Tuple #00 2 1 Tuple Delta Tuple #01 (!) Tuple #01 Tuple #02 Snapshots CMU 15-721 (Spring 2018)

24 IN- PLACE UPDATES EN GIN E In-Memory In-Memory Durable Index Table Heap Storage Write-Ahead Log Tuple #00 2 1 Tuple Delta Tuple #01 (!) Tuple #01 Tuple #02 Snapshots 3 Tuple #01 (!) CMU 15-721 (Spring 2018)

24 IN- PLACE UPDATES EN GIN E In-Memory In-Memory Durable Index Table Heap Storage Duplicate Data Write-Ahead Log Tuple #00 2 1 Tuple Delta Tuple #01 (!) Tuple #01 Tuple #02 Recovery Latency Snapshots 3 Tuple #01 (!) CMU 15-721 (Spring 2018)

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware - PowerPoint PPT Presentation

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware @ Andy_Pavlo // 15- 721 // Spring 2018 2 ADM IN ISTRIVIA Snowflake Guest: May 2 th @ 3:00pm Final Exam Handout: May 2 nd Code Review #2: May 2 nd @ 11:59pm We will use the

Lect 12a - Delaunay Triangulations Lect 12b - Delaunay Triangulations Lect 12c - Delaunay

Lect ure # 11 ADVANCED DATABASE SYSTEMS System Catalogs and Database Compression @

Lect ure # 01 ADVANCED DATABASE SYSTEMS Course Introduction & History of Database Systems

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ Andy_Pavlo // 15- 721 //

Lect ure # 03 ADVANCED DATABASE SYSTEMS Query Compilation @ Andy_Pavlo // 15- 721 // Spring

Lect ure # 14 ADVANCED DATABASE SYSTEMS Networking @ Andy_Pavlo // 15- 721 // Spring 2018 2

Lect ure # 06 ADVANCED DATABASE SYSTEMS Multi-Version Concurrency Control (Part II) @

Lect ure # 10 ADVANCED DATABASE SYSTEMS Storage Models & Data Layout @ Andy_Pavlo // 15-

Lect ure # 22 ADVANCED DATABASE SYSTEMS Vectorized Execution (Part II) @ Andy_Pavlo // 15-

Lect 14a - Line Arrangements: Definitions and Zone Theorem Lect 14b - Line Arrangements:

ADVANCED DATABASE SYSTEMS Database Compression @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Advanced Database CS 525: Organization? Advanced Database =Database Implementation

ADVANCED DATABASE SYSTEMS Recovery Protocols @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

ADVANCED DATABASE SYSTEMS Self-Driving Database Management Systems @ Andy_Pavlo // 15- 721 //

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Chapter 1: Introduction Components of computer security Threats Policies and

Strengthening Early Childhood in Kansas in 2019 WEBINAR April 17, 2019 Welcome Amanda

Questions of Trust Jason Quinley, Christopher Ahern University of T ubingen, University of

Board of Trustees Retreat October 26, 2013 An Equal Opportunity University A Time of Great

Utilizing the Latest Features of Intel's Performance Monitoring Unit Scalable Tools Workshop 2019

osstest The Xen Projects CI system Some interesting architectural features Xen Summit

EPSCoR Webinar OCTOBER 25, 2019 Emera Technologies, LLC Safety Briefing 2 Introduction Quik

Hardware Transactional Memory on Haswell EP Viktor Leis Technische Universitt Mnchen 1 / 14

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware - PowerPoint PPT Presentation

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware @ Andy_Pavlo // 15- 721 // Spring 2018 2 ADM IN ISTRIVIA Snowflake Guest: May 2 th @ 3:00pm Final Exam Handout: May 2 nd Code Review #2: May 2 nd @ 11:59pm We will use the

Lect 12a - Delaunay Triangulations Lect 12b - Delaunay Triangulations Lect 12c - Delaunay

Lect ure # 11 ADVANCED DATABASE SYSTEMS System Catalogs and Database Compression @

Lect ure # 01 ADVANCED DATABASE SYSTEMS Course Introduction &amp; History of Database Systems

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ Andy_Pavlo // 15- 721 //

Lect ure # 03 ADVANCED DATABASE SYSTEMS Query Compilation @ Andy_Pavlo // 15- 721 // Spring

Lect ure # 14 ADVANCED DATABASE SYSTEMS Networking @ Andy_Pavlo // 15- 721 // Spring 2018 2

Lect ure # 06 ADVANCED DATABASE SYSTEMS Multi-Version Concurrency Control (Part II) @

Lect ure # 10 ADVANCED DATABASE SYSTEMS Storage Models &amp; Data Layout @ Andy_Pavlo // 15-

Lect ure # 22 ADVANCED DATABASE SYSTEMS Vectorized Execution (Part II) @ Andy_Pavlo // 15-

Lect 14a - Line Arrangements: Definitions and Zone Theorem Lect 14b - Line Arrangements:

ADVANCED DATABASE SYSTEMS Database Compression @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Advanced Database CS 525: Organization? Advanced Database =Database Implementation

ADVANCED DATABASE SYSTEMS Recovery Protocols @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

ADVANCED DATABASE SYSTEMS Self-Driving Database Management Systems @ Andy_Pavlo // 15- 721 //

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Chapter 1: Introduction Components of computer security Threats Policies and

Strengthening Early Childhood in Kansas in 2019 WEBINAR April 17, 2019 Welcome Amanda

Questions of Trust Jason Quinley, Christopher Ahern University of T ubingen, University of

Board of Trustees Retreat October 26, 2013 An Equal Opportunity University A Time of Great

Utilizing the Latest Features of Intel's Performance Monitoring Unit Scalable Tools Workshop 2019

osstest The Xen Projects CI system Some interesting architectural features Xen Summit

EPSCoR Webinar OCTOBER 25, 2019 Emera Technologies, LLC Safety Briefing 2 Introduction Quik

Hardware Transactional Memory on Haswell EP Viktor Leis Technische Universitt Mnchen 1 / 14

Lect ure # 01 ADVANCED DATABASE SYSTEMS Course Introduction & History of Database Systems

Lect ure # 10 ADVANCED DATABASE SYSTEMS Storage Models & Data Layout @ Andy_Pavlo // 15-