Database Storage Part I Lecture # 03 Database Systems Andy Pavlo - PowerPoint PPT Presentation

Database Storage Part I Lecture # 03 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645 Carnegie Mellon Univ. Fall 2018

2 ADM IN ISTRIVIA Homework #1 is due Monday September 10 th @ 11:59pm Project #1 will be released on Wednesday September 12 th CMU 15-445/645 (Fall 2018)

3 UPCO M IN G DATABASE EVEN TS Kinetica Talk → Thursday Sep 6 th @ 12pm → CIC 4 th Floor SalesForce Talk → Friday Sep 7 th @ 12pm → CIC 4 th Floor Relational AI Talk → Wednesday @ Sep 12 th @ 4:00pm → GHC 8102 CMU 15-445/645 (Fall 2018)

4 OVERVIEW We now understand what a database looks like at a logical level and how to write queries to read/write data from it. We will next learn how to build software that manages a database. CMU 15-445/645 (Fall 2018)

5 CO URSE O UTLIN E Relational Databases Query Planning Storage Operator Execution Execution Concurrency Control Access Methods Recovery Buffer Pool Manager Distributed Databases Disk Manager Potpourri CMU 15-445/645 (Fall 2018)

6 DISK- O RIEN TED ARCH ITECTURE The DBMS assumes that the primary storage location of the database is on non-volatile disk. The DBMS's components manage the movement of data between non-volatile and volatile storage. CMU 15-445/645 (Fall 2018)

7 STO RAGE H IERARCH Y Faster Smaller CPU Registers CPU Caches Volatile Random Access Byte-Addressable DRAM Non-Volatile SSD Sequential Access Block-Addressable HDD Slower Larger Network Storage CMU 15-445/645 (Fall 2018)

7 STO RAGE H IERARCH Y Faster Smaller CPU Registers CMU 15-721 (Spring 2019) CPU Caches Memory DRAM SSD Disk HDD Slower Larger Network Storage CMU 15-445/645 (Fall 2018)

7 STO RAGE H IERARCH Y Faster Smaller CPU Registers CMU 15-721 (Spring 2019) CPU Caches Memory DRAM Non-volatile Memory SSD Disk HDD Slower Larger Network Storage CMU 15-445/645 (Fall 2018)

8 ACCESS TIM ES 0.5 sec 0.5 ns L1 Cache Ref 7 sec 7 ns L2 Cache Ref 1 00 sec 100 ns DRAM 1 .7 days 150,000 ns SSD 1 6.5 weeks 10,000,000 ns HDD 1 1 .4 months ~30,000,000 ns Network Storage 1,000,000,000 ns Tape Archives 31 .7 years [Source] CMU 15-445/645 (Fall 2018)

9 SYSTEM DESIGN GOALS Allow the DBMS to manage databases that exceed the amount of memory available. Reading/writing to disk is expensive, so it must be managed carefully to avoid large stalls and performance degradation. CMU 15-445/645 (Fall 2018)

10 SEQ UEN TIAL VS. RAN DO M ACCESS Random access on an HDD is much slower than sequential access. Traditional DBMSs are designed to maximize sequential access. → Algorithms try to reduce number of writes to random pages so that data is stored in contiguous blocks. → Allocating multiple pages at the same time is called an extent. CMU 15-445/645 (Fall 2018)

11 WH Y N OT USE TH E O S? One can use mmap to map the contents of a file into a process' address space. The OS is responsible for moving data for moving the files' pages in and out of memory. page1 page2 page3 page4 On-Disk File CMU 15-445/645 (Fall 2018)

11 WH Y N OT USE TH E O S? One can use mmap to map the contents Virtual Physical Memory Memory of a file into a process' address space. page1 The OS is responsible for moving data page2 for moving the files' pages in and out page3 of memory. page4 page1 page2 page3 page4 On-Disk File CMU 15-445/645 (Fall 2018)

11 WH Y N OT USE TH E O S? One can use mmap to map the contents Virtual Physical Memory Memory of a file into a process' address space. page1 page1 page1 The OS is responsible for moving data page2 for moving the files' pages in and out page3 of memory. page4 page1 page2 page3 page4 On-Disk File CMU 15-445/645 (Fall 2018)

11 WH Y N OT USE TH E O S? One can use mmap to map the contents Virtual Physical Memory Memory of a file into a process' address space. page1 page1 page1 The OS is responsible for moving data page2 page3 for moving the files' pages in and out page3 page3 of memory. page4 page1 page2 page3 page4 On-Disk File CMU 15-445/645 (Fall 2018)

11 WH Y N OT USE TH E O S? One can use mmap to map the contents Virtual Physical Memory Memory of a file into a process' address space. page1 page1 page1 The OS is responsible for moving data ??? page2 page3 for moving the files' pages in and out page3 page3 of memory. page4 page1 page2 page3 page4 On-Disk File CMU 15-445/645 (Fall 2018)

12 WH Y N OT USE TH E O S? What if we allow multiple threads to access the mmap files to hide page fault stalls? This works good enough for read-only access. It is complicated when there are multiple writers… CMU 15-445/645 (Fall 2018)

13 WH Y N OT USE TH E O S? Full Usage There are some solutions to this problem: → madvise : Tell the OS how you expect to read certain pages. → mlock : Tell the OS that memory ranges Partial Usage cannot be paged out. → msync : Tell the OS to flush memory ranges out to disk. CMU 15-445/645 (Fall 2018)

14 WH Y N OT USE TH E O S? DBMS (almost) always wants to control things itself and can do a better job at it. → Flushing dirty pages to disk in the correct order. → Specialized prefetching. → Buffer replacement policy. → Thread/process scheduling. The OS is not your friend. CMU 15-445/645 (Fall 2018)

15 DATABASE STO RAGE Problem #1: How the DBMS represents the database in files on disk. Problem #2: How the DBMS manages its memory and move data back-and-forth from disk. CMU 15-445/645 (Fall 2018)

16 TO DAY'S AGEN DA File Storage Page Layout Tuple Layout CMU 15-445/645 (Fall 2018)

17 FILE STO RAGE The DBMS stores a database as one or more files on disk. The OS doesn't know anything about these files. → All of the standard filesystem protections are used. → Early systems in the 1980s used custom "filesystems" on raw storage. CMU 15-445/645 (Fall 2018)

18 STO RAGE M AN AGER The storage manager is responsible for maintaining a database's files. It organizes the files as a collection of pages. → Tracks data read/written to pages. → Tracks the available space. CMU 15-445/645 (Fall 2018)

19 DATABASE PAGES A page is a fixed-size block of data. → It can contain tuples, meta- data, indexes, log records… → Most systems do not mix page types. → Some systems require a page to be self-contained. Each page is given a unique identifier. → The DBMS uses an indirection layer to map page ids to physical locations. CMU 15-445/645 (Fall 2018)

20 DATABASE PAGES 1KB There are three different notions of "pages" in a DBMS: → Hardware Page (usually 4KB) 4KB → OS Page (usually 4KB) → Database Page (1-16KB) By hardware page, we mean at what 8KB level the device can guarantee a "failsafe write". 16KB CMU 15-445/645 (Fall 2018)

21 PAGE STO RAGE ARCH ITECTURE Different DBMSs manage pages in files on disk in different ways. → Heap File Organization → Sequential / Sorted File Organization → Hashing File Organization At this point in the hierarchy we don't need to know anything about what is inside of the pages. CMU 15-445/645 (Fall 2018)

22 DATABASE H EAP A heap file is an unordered collection of pages where tuples that are stored in random order. → Get / Delete Page → Must also support iterating over all pages. Need meta-data to keep track of what pages exist and which ones have free space. Two ways to represent a heap file: → Linked List → Page Directory CMU 15-445/645 (Fall 2018)

23 H EAP FILE: LIN KED LIST Maintain a header page at the Free Page Page Page List beginning of the file that stores two … pointers: Header → HEAD of the free page list. Data Data → HEAD of the data page list. Each page keeps track of the number Page Page of free slots in itself. … Data Page Data Data List CMU 15-445/645 (Fall 2018)

24 H EAP FILE: PAGE DIRECTO RY Page Data The DBMS maintains special pages that tracks the location of data pages Directory Page in the database files. Data The directory also records the number of free slots per page. … … Page The DBMS has to make sure that the directory pages are in sync with the Data data pages. CMU 15-445/645 (Fall 2018)

27 TO DAY'S AGEN DA File Storage Page Layout Tuple Layout CMU 15-445/645 (Fall 2018)

28 PAGE H EADER Page Every page contains a header of meta- Header data about the page's contents. → Page Size → Checksum Data → DBMS Version → Transaction Visibility → Compression Information Some systems require pages to be self- contained (e.g., Oracle). CMU 15-445/645 (Fall 2018)

29 PAGE LAYO UT For any page storage architecture, we now need to understand how to organize the data stored inside of the page. → We are still assuming that we are only storing tuples. Two approaches: → Tuple-oriented → Log-structured CMU 15-445/645 (Fall 2018)

Database Storage Part I Lecture # 03 Database Systems Andy Pavlo - PowerPoint PPT Presentation

Database Storage Part I Lecture # 03 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645 Carnegie Mellon Univ. Fall 2018 2 ADM IN ISTRIVIA Homework #1 is due Monday September 10 th @ 11:59pm Project #1 will be released on

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Storage and File Structure December 12, 2008 Storage and File Structure Magnetic Discs RAID

> SUN STORAGE 7000 UNIFIED STORAGE SYSTEMS ITS TIME TO CHANGE YOUR STORAGE

NEBC Database Course 2008 Database Servers Database Interfaces Tim Booth : tbooth@ceh.ac.uk

Storage Devices for Database Systems 5DV120 Database System Principles Ume a University

Data Structure and Storage Storage CS386, Introduction to Database Systems Jay Urbain Credits:

SUSE Enterprise Storage 6 Darren Soothill EMEA Storage Technical Strategist Agenda

Solar Plus Storage Solar Plus Storage Focus on Storage Benefits Focus on Storage Benefits by

Hybrid SAN & Cluster Enterprise Network Storage Hikvision Enterprise Network Storage

INF5470 Fall 2012 Lecture 10: Analog Storage Content Overview Volatile Short Term Storage

National Address Database National Address Database What is a National Address Database?

DATABASE SECURITY CS4750 Database Systems Prof. Nada Basit Email: basit@virginia.edu Fall

DATABASE SECURITY CS4750 Database Systems Prof. Nada Basit Email: basit@virginia.edu Fall

DATABASE SYSTEMS Database programming in a web environment Database System Course, 2016-2017

DATABASE SYSTEMS Database programming in a web environment Database System Course AGENDA FOR

Advanced Database CS 525: Organization? Advanced Database =Database Implementation

Sequential team form and its simplification using graphical models Aditya Mahajan and Sekhar

ALIGNMENT-FREE SEQUENCE COMPARISON OVER HADOOP FOR COMPUTATIONAL BIOLOGY Giuseppe Cattaneo,

= Set Reset 0 S 0 R Q Q Q Q Sequential Logic 3 Sequential Logic 4 SR latch D latch

Unit 13 Sequential Logic Constructs 13.2 Learning Outcomes I understand the difference

NFS Tricks and Benchmarking Traps Daniel Ellard and Margo Seltzer FREENIX 2003 - June 12, 2003

The I/O-Model Aggarwal and Vitter, The Input/Output Complexity of Sorting and Related Problems

Understanding CPU Caches Ulrich Drepper Introduction Discrepancy main CPU and main memory speed

Estimating Risk under Estimating statistics . . . Linearized techniques Interval Uncertainty:

Database Storage Part I Lecture # 03 Database Systems Andy Pavlo - PowerPoint PPT Presentation

Database Storage Part I Lecture # 03 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645 Carnegie Mellon Univ. Fall 2018 2 ADM IN ISTRIVIA Homework #1 is due Monday September 10 th @ 11:59pm Project #1 will be released on

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Storage and File Structure December 12, 2008 Storage and File Structure Magnetic Discs RAID

&gt; SUN STORAGE 7000 UNIFIED STORAGE SYSTEMS ITS TIME TO CHANGE YOUR STORAGE

NEBC Database Course 2008 Database Servers Database Interfaces Tim Booth : tbooth@ceh.ac.uk

Storage Devices for Database Systems 5DV120 Database System Principles Ume a University

Data Structure and Storage Storage CS386, Introduction to Database Systems Jay Urbain Credits:

SUSE Enterprise Storage 6 Darren Soothill EMEA Storage Technical Strategist Agenda

Solar Plus Storage Solar Plus Storage Focus on Storage Benefits Focus on Storage Benefits by

Hybrid SAN &amp; Cluster Enterprise Network Storage Hikvision Enterprise Network Storage

INF5470 Fall 2012 Lecture 10: Analog Storage Content Overview Volatile Short Term Storage

National Address Database National Address Database What is a National Address Database?

DATABASE SECURITY CS4750 Database Systems Prof. Nada Basit Email: basit@virginia.edu Fall

DATABASE SECURITY CS4750 Database Systems Prof. Nada Basit Email: basit@virginia.edu Fall

DATABASE SYSTEMS Database programming in a web environment Database System Course, 2016-2017

DATABASE SYSTEMS Database programming in a web environment Database System Course AGENDA FOR

Advanced Database CS 525: Organization? Advanced Database =Database Implementation

Sequential team form and its simplification using graphical models Aditya Mahajan and Sekhar

ALIGNMENT-FREE SEQUENCE COMPARISON OVER HADOOP FOR COMPUTATIONAL BIOLOGY Giuseppe Cattaneo,

= Set Reset 0 S 0 R Q Q Q Q Sequential Logic 3 Sequential Logic 4 SR latch D latch

Unit 13 Sequential Logic Constructs 13.2 Learning Outcomes I understand the difference

NFS Tricks and Benchmarking Traps Daniel Ellard and Margo Seltzer FREENIX 2003 - June 12, 2003

The I/O-Model Aggarwal and Vitter, The Input/Output Complexity of Sorting and Related Problems

Understanding CPU Caches Ulrich Drepper Introduction Discrepancy main CPU and main memory speed

Estimating Risk under Estimating statistics . . . Linearized techniques Interval Uncertainty:

> SUN STORAGE 7000 UNIFIED STORAGE SYSTEMS ITS TIME TO CHANGE YOUR STORAGE

Hybrid SAN & Cluster Enterprise Network Storage Hikvision Enterprise Network Storage