what are we doing with our lives
play

WHAT ARE WE DOING WITH OUR LIVES? Nobody Cares About Our - PowerPoint PPT Presentation

WHAT ARE WE DOING WITH OUR LIVES? Nobody Cares About Our Concurrency Control Research SIGMOD 2017 @ andy_pavlo I am only allowed 3 plugs in this talk. # 3 DISK-ORIENTED CONCURRENCY CONTROL Allows a DBMS to mask the latency of


  1. WHAT ARE WE DOING WITH OUR LIVES? Nobody Cares About Our Concurrency Control Research SIGMOD 2017 @ andy_pavlo

  2. I am only allowed 3 plugs in this talk. #

  3. 3 DISK-ORIENTED CONCURRENCY CONTROL Allows a DBMS to mask the latency of non-volatile storage. Jim Gray Pioneering work on transaction processing from the 1970s. The Great Phil Bernstein

  4. 4 IN-MEMORY CONCURRENCY CONTROL New concurrency control schemes are needed if the database is assumed to be in memory. Early research in 1980s. Some commercial DBMSs from 1990s.

  5. 5 21 ST CENTURY RESEARCH ON IN-MEMORY CONCURRENCY CONTROL Partitioned Protocols → H-Store (VLDB 2007) Non-Partitioned Protocols → Microsoft Hekaton (VLDB 2011) → Silo (SOSP 2013)

  6. 6 NOBODY CARES ABOUT OUR CONCURRENCY CONTROL RESEARCH All of this research is great for “classic” OLTP applications. We are not addressing the needs of new fields and environments.

  7. 7 NOBODY CARES ABOUT OUR CONCURRENCY CONTROL RESEARCH Peter Bailis examined real-world DB applications. Few of them use txns and many of them don’t use them correctly.

  8. 7 NOBODY CARES ABOUT OUR CONCURRENCY CONTROL RESEARCH Peter Bailis examined real-world DB applications. Few of them use txns and many of them don’t use them correctly. We did an automated evaluation with the CMDBAC corpus. Few apps written in popular frameworks use txns. 1

  9. 8 COMMON ASSUMPTIONS MADE IN CONCURRENCY CONTROL RESEARCH Assumption #1: All transactions execute as stored procedures. Assumption #2: All transactions execute with serializable isolation.

  10. 9 CONFERENCE PAPER SURVEY Examined SIGMOD and VLDB publications from 2011-2016. We found 95 out of 1843 (5%) papers on transaction processing and concurrency control.

  11. 14 DATABASE ADMIN SURVEY OVERVIEW We commissioned a survey of DBAs in April 2017 on how applications use databases. 50 responses for 79 DBMS installations. +Nine others

  12. 15 DATABASE ADMIN SURVEY STORED PROCEDURES What percentage of the transactions run on your DBMS are executed as stored procedures? 25 # of Responses 21 20 20 15 12 11 9 10 4 5 0 None 1-10% 11-25% 26-50% 51-75% 76-100%

  13. 16 DATABASE ADMIN SURVEY ISOLATION LEVEL What isolation level do transactions execute at on this DBMS? None Few Most All 30 # of Responses 26 22 20 12 12 12 11 11 10 10 10 8 8 10 6 5 5 4 3 3 2 2 2 1 1 0 0 Read Read Cursor Repeatable Snapshot Serializable Uncommitted Committed Stability Read Isolation

  14. 17 DATABASE ADMIN SURVEY FEEDBACK Stored Procedures → Software engineering challenges. → Don’t want devs to update too often. Serializable Isolation → It was always done this way. → Not worth the overhead.

  15. 18 WHAT DOES THIS MEAN FOR OUR RESEARCH? Assuming that every txn executes as a stored procedure with serializable isolation changes the bottleneck. You end up optimizing things that are not as important as you think…

  16. Aren’t I being hypocritical?

  17. 22 A RESEARCH AGENDA FOR THE NEXT 10 YEARS → Examine Entire DBMS Architecture → Communication Overhead → Understand Lower Isolation Levels

  18. 23 IN-MEMORY MULTI-VERSION CONCURRENCY CONTROL STUDY The DBMS’s concurrency control protocol is not the only critical part of executing txns in a DBMS.

  19. 23 IN-MEMORY MULTI-VERSION CONCURRENCY CONTROL STUDY The DBMS’s concurrency control protocol is not the only critical part of executing txns in a DBMS.

  20. 23 IN-MEMORY MULTI-VERSION CONCURRENCY CONTROL STUDY The DBMS’s concurrency control protocol is not the only critical part of executing txns in a DBMS. → Secondary Indexes → Version Storage / Ordering → Garbage Collection

  21. 24 IN-MEMORY MULTI-VERSION CONCURRENCY CONTROL STUDY Hybrid Workload MVCC TPC-C + OLAP Query (40wh) Configurations 90 Oracle/MySQL Throughput NuoDB (K txn/sec) HYRISE 60 MemSQL HyPer 30 SAP HANA Hekaton 0 Postgres 2 8 16 24 32 40 2.5 # of Threads AN EMPIRICAL EVALUATION OF IN-MEMORY MULTI-VERSION CONCURRENCY CONTROL VLDB 2017

  22. 25 RE-EXAMINE DBMS COMMUNICATION OVERHEAD Most applications are in the same data center as the DBMS machine. Kernel bypass methods: → RDMA → Intel DPDK Prefetching with machine learning.

  23. 26 UNDERSTAND LOWER ISOLATION LEVELS We don’t understand how applications are affected by lower isolation levels. Maybe READ COMMITTED is good enough or maybe people don’t know how dirty their data actually is…

  24. 27 WHAT ARE WE DOING WITH OUR LIVES? CONCLUSION It is (still) an interesting time for database research. Let’s make sure we work on the right problems. We need a better way of collecting information about applications.

  25. 28 SOME PEOPLE DO CARE ABOUT OUR CONCURRENCY CONTROL RESEARCH Serializable Deterministic Snapshot Isolation Concurrency Control Michael Cahill Dan Abadi

  26. END @ andy_pavlo

  27. Joy Arulraj Winter 2018 3

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend