Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, - PowerPoint PPT Presentation

Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich

Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich Application’s perspective of a rack

FruitBox – a data processing system Transactional QP Graph processing Analytical QP Machine learning Ad-hoc BI QP MULTI FLAVOR DATA PROCESSING Workshop for Rack-scale Computing 3

FruitBox  Building a system for multi-flavor data processing: 1. Hardware that meets the resource demand. 2. System architecture to support workload heterogeneity. 3. Aim for 10s-100s millions of MULTI FLAVOR DATA PROCESSING requests per second. Graph processing Machine learning 4. Efficient resource utilization. Transactional QP Analytical QP Ad-hoc BI QP Workshop for Rack-scale Computing 4

FruitBox – a rack-scale data processing system  Which box could run such a heterogeneous WL?  A multicore is not enough  A rack-scale system:  More resources  Better isolation  Blurring the machine-cluster boundaries MULTI FLAVOR DATA PROCESSING RACK-SCALE SYSTEM 1000s of cores Graph processing TBs of RAM InfiniBand Machine learning Transactional QP Analytical QP Ad-hoc BI QP Workshop for Rack-scale Computing 5

Rack-scale data processing system Custom build a rack-scale system for data processing? Many such commercial systems exists – Data Appliances Netezza (IBM ) TwinFin Oracle Exadata and many more … Workshop for Rack-scale Computing 6

System design for Multi-flavor data processing  Separate data-storage from data-processing Data processing Transactional Analytical Machine processing processing Learning Storage Engine  Achieve both physical and logical data independence Workshop for Rack-scale Computing 7

Storage Engine Data processing Tuple- and batch-based interface Transactional Analytical Machine to the storage engine. processing processing Learning Storage Engine Workshop for Rack-scale Computing 8

Storage Engine Data processing Tuple- and batch-based interface Transactional Analytical Machine to the storage engine. processing processing Learning Storage Engine Storage engine components:  KV Stores (B-tree)  Crescando Scans KV Stores → transactional Scans → analytical KV Stores Scans Workshop for Rack-scale Computing 9

Storage Engine Data processing Tuple- and batch-based interface Transactional Analytical Machine to the storage engine. processing processing Learning Storage Engine Storage engine components:  KV Stores (B-tree)  Crescando Scans MVCC: Snapshot isolation KV Stores → transactional Scans → analytical KV Stores Scans Transaction logic separated from query processing. Hyder [SIGMOD’15], HyPer[VLDB’11], Hekaton [SIGMOD’14], 10 SharedDB [Giannikis PhD’14], Tell [SIGMOD’15], Multimed [Eurosys’11]

Handling millions of requests/second  It makes no sense to process them individually if they access the same data.  Why should each query scan a TB of data? vs  Batch requests – share data, computation, bandwidth … for higher throughput and predictable performance trading off a bit of latency. IBM Blink, MonetDB/X100[VLDB’07], CJOIN [VLDB’09], Crescando[VLDB’09], SharedDB [VLDB’12,’14], 11 Workshop for Rack-scale Computing

Efficient resource utilization  Noisy system environment MULTI FLAVOR DATA PROCESSING  Load interaction Graph processing Unpredictable performance Machine learning Not meeting SLAs Transactional QP Analytical QP  Resource overprovisioning Ad-hoc BI QP Inefficiency and higher cost  Getting the most out of such a complex system requires cross-layer optimization.  e.g. DB/OS co-design  Already some work on multicore systems. 12

COD: DB/OS co-design What is the knowledge we have? DB Who knows what? Application requirements and characteristics Big semantic gap! Hardware & architecture + System state and utilization of resources OS COD: Database/Operating System co-design [CIDR’12] Workshop for Rack-scale Computing 12

COD’s interface DBMS DB storage engine other apps Explicit allocation Notification on updates DB/OS Interface Constraints and requirements OS policy engine OS Workshop for Rack-scale Computing 13

Adaptability to dynamic system state Experiment setup • AMD MagnyCours • 4 x 2.2GHz AMD Opteron 6174 processors Adaptability – Latency • total Datastore size 53GB • Noise: another CPU-intensive task 9 running on core 0 8 7 Naïve datastore engine 6 Latency [sec] 5 4 COD SLA 3 2 1 0 0 5 10 15 20 Elapsed time [min] Workshop for Rack-scale Computing 14

Resource efficient deployment DB OS Resource requirements Query plan of operators Multicore machine Data dependency Resource Activity Model of multicore graph Vectors machine Deployment algorithm Deployment of operators to CPU cores Deployment of query plans on multicores [VLDB’15] Workshop for Rack-scale Computing 15

Evaluation Query plan  SharedDB’s TPC-W [1]  11 web-interactions in one query plan  44 operators  20GB dataset AMD Magnycours 1 5 26 30 34  4 x 2 dies: D 0 4  R 6 cores L3 cache A  5 MB L3 cache 3 7 M 38 42 46  16 GB NUMA node 2 6 Workshop for Rack-scale Computing 16 [1] SharedDB – Giannikis et al. VLDB’12

Comparison with standard approaches Throughput [WIPS] Response Time [ms] 50 th 90 th 99 th Approaches # cores Average Stdev Default OS 48 Operator per core 44 Deployment algorithm Workshop for Rack-scale Computing 17

Comparison with standard approaches Throughput [WIPS] Response Time [ms] 50 th 90 th 99 th Approaches # cores Average Stdev Default OS 48 317.30 31.11 8.22 72.43 82.03 Operator per core 44 425.86 54.34 14.59 22.93 36.08 Deployment algorithm Workshop for Rack-scale Computing 18

Comparison with standard approaches Throughput [WIPS] Response Time [ms] 50 th 90 th 99 th Approaches # cores Average Stdev Default OS 48 317.30 31.11 8.22 72.43 82.03 Operator per core 44 425.86 54.34 14.59 22.93 36.08 Deployment algorithm 6 Workshop for Rack-scale Computing 19

Comparison with standard approaches Throughput [WIPS] Response Time [ms] 50 th 90 th 99 th Approaches # cores Average Stdev Default OS 48 317.30 31.11 8.22 72.43 82.03 Operator per core 44 425.86 54.34 14.59 22.93 36.08 Deployment algorithm 6 428.07 32.80 15.36 23.73 36.13 Workshop for Rack-scale Computing 20

Comparison with standard approaches Throughput [WIPS] Response Time [ms] 50 th 90 th 99 th Approaches # cores Average Stdev Default OS 48 317.30 31.11 8.22 72.43 82.03 Operator per core 44 425.86 54.34 14.59 22.93 36.08 Deployment algorithm 6 428.07 32.80 15.36 23.73 36.13 Performance / Resource efficiency savings of x 7.37 Workshop for Rack-scale Computing 21

Conclusion  Multi-flavor data processing system  We have all the pieces of the puzzle Separate data- Batching as a first storage from class citizen data-processing Efficient resource … on a rack-scale management system Putting them together opens a lot of opportunities. Workshop for Rack-scale Computing 22

Conclusion  Multi-flavor data processing system  We have all the pieces of the puzzle  Intelligent storage engine: Separate data- Batching as a first  Co-processors, active-memory, hardware specialization (FPGAs) storage from class citizen  Optimizing the network stack: data-processing  … for different memory access patterns  Extend the cross-layer interface: Efficient resource  DB optimizer that is aware of the complexity of the rack … on a rack-scale management system  Rack-scale resource management Putting them together opens a lot of opportunities. Workshop for Rack-scale Computing 23

Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, - PowerPoint PPT Presentation

Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich Rack-scale Data Processing System Jana Giceva , Darko Makreshanski,

Do we need Rack-Scale Coordination? Alysson Bessani 1 April 21th, 2015 Rack-Scale Computers

Rack in Rails 3 <http://twitter.com/rtomayko> Ryan Tomayko GitHub Rack (Core Team)

RACK: a time-based fast loss recovery draft-ietf-tcpm-rack-01 Yuchung Cheng Neal Cardwell

Towards Reconfigurable Rack-Scale Networking Tyler Szepesi , Bernard Wong, Tim Brecht, Sajjad Rizvi

MRG - AMQP trading system in a rack Carl Trieloff Senior Consulting Software Engineer/ Director

http://rack.github.com Thursday, November 11, 2010 Rack provides a minimal, modular and adaptable

Technical Information Rack Slide Dimension Drawing and Usage Table Dimension Diagram Rack

RACK for SCTP Felix Weinrank Michael Txen Erwin P. Rathgeb Agenda A brief introdcution

Exo: Atomic Broadcast for the Rack-Scale Computer Matthew P. Grosvenor Marwan Fayed Andrew W.

Energy-Efficient Building Blocks For Rack Scale Computing Work In Progress Rami Alkubaty

STANlite a database engine for secure data processing at rack-scale level IEEE International

61A Lecture 30 Announcements Data Processing Data Processing 4 Data Processing Many data sets

Coflow Recent Advances and Whats Next? Mosharaf Chowdhury University of Michigan Rack-Scale

Truck Boat Tail Folding Seat Bike Rack Goals: Improve highway gas mileage on tractor Goals: Allow

Bicycle Rack Voucher Project RFP Pre-Bidders Conference March 6, 2014 1 Overview

CAD Geometry Original The pipe rack structures were represented as solid obstructions; flow is

CS 744: Big Data Systems Shivaram Venkataraman Fall 2019 Who am I ? Assistant Professor in

Offline Data Processing: Tasks and Infrastructure Support T. Yang, UCSB 293S Table of Content

Embedded Systems Design with Statecharts Gerald L uttgen Department of Computer Science

Principles of Program Analysis: Data Flow Analysis Transparencies based on Chapter 2 of the book:

What is Scalable Data Processing? S CALABLE DATA P ROCES S IN G IN R Michael J. Kane and Simon

Data and Process Modelling 2. Information System Development Marco Montali KRDB Research Centre

Data reduction and processing tutorial Petr V. Konarev European Molecular Biology Laboratory,

Heterogenous Resources for data processing Heterogenous Execution for CMSSW Concentrating

Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, - PowerPoint PPT Presentation

Rack-scale Data Processing System Jana Giceva , Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich Rack-scale Data Processing System Jana Giceva , Darko Makreshanski,

Do we need Rack-Scale Coordination? Alysson Bessani 1 April 21th, 2015 Rack-Scale Computers

Rack in Rails 3 &lt;http://twitter.com/rtomayko&gt; Ryan Tomayko GitHub Rack (Core Team)

RACK: a time-based fast loss recovery draft-ietf-tcpm-rack-01 Yuchung Cheng Neal Cardwell

Towards Reconfigurable Rack-Scale Networking Tyler Szepesi , Bernard Wong, Tim Brecht, Sajjad Rizvi

MRG - AMQP trading system in a rack Carl Trieloff Senior Consulting Software Engineer/ Director

http://rack.github.com Thursday, November 11, 2010 Rack provides a minimal, modular and adaptable

Technical Information Rack Slide Dimension Drawing and Usage Table Dimension Diagram Rack

RACK for SCTP Felix Weinrank Michael Txen Erwin P. Rathgeb Agenda A brief introdcution

Exo: Atomic Broadcast for the Rack-Scale Computer Matthew P. Grosvenor Marwan Fayed Andrew W.

Energy-Efficient Building Blocks For Rack Scale Computing Work In Progress Rami Alkubaty

STANlite a database engine for secure data processing at rack-scale level IEEE International

61A Lecture 30 Announcements Data Processing Data Processing 4 Data Processing Many data sets

Coflow Recent Advances and Whats Next? Mosharaf Chowdhury University of Michigan Rack-Scale

Truck Boat Tail Folding Seat Bike Rack Goals: Improve highway gas mileage on tractor Goals: Allow

Bicycle Rack Voucher Project RFP Pre-Bidders Conference March 6, 2014 1 Overview

CAD Geometry Original The pipe rack structures were represented as solid obstructions; flow is

CS 744: Big Data Systems Shivaram Venkataraman Fall 2019 Who am I ? Assistant Professor in

Offline Data Processing: Tasks and Infrastructure Support T. Yang, UCSB 293S Table of Content

Embedded Systems Design with Statecharts Gerald L uttgen Department of Computer Science

Principles of Program Analysis: Data Flow Analysis Transparencies based on Chapter 2 of the book:

What is Scalable Data Processing? S CALABLE DATA P ROCES S IN G IN R Michael J. Kane and Simon

Data and Process Modelling 2. Information System Development Marco Montali KRDB Research Centre

Data reduction and processing tutorial Petr V. Konarev European Molecular Biology Laboratory,

Heterogenous Resources for data processing Heterogenous Execution for CMSSW Concentrating

Rack in Rails 3 <http://twitter.com/rtomayko> Ryan Tomayko GitHub Rack (Core Team)