Towards a Learning Optimizer for Shared Clouds* (PowerPoint presentation)



SLIDE 1

Towards a Learning Optimizer for Shared Clouds*

Chenggang Wu, Alekh Jindal, Saeed Amizadeh, Hiren Patel, Wangchao Le, Shi Qiao, Sriram Rao

* C. Wu, A. Jindal, S. Amizadeh, H. Patel, W. Le, S. Qiao, and S. Rao. Towards a Learning Optimizer for Shared Clouds. In PVLDB, 12(3): 210–222, 2018.

February 8, 2019

SLIDE 2

Rise of Big Data Systems

BIG DATA SYSTEM

Hive Spark Flink Calcite BigQuery Big SQL HDInsight SCOPE Etc.

  • Declarative query interface
  • Cost-based query optimizer (CBO)

SELECT Customer.cname, Item.iname
FROM Customer
INNER JOIN Order ON Customer.cid == Order.cid
INNER JOIN Item ON Item.iid == Order.iid
WHERE Item.iprice > 100 AND Customer.cage < 18;

[Query plan: Customer JOIN Order ON cid, JOIN Item ON iid, with filters Age < 18 on Customer and Price > 100 on Item; per-operator cardinality estimates annotated]

Good plan => good performance
Problem: the CBO can make mistakes
  • especially in cardinality estimation
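A toy illustration (hypothetical numbers, not SCOPE's actual estimator) of the classic failure mode behind these mistakes: treating filter predicates as independent and multiplying their selectivities.

```python
# Toy sketch of the independence assumption used by many cost-based
# optimizers: estimated output rows = input rows * product of selectivities.

def estimate_filter_card(table_rows, selectivities):
    """Estimated output rows under the independence assumption."""
    card = float(table_rows)
    for s in selectivities:
        card *= s
    return card

# Two predicates that each keep 10% of rows. If they are correlated and in
# fact select the same rows, the true output is 100,000 rows, but the
# independence assumption predicts only 10,000: a 10x underestimate that
# compounds further up the plan.
estimated = estimate_filter_card(1_000_000, [0.1, 0.1])
```

Errors like this multiply across operators, which is why misestimates grow by orders of magnitude in deep plans.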


SLIDE 3

Rise of Big Data Systems

"The root of all evil, the Achilles Heel of query optimization, is the estimation of the size of intermediate results, known as cardinalities." – Guy Lohman, SIGMOD Blog 2014

CARDINALITY ESTIMATION



SLIDE 4

Rise of Big Data Systems

TUNING!


  • Collecting Statistics
  • Providing Query Hints
  • Database Administration


SLIDE 5

Rise of the Clouds

MANAGED


SERVERLESS

No admin, no expertise, no control: users can no longer tune by collecting statistics, providing query hints, or doing database administration


SLIDE 6

Rise of the Clouds


SELF TUNING!

SLIDE 7

Hope: Shared Cloud Infrastructures


  • Massive volumes of query logs
  • Shared data processing
  • Centrally visible query workload

SLIDE 8

Cosmos: shared cloud infra at Microsoft

  • SCOPE workloads:
    • Batch processing in a job service
    • 100Ks jobs; 1000s users; EBs data; 100Ks nodes
  • Cardinality estimation in SCOPE:
    • 1 day’s log from Asimov
    • Lots of constants for best-effort estimation
    • Big data, unstructured data, custom code
  • Workload patterns:
    • Recurring jobs
    • Shared query subgraphs
  • Can we learn cardinality models?
[Diagram: plans for queries Q1 and Q2 sharing the subgraph Filter(Price > 100) over Item; annotated cardinalities illustrate under- and over-estimation relative to the ideal]

SLIDE 9

Learning Cardinality Model

  • Strict: cache previously seen values
    • Low coverage
    • Online feedback
  • General: learning a single model
    • Hard to featurize
    • Hard to train
    • Prediction latency
    • Low accuracy
  • Template: learning a model per subgraph template

=> No one-size-fits-all

Subgraph Type   Logical Expression   Parameter Values   Data Inputs
Strict          Fixed                Fixed              Fixed
General         Variable             Variable           Variable
Template        Fixed                Variable           Variable
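The three granularities above can be sketched as a lookup cascade from narrowest to broadest. This is a hypothetical structure for illustration, not the paper's code: `strict_cache`, `template_models`, and `predict` are invented names.

```python
# Sketch of the three model granularities, ordered narrowest to broadest.

strict_cache = {}     # (logical expr, params, inputs) -> observed cardinality
template_models = {}  # logical expr (template) -> per-template model

def predict(logical_expr, params, inputs, general_model=None):
    # Strict: exact match on everything previously seen.
    # Highest accuracy, lowest coverage; suits online feedback.
    key = (logical_expr, tuple(params), tuple(inputs))
    if key in strict_cache:
        return strict_cache[key]
    # Template: same logical expression, varying parameters and inputs.
    if logical_expr in template_models:
        return template_models[logical_expr](params, inputs)
    # General: one model for all subgraphs. Broadest coverage, but hard
    # to featurize and train, and least accurate.
    if general_model is not None:
        return general_model(logical_expr, params, inputs)
    return None  # fall back to the optimizer's default estimate
```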

SLIDE 10

Learned Cardinality Models

  • Subgraph template:
    • Same logical subexpression
    • Different physical implementations
    • Different parameters and inputs
  • Feature selection
  • Model selection:
    • Generalized linear models, due to their interpretability
    • More complex models, such as multi-layer perceptrons, are harder to train

Table 3: The features used for learning cardinality.

Name                      Description
JobName                   Name of the job containing the subgraph
NormJobName               Normalized job name
InputCardinality          Total cardinality of all inputs to the subgraph
Pow(InputCardinality, 2)  Square of InputCardinality
Sqrt(InputCardinality)    Square root of InputCardinality
Log(InputCardinality)     Log of InputCardinality
AvgRowLength              Average output row length
InputDataset              Name of all input datasets to the subgraph
Parameters                One or more parameters in the subgraph
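A minimal per-template fit can be sketched as follows. The slide's choice is Poisson regression (a GLM with a log link); to keep this sketch self-contained, a least-squares fit in log space stands in for it, using only the Log(InputCardinality) feature from Table 3. The training data is hypothetical.

```python
import math

def fit_loglinear(cards, outs):
    # Least-squares fit of log(out) = a + b * log(card): a simplified,
    # self-contained stand-in for the Poisson regression (GLM) on the slide.
    xs = [math.log(c) for c in cards]
    ys = [math.log(o) for o in outs]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
        sum((x - mx) ** 2 for x in xs)
    a = my - b * mx
    return a, b

def predict_card(a, b, card):
    return math.exp(a + b * math.log(card))

# Hypothetical execution history for one subgraph template
# (a filter that keeps roughly 10% of its input rows):
cards = [1e3, 1e4, 1e5, 1e6]
outs = [100, 1_000, 10_000, 100_000]
a, b = fit_loglinear(cards, outs)
estimate = predict_card(a, b, 5e5)  # close to 50,000
```

In the real system, one such small model is trained per template over many past executions, with the name-based features one-hot encoded.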

[Diagram: two instances of one subgraph template: Customer filtered on Age < 18 joined with Order on cid, and Customer' filtered on Age < 20 joined with Order' on cid]

SLIDE 11

Accuracy: 10-fold cross validation

[CDF of the estimated/actual cardinality ratio (10^-6 to 10^8) over subgraph instances, for neural network, linear regression, and Poisson regression models; a ratio of 1 is ideal]

Model               75th Percentile Error   90th Percentile Error
Default SCOPE       74602%                  5931418%
Poisson Regression  1.5%                    32%

Note: the neural network overfits due to the small observation and feature space per model
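A sketch of the metric behind the table above, assuming relative error |estimated − actual| / actual reported at the 75th and 90th percentiles (the exact metric definition is an assumption here).

```python
import math

def percentile_error(estimates, actuals, pct):
    """Relative error at the given percentile (nearest-rank method)."""
    errs = sorted(abs(e - a) / a for e, a in zip(estimates, actuals))
    idx = max(0, math.ceil(pct / 100 * len(errs)) - 1)
    return errs[idx]
```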
SLIDE 12

Applicability: percentage of subgraphs having models

[Varying training window: applicability (%) of learned models for jobs and subgraphs as the training duration grows from 1 day to 1 month]

[Sliding test window: applicability (%) for jobs and subgraphs as the test window slides over 1 day, 1 week, and 1 month]

SLIDE 13

End-to-end Feedback Loop

[Feedback loop pipeline: a query flows through Compiler, Optimizer, Scheduler, and Runtime to produce a result; these stages emit compiled query DAGs, optimized plans & estimated statistics, execution graphs & resources, and actual runtime statistics. A Workload Analyzer feeds a Parallel Trainer, whose Cardinality Models are served back to the Optimizer (model lookup & prediction) by a Model Server.]

Figure 5: The feedback loop architecture.

  • Large number of smaller, highly accurate models
  • Trained offline over new batches of data
  • Easy to featurize, with low overhead
  • Annotation hints to the query optimizer
  • Accurate and easy to understand
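The model-lookup step can be sketched as a dictionary keyed by a hash of the subgraph's logical expression (its template). The class and method names below are illustrative, not the system's actual API.

```python
import hashlib

def template_hash(logical_expr: str) -> str:
    # Parameters and inputs are assumed to be normalized away before
    # hashing, so all instances of one template collide on the same key.
    return hashlib.sha1(logical_expr.encode()).hexdigest()

class ModelServer:
    """Hypothetical sketch of the model server in the feedback loop."""

    def __init__(self):
        self.models = {}  # template hash -> prediction function

    def register(self, logical_expr, model):
        self.models[template_hash(logical_expr)] = model

    def cardinality_hint(self, logical_expr, features):
        model = self.models.get(template_hash(logical_expr))
        # None means no learned model: the optimizer keeps its default
        # estimate instead of an annotation hint.
        return None if model is None else model(features)
```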

SLIDE 14

Performance

  • Subset of hourly jobs from Asimov
  • These queries process unstructured data, use SPJA operators, and a UDO
  • Re-ran the queries over the same production data, but with redirected output

[Bar charts over 8 queries comparing the default optimizer with CardLearner: end-to-end latency (s), total processing time (s), and number of vertices]

SLIDE 15

Avoiding Learning Bias

  • Learning only what is seen
  • Exploratory join ordering:
    • Actively try different join orders
    • Pruning: discard plans with subexpressions that are more expensive than at least one other plan
    • Maximize new observations when comparing plans
  • Execution strategies:
    • Static workload tuning
    • Using sample data
    • Leveraging recurring/overlapping jobs

[Diagram: three candidate join orders over X, Y, Z with estimated vs. actual cardinalities at each step: (a) Plan 1: (X join Y) join Z, (b) Plan 2: (X join Z) join Y, (c) Plan 3: (Y join Z) join X]
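The pruning rule can be sketched as follows: a candidate plan is discarded if any of its subexpressions already costs more than the total cost of at least one other complete plan. The data layout and cost numbers here are hypothetical (loosely following the three join orders above), not the paper's implementation.

```python
# Sketch of exploratory-join-ordering pruning: keep only plans whose every
# subexpression is no more expensive than the cheapest other complete plan.

def prune(plans):
    """plans: {name: {"subexpr_costs": [...], "total": total plan cost}}"""
    kept = {}
    for name, plan in plans.items():
        cheapest_other = min(p["total"] for n, p in plans.items() if n != name)
        if all(c <= cheapest_other for c in plan["subexpr_costs"]):
            kept[name] = plan
    return kept

# Hypothetical costs for the three join orders over X, Y, Z:
plans = {
    "plan1": {"subexpr_costs": [100], "total": 200},  # (X join Y) join Z
    "plan2": {"subexpr_costs": [75],  "total": 175},  # (X join Z) join Y
    "plan3": {"subexpr_costs": [200], "total": 300},  # (Y join Z) join X
}
surviving = prune(plans)  # plan3's subexpression (200) exceeds plan2's total (175)
```

The surviving plans can then be chosen among so as to maximize new cardinality observations rather than purely by estimated cost.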

SLIDE 16

Takeaways

  • Big data systems increasingly use cost-based optimization
  • Users cannot tune these systems in managed/serverless services
  • Hard to achieve a one-size-fits-all query optimizer
  • Instance-optimized systems are more feasible
  • Very promising results from SCOPE workloads:
  • Could achieve very high accuracy
  • Reasonably large applicability, could further apply exploration
  • Performance gains, most significant being less resource consumption
  • Learned cardinality models a step towards self-learning optimizers