Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters - - PowerPoint PPT Presentation



SLIDE 1

Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters

Prashanth Thinakaran, Jashwant Gunasekaran, Bikash Sharma, Mahmut Kandemir, Chita Das

June 6th, ICDCS 2017

SLIDE 2

Executive Summary

  • Problem: Heterogeneity-agnostic datacenter schedulers make poor job placement choices
  • Such schedulers ignore both hardware- and application-level heterogeneity
  • Constraints are used as a medium:
  • To express task-level heterogeneity (e.g., latency-sensitive, batch)
  • To expose hardware-level heterogeneity (e.g., ISA, clock speed, accelerators)
  • To provide task performance guarantees that ensure QoS
  • Phoenix is a constraint-aware scheduler that is:
  • Heterogeneity-aware and hybrid, hence scalable
  • Uses a real-time CRV (Constraint Resource Vector) metric to reorder tasks during peak congestion, optimizing for tail latency
  • Improves the 99th-percentile (tail) latency by 1.9x across production cluster traces
SLIDE 3

Outline

  • Scheduler Design Paradigm
  • Motivation
  • Modeling and synthesizing task constraints
  • Phoenix architecture
  • Results
SLIDE 4

Scheduler Design Paradigm

[Figure: Design space of cluster schedulers along two axes: task binding to queue (early vs. late) and control plane (centralized vs. distributed), with hybrid schedulers in between; each system is marked constraint-aware or constraint-unaware. Examples: Mesos, Borg, Choosy (centralized); Sparrow, Yaq-d (distributed); Mercury, Hawk, Eagle, Phoenix (hybrid). A scale from 10M to 100B indicates the number of jobs executed per day.]

SLIDE 5

Outline

  • Scheduler Design Paradigm
  • Motivation
  • Modeling and synthesizing task constraints
  • Phoenix architecture
  • Results
SLIDE 6

Constraint share in Google traces

[Pie chart: Share of constraint types in the Google traces. ISA (x86, ARM): 74%; Number of cores: 17%; Maximum Disks: 8%; Minimum Disks: 1%. The legend also lists Number of Nodes, Ethernet Speed, Kernel Version, Platform Family, and CPU Clock speed.]

SLIDE 7

Task placement constraints

  • Constraint-based job requests in cloud schedulers
  • More than 50% of all tasks subscribe to task constraints
  • E.g., a job may request two x86 server nodes with at least 1 Gbps of network speed between them
  • Surges in constraint subscription:
  • Impact other unconstrained tasks
  • Are a root cause of tail latencies
SLIDE 8

Outline

  • Scheduler Design Paradigm
  • Motivation
  • Modeling and synthesizing task constraints
  • Phoenix architecture
  • Results
SLIDE 9

Synthesizing Task Constraints

  • Publicly available Google cluster workload traces [1]
  • Hashed constraint values were correlated with the constraint frequency vector proposed in [2]
  • Yahoo and Cloudera constraints were synthetically generated
  • The benchmarking model proposed in [2] was used to characterize and generate constraints for tasks
  • Cross-validated; accuracy is close to 87%

[1] C. Reiss, J. Wilkes, and J. L. Hellerstein, “Google cluster-usage traces: format + schema,” Google Inc., White Paper, 2011.
[2] B. Sharma, V. Chudnovsky, J. L. Hellerstein, R. Rifaat, and C. R. Das, “Modeling and synthesizing task placement constraints in Google compute clusters,” in Proceedings of the 2nd ACM Symposium on Cloud Computing (SoCC), 2011.
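The synthesis described above, drawing constraints for each task from a measured frequency vector, can be sketched as independent sampling. A minimal sketch follows; the constraint names and frequencies are illustrative stand-ins loosely shaped like the Google-trace shares, not the paper's actual vectors:

```python
import random

# Illustrative constraint-frequency vector: the fraction of tasks that
# request each constraint. Real vectors would be derived from the trace.
CONSTRAINT_FREQ = {
    "ISA(x86)": 0.74,
    "num_cores>=4": 0.17,
    "max_disks": 0.08,
    "min_disks": 0.01,
}

def synthesize_constraints(rng):
    """Independently include each constraint with its measured frequency."""
    return {c for c, p in CONSTRAINT_FREQ.items() if rng.random() < p}

rng = random.Random(42)
tasks = [synthesize_constraints(rng) for _ in range(10_000)]
constrained = sum(1 for t in tasks if t) / len(tasks)
print(f"fraction of constrained tasks: {constrained:.2f}")
```

A real generator would also have to respect correlations between constraints (the frequency-vector approach of [2] captures co-occurrence), which independent sampling deliberately ignores here.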

SLIDE 10

Constraint distribution

  • 50% of tasks are constrained
  • 33% of jobs demand two constraints, but only 12% of them could be satisfied
  • As incoming jobs demand more constraints, it becomes difficult to satisfy all of them

SLIDE 11

Job Queuing Delays

  • High tail latency for resource-constrained tasks
  • On average 2x to 2.5x at the tail in the case of Eagle and Yaq-d
  • The high volume of scheduling requests demands distributed scheduling

[Figures: Job queuing delays for the Yahoo and Cloudera traces]

SLIDE 12

Job response times vs Cluster Load

  • 99th-percentile job response times shoot up for all traces
  • The higher the system utilization, the worse the response time degradation

[Figures: Yahoo, Cloudera, and Google traces; response times normalized to constrained jobs under Eagle-C]

Need for a scalable scheduler that can handle tasks with multiple constraints

SLIDE 13

Outline

  • Scheduler Design Paradigm
  • Motivation
  • Modeling and synthesizing task constraints
  • Phoenix architecture
  • Results
SLIDE 14

Phoenix architectural overview

[Diagram: Phoenix architecture. Distributed schedulers 1 through n dispatch tasks into worker queues 1 through n; a CRV monitor maintains the Constraint Resource Vector lookup table, which the centralized scheduler updates every heartbeat interval.]

SLIDE 15

[Diagram: Four workers, each with a distributed scheduler (DS); while utilization < threshold, the CRV monitor applies SRPT reordering]

SRPT fails at the tail for constrained jobs at higher utilization

SLIDE 16

[Diagram: Once utilization > threshold, the CRV monitor switches the worker queues to CRV-based reordering]
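The two modes pictured on slides 15 and 16, SRPT ordering while utilization stays below the threshold and CRV-based ordering once it crosses it, can be sketched as a single reordering function. The field names, scores, and the 0.8 threshold are illustrative assumptions, not Phoenix's actual parameters:

```python
from dataclasses import dataclass

@dataclass
class Task:
    tid: str
    remaining: float   # estimated remaining processing time (SRPT key)
    crv_score: float   # scarcity of this task's constraints (CRV key)

def reorder_queue(queue, utilization, threshold=0.8):
    """Below the threshold, order by shortest remaining processing time;
    above it, prioritize tasks whose constraints are scarcest (highest CRV)."""
    if utilization < threshold:
        return sorted(queue, key=lambda t: t.remaining)   # SRPT ordering
    return sorted(queue, key=lambda t: -t.crv_score)      # CRV reordering

q = [Task("a", 5.0, 0.9), Task("b", 1.0, 0.1), Task("c", 3.0, 0.5)]
print([t.tid for t in reorder_queue(q, utilization=0.5)])   # → ['b', 'c', 'a']
print([t.tid for t in reorder_queue(q, utilization=0.95)])  # → ['a', 'c', 'b']
```

The point of the switch: under congestion, short tasks with scarce constraints would otherwise starve behind unconstrained short tasks that SRPT favors.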

SLIDE 17

Architecture contd.

  • The CRV monitor keeps track of the Constraint Resource Vector (CRV)
  • The demand-to-supply ratio of every constraint on every machine is updated at every heartbeat interval
  • Pollaczek-Khinchine (P-K) based queue waiting-time estimators are used for admission control
  • When the CRV rises beyond a set threshold, CRV-based reordering is initiated
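The P-K estimator named above is the Pollaczek-Khinchine formula for the mean waiting time of an M/G/1 queue. A minimal sketch of how such an estimator could drive admission control; the SLO-based reject rule is an illustrative assumption, not Phoenix's exact policy:

```python
def pk_wait_time(arrival_rate, service_mean, service_second_moment):
    """Pollaczek-Khinchine mean waiting time for an M/G/1 queue:
    W = lambda * E[S^2] / (2 * (1 - rho)), where rho = lambda * E[S]."""
    rho = arrival_rate * service_mean
    if rho >= 1.0:
        return float("inf")   # queue is unstable, wait grows without bound
    return arrival_rate * service_second_moment / (2.0 * (1.0 - rho))

def admit(task_slo, arrival_rate, service_mean, service_second_moment):
    """Admit a task to this queue only if the estimated wait fits its SLO."""
    return pk_wait_time(arrival_rate, service_mean, service_second_moment) <= task_slo

# Exponential service with mean 1 s (E[S^2] = 2 for exponential), lambda = 0.5/s:
# rho = 0.5, so W = 0.5 * 2 / (2 * 0.5) = 1.0 second.
print(pk_wait_time(0.5, 1.0, 2.0))  # → 1.0
print(admit(task_slo=0.5, arrival_rate=0.5,
            service_mean=1.0, service_second_moment=2.0))  # → False
```

Only per-queue arrival and service moments are needed, which is what makes this practical to refresh at heartbeat granularity.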

SLIDE 18

Outline

  • Scheduler Design Paradigm
  • Motivation
  • Modeling and synthesizing task constraints
  • Phoenix architecture
  • Results
SLIDE 19

Phoenix compared to Eagle

[Figures: Google and Yahoo traces; lower is better]

SLIDE 20

Phoenix compared to Hawk/Sparrow

[Figures: Google trace vs. Hawk, Cloudera, and Google trace vs. Sparrow; lower is better]

SLIDE 21

Response times for long jobs

[Figures: Yahoo, Cloudera, and Google traces]

SLIDE 22

Summary

  • Phoenix is a hybrid, constraint-aware scheduler
  • It dynamically adapts itself under high resource demand using CRV-metric-based reordering
  • It improves tail latency by an average of 1.9x for heavily resource-constrained tasks
  • It does so without affecting long-job response times or the fairness of other unconstrained tasks
SLIDE 23

prashanth@cse.psu.edu http://www.cse.psu.edu/hpcl/index.html

SLIDE 24

CRV statistics

  • The number of task reorderings depends on the inter-arrival pattern of jobs
  • The average utilization of the cluster was 80%
SLIDE 25

Constraint Modeling

  • Types of constraints:
  • Hard constraints -> e.g., minimum memory, number of cores
  • Soft constraints -> e.g., clock speed, network bandwidth
  • Affinity constraints -> e.g., HDFS data locality, MPI tasks
  • Constraint support in existing schedulers:
  • Mesos - locality preferences of tasks
  • Kubernetes - soft and hard constraint support is on the roadmap
  • Affinity constraints increase scheduling delays by 2x to 4x
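The hard/soft split above suggests a simple two-phase matcher: hard constraints filter machines out entirely, while soft constraints only rank the survivors. A minimal sketch; the attribute names, predicates, and scoring rule are illustrative, not any scheduler's actual matching logic:

```python
def machine_score(mach, hard, soft):
    """Return None if any hard constraint fails (machine is filtered out);
    otherwise return the fraction of soft constraints the machine satisfies,
    used only to rank eligible machines."""
    if any(not pred(mach) for pred in hard):
        return None
    if not soft:
        return 1.0
    return sum(1 for pred in soft if pred(mach)) / len(soft)

m = {"cores": 8, "mem_gb": 32, "clock_ghz": 2.4, "net_gbps": 1}
hard = [lambda mach: mach["cores"] >= 4,
        lambda mach: mach["mem_gb"] >= 16]        # must hold
soft = [lambda mach: mach["clock_ghz"] >= 3.0,
        lambda mach: mach["net_gbps"] >= 1]       # preferred, may be relaxed
print(machine_score(m, hard, soft))  # → 0.5 (meets 1 of 2 soft constraints)
```

Hard-to-soft relaxation, mentioned on the optimization-metrics slide, amounts to moving a predicate from the `hard` list to the `soft` list when no machine survives the filter.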
SLIDE 26

[Figures: Eagle on the Yahoo trace; Eagle CC]

SLIDE 27

Scheduling Optimization Metrics

  • Job response times
  • Hybrid schedulers use SRPT to improve job turnaround times
  • This comes at the cost of fairness to other unconstrained jobs
  • Admission control
  • Negotiating for jobs with multiple resource constraints
  • Relaxing constraints from hard to soft
  • Late binding
  • Avoids early commits and reduces queue waiting times, especially for short jobs
  • Load balancing
  • Job-stealing techniques improve overall resource utilization, but not in every case
  • Task migration overheads and constraint-preference violations must be taken into account
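Late binding, listed above, defers the task-to-worker commitment: the scheduler places reservation probes on several sampled workers and binds the task only to whichever one frees up first, as in Sparrow-style batch sampling. A minimal sketch; the probe count and the worker model (a known free-up time per worker) are illustrative simplifications:

```python
import random

def late_binding_schedule(task_id, workers, probes=2, rng=random):
    """Place `probes` reservations on randomly sampled workers; bind the task
    to whichever sampled worker becomes free first and cancel the rest."""
    sampled = rng.sample(list(workers), probes)
    winner = min(sampled, key=lambda w: workers[w])  # earliest free worker wins
    cancelled = [w for w in sampled if w != winner]
    return winner, cancelled

# Time at which each worker frees up (in a real system this is unknown,
# which is exactly why binding is deferred until a worker pulls the task).
workers = {"w1": 4.0, "w2": 1.0, "w3": 9.0}
rng = random.Random(0)
bound, cancelled = late_binding_schedule("t1", workers, probes=2, rng=rng)
print(bound, cancelled)
```

The trade-off the slide hints at: probes consume queue slots on workers that never run the task, so probe count balances short-job wait time against wasted reservations.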
SLIDE 28

Queuing delays of Google jobs using SRPT

  • Sporadic peaks and valleys in the job submission pattern
  • At peaks, heavy tail latency leads to QoS violations for short jobs
  • Queuing delays cascade into other unconstrained tasks
  • Naive SRPT-based queue management fails to deliver
  • Constrained tasks scheduled by Hawk and Yaq-d also experience 2x to 2.5x queuing delays

SLIDE 29

Evaluation Methodology

  • Trace-driven simulator built on top of Eagle and Sparrow
  • Three production datacenter traces were used for evaluation: Yahoo, Cloudera, and Google
  • Cluster inter-arrival rates are bursty and unpredictable, with peak-to-median ratios from 9:1 to 260:1

SLIDE 30

Impact on unconstrained jobs