phoenix a constraint aware scheduler for heterogeneous
play

Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters - PowerPoint PPT Presentation

Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters Prashanth Thinakaran , Jashwant Gunasekaran, Bikash Sharma, Mahmut Kandemir, Chita Das June 6th, ICDCS 2017 Executive Summary Problem: Heterogeneity agnostic datacenter


  1. Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters Prashanth Thinakaran , Jashwant Gunasekaran, Bikash Sharma, Mahmut Kandemir, Chita Das June 6th, ICDCS 2017

  2. Executive Summary ‣ Problem: Heterogeneity agnostic datacenter schedulers leads to poor placement choices of jobs • Schedulers Ignoring hardware and application level heterogeneity ‣ Constraints are used as a medium • To express task level heterogeneity (Eg., Latency sensitive, Batch) • To expose hardware level heterogeneity (Eg., ISA, Clock speed, Accelerators) • To ensure task performance guarantees to ensure QoS ‣ Phoenix is a constraint-aware scheduler that is: • Heterogeneity-aware and hybrid hence scalable • Uses real-time CRV metric for task reordering at peak congestions optimizing for tail latencies • Improves the 99th percentile (tail) latency by 1.9x across production cluster traces 2

  3. Outline • Scheduler Design Paradigm • Motivation • Modeling and synthesizing task constraints • Phoenix architecture • Results 3

  4. Scheduler Design Paradigm Distributed Number of jobs executed per day 100B Sparrow Yacc-D 10B Control Plane Phoenix Hybrid Schedulers 1B Mercury Hawk Eagle Choosy 100M Centralized Mesos Borg 10M Constraint unaware Constraint aware Early Task binding to Queue Late 4

  5. Outline • Scheduler Design Paradigm • Motivation • Modeling and synthesizing task constraints • Phoenix architecture • Results 5

  6. Constraint share in Google traces Minimum Disks 1% Maximum Disks 8% Number of cores 17% ISA (x86,ARM) 74% ISA (x86,ARM) Number of Nodes Ethernet Speed Number of cores Maximum Disks Kernel Version Platform Family CPU Clock speed Minimum Disks 6

  7. Task placement constraints ‣ Constraint-based Job Requests in Cloud Schedulers • More than 50% of all tasks subscribe to task constraints • Eg., A job may request two server nodes belonging to x86 with at least 1 Gbps of network speed between them • Constraint subscription surges • Impact other unconstrained tasks • Root cause for tail-latencies 7

  8. Outline • Scheduler Design Paradigm • Motivation • Modeling and synthesizing task constraints • Phoenix architecture • Results 8

  9. Synthesizing Task Constraints ‣ Publicly available Google’s cluster workload traces [1] • Hashed constraint values were correlated with constraint frequency vector proposed in [2] ‣ Yahoo, Cloudera synthetically generated constraints • Benchmarking model proposed in [1] to characterize and generate constraints for tasks • Cross-validated & accuracy is close to 87% [1] C. Reiss, J. Wilkes, and J. L. Hellerstein, “Google cluster-usage traces: format+ schema,” Google Inc., White Paper 2011. [2] B. Sharma, V. Chudnovsky, J. L. Hellerstein, R. Rifaat, and C. R. Das, “Modeling and synthesizing task placement constraints in google compute clusters,” in Proceedings of the 2nd ACM Symposium on Cloud Computing. 9

  10. Constraint distribution • 50% of tasks are constrained • 33% of jobs demand two constraints but only 12% of it could be satisfied • As incoming jobs demand more constraints it become difficult to satisfy all of them. 10

  11. Job Queuing Delays Yahoo Cloudera ‣ High tail latency for resource constrained tasks • Average 2 to 2.5x at tail incase of Eagle and Yacc-d ‣ High volume of scheduling requests demands distributed scheduling 11

  12. Job response times vs Cluster Load Need for a scalable scheduler that Yahoo Cloudera Google could handle tasks with multiple Response times normalized to constrained jobs for Eagle-C • 99th percentile job response times shooting up for all constraints traces • More the system utilization more the response time degradation 12

  13. Outline • Scheduler Design Paradigm • Motivation • Modeling and synthesizing task constraints • Phoenix architecture • Results 13

  14. Phoenix architectural overview Distributed Distributed Distributed Distributed ...... Scheduler 1 Scheduler 2 Scheduler 3 Scheduler n Heartbeat Interval Worker Queue n Worker Queue 1 Worker Queue 2 Worker Queue 3 Worker Queue worker queues more ...... n-1 CRV Monitor Centralized Scheduler Constraint Resource Vector Lookup Table 14

  15. utilization < threshold DS Worker 1 SRPT fails at tail of constrained DS Worker 2 SRPT CRV Monitor reordering jobs at higher utilization DS Worker 3 DS Worker 4 15

  16. utilization > threshold DS Worker 1 DS Worker 2 CRV CRV Monitor reordering DS Worker 3 DS Worker 4 16 16

  17. Architecture contd.. • CRV monitor keeps track of Constraint Resource Vector (CRV) • Demand and supply ratio of every constraint at every machine is updated for every heartbeat interval • P-K based queue waiting time estimators for admission control • When CRV increase beyond a set threshold CRV based reordering is initiated 17

  18. Outline • Scheduler Design Paradigm • Motivation • Modeling and synthesizing task constraints • Phoenix architecture • Results 18

  19. Phoenix compared to Eagle *Lower the better Google Yahoo 19

  20. Phoenix compared to Hawk/Sparrow *Lower the better Cloudera Google trace- Hawk Google trace- Sparrow 20

  21. Response times for long jobs Cloudera Yahoo Google 21

  22. Summary • Phoenix is a hybrid constraint-aware scheduler • Dynamically adapts it self at high resource demands using CRV metric based reordering • Improves tail-latency by an average of 1.9x for heavily resource constrained tasks • Not affecting long job response times and fairness of other unconstrained tasks 22

  23. prashanth@cse.psu.edu http://www.cse.psu.edu/hpcl/index.html

  24. CRV statistics • Number of task reordering is reliant on inter arrival patterns of jobs • Average utilization of the cluster was 80% 24

  25. Constraint Modeling • Types of constraints • Hard constraints -> Eg., Minimum memory, No. of cores • Soft constraints -> Clock speed, Network bandwidth • Affinity constraints -> HDFS Data locality, MPI tasks • Constraint support in existing schedulers • Mesos - Locality preferences of tasks • Kubernetes - It is on their roadmap to support soft & hard • Affinity constraints impact scheduling delays 2x to 4x 25

  26. Eagle Yahoo Eagle CC

  27. Scheduling Optimization Metrics • Job response Times • Hybrid schedulers uses SRPT for job turnaround times • Comes at the cost of fairness of the other unconstrained jobs • Admission Control • Negotiating for Jobs with multiple resource constraints • Hard to soft relaxation of constraints • Late Binding • Avoiding early commit and reducing queue waiting times especially for short jobs. • Load Balancing • Job stealing techniques improves the overall resource utilization but not always the case. • Task migration overheads and constraint preferences violation should be taken in to account 27

  28. Queuing delays of Google jobs using SRPT • Sporadic peaks and valleys of job submission pattern • At peaks, heavy tail latency leads to QoS violations of short jobs • Queuing delays cascade in to other unconstrained tasks • Naive SRPT based queue management fails to deliver • Constrained tasks scheduled by Hawk and Yacc-d also experience 2 to 2.5x queueing delays (repetition of information) 28

  29. Evaluation Methodology • Trace-driven simulator built on top of Eagle and Sparrow • Three production datacenter traces were used for evaluation • Yahoo, Cloudera & Google • Cluster inter arrival rate is bursty and unpredictable with peak to median ratio from 9:1 to 260:1 29

  30. Impact on unconstrained jobs 30

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend