Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters
Prashanth Thinakaran, Jashwant Gunasekaran, Bikash Sharma, Mahmut Kandemir, Chita Das
June 6th, ICDCS 2017
Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters - - PowerPoint PPT Presentation
Phoenix: A Constraint-aware Scheduler for Heterogeneous Datacenters Prashanth Thinakaran , Jashwant Gunasekaran, Bikash Sharma, Mahmut Kandemir, Chita Das June 6th, ICDCS 2017 Executive Summary Problem: Heterogeneity agnostic datacenter
June 6th, ICDCS 2017
2
choices of jobs
3
4
Early Late Centralized Distributed
Hybrid Schedulers
Task binding to Queue
Constraint aware Constraint unaware
Mercury Sparrow
Hawk Eagle
Phoenix
Mesos Borg
Choosy
10M 100B 10B 1B 100M
Number of jobs executed per day
Yacc-D
Control Plane
5
6
Minimum Disks 1% Maximum Disks 8% Number of cores 17% ISA (x86,ARM) 74%
ISA (x86,ARM) Number of Nodes Ethernet Speed Number of cores Maximum Disks Kernel Version Platform Family CPU Clock speed Minimum Disks
7
with at least 1 Gbps of network speed between them
8
9
frequency vector proposed in [2]
generate constraints for tasks
[1] C. Reiss, J. Wilkes, and J. L. Hellerstein, “Google cluster-usage traces: format+ schema,” Google Inc., White Paper 2011. [2] B. Sharma, V. Chudnovsky, J. L. Hellerstein, R. Rifaat, and C. R. Das, “Modeling and synthesizing task placement constraints in google compute clusters,” in Proceedings of the 2nd ACM Symposium on Cloud Computing.
10
11
scheduling
Yahoo Cloudera
12
Yahoo Cloudera Google Response times normalized to constrained jobs for Eagle-C
13
14
Distributed Scheduler 1 Worker Queue 1 CRV Monitor Worker Queue 2 Worker Queue 3 Worker Queue n-1 Worker Queue n more worker queues ...... Distributed Scheduler 2 Distributed Scheduler 3 Heartbeat Interval Centralized Scheduler
Constraint Resource Vector Lookup Table
Distributed Scheduler n ......
15
Worker 1 Worker 2 Worker 3 Worker 4 CRV Monitor utilization < threshold SRPT reordering DS DS DS DS
16 16
Worker 1 Worker 2 Worker 3 Worker 4 CRV Monitor utilization > threshold CRV reordering DS DS DS DS
17
18
19
*Lower the better Google Yahoo
20
Google trace- Hawk Cloudera Google trace- Sparrow *Lower the better
21
Yahoo Cloudera Google
22
24
25
27
28
to 2.5x queueing delays (repetition of information)
29
30