

SLIDE 1

A Generalized Blind Scheduling Policy

Hanhua Feng1, Vishal Misra2,3 and Dan Rubenstein2

1Infinio Systems 2Columbia University in the City of New York 3Google

TTIC SUMMER WORKSHOP: DATA CENTER SCHEDULING FROM THEORY TO PRACTICE

Feng, Misra, Rubenstein (Columbia) A Generalized Blind Scheduling Policy TTIC WORKSHOP 1 / 37

SLIDE 2

Overview

1

Introduction

2

PBS Policy

3

Properties of the PBS policy

4

Implementation and Experimental Results

5

PBS in the Data Center

6

Conclusion and Future Work

SLIDE 3

Introduction

SLIDE 4

Scheduling Policies in Queueing Models

Scheduling is a compromise . . .

not only between individual tasks, but also . . .
between systems with different workload patterns,
between different performance requirements, including

mean response time, mean slowdown, responsiveness, . . .
fairness measures: seniority, RAQFM, . . .

Our work

Design a flexible scheduling policy to balance these requirements.

Assumptions in this talk

Single-server queueing model
Work-conserving, preemption allowed

SLIDE 5

Blind Scheduling Policies

Non-blind policies

Know required and remaining service time when tasks arrive.

Blind policies

No information about remaining service until tasks complete.

Non-blind policy examples

SJF, SRPT, SMART . . .

Blind policy examples

FCFS, PS, LAS, LCFS, . . .

SLIDE 6

How Do We Measure Fairness of a Policy?

Fairness criteria [cf. Raz,Levy&Avi-Itzhak 2004]

Task seniority (emphasis on ti) ⇒ FCFS
Task service requirements (emphasis on xi)

Equal attained service ⇒ LAS/FBPS

Combination of the two: equal share of the processor

Current share: dxi(t)/dt ≡ x′i(t) ⇒ PS
Aggregate share: xi(t)/ti(t) ⇒ GAS

SLIDE 7

How to Measure Fairness of a Policy? (cont’d)

Fairness measures in literature

Comparison vs FCFS [Wang & Morris 1985] RAQFM: Comparison vs PS [Raz,Levy&Avi-Itzhak 2004]

A quantitative measure. Difficult to analyze: results exist for FCFS, LCFS, PLCFS, and Random in M/M/1; extended to G/D/m [Raz,Levy&Avi-Itzhak 2005]

Expected slowdown for given required service E[S|X = x] compared with PS [Wierman&Harchol-Balter 2004]

A classification: always fair/unfair, sometimes fair. Assume M/G/1. Extended in [Wierman&Harchol-Balter 2005].

SQF [Avi-Itzhak,Brosh&Levy 2007]

SLIDE 8

PBS Policy

SLIDE 9

Balance Between Two Fairness Criteria

Two fairness criteria (cont’d)

Seniority — Prefer larger sojourn time ti(t) Service requirements — Prefer smaller attained service xi(t)

Our idea: A configurable balance

Schedule a task with maximal ti(t) − αxi(t). More generally: g(ti(t)) − αg(xi(t)), e.g., log ti(t) − α log xi(t).

SLIDE 10

Our Parameterized Scheduler: PBS

The PBS policy with a single server

For every task i, compute its priority value pi(t) = log ti(t) − α log xi(t), equivalent to Pi(t) = ti(t)/[xi(t)]α.
α is a configurable parameter in [0, ∞).
At time t, serve the task with the highest priority pi (or Pi).

Randomly choose among equal-priority tasks.
Preempt lower-priority tasks if they are currently being served.

Can be used in continuous time (theory) or in discrete time (practice).

PBS: A Unified Priority-Based Scheduler Sigmetrics 2007
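The selection rule above can be written as a minimal discrete-time sketch (Python; the helper name and task representation are mine for illustration, not from the paper):

```python
import math

def pbs_pick(tasks, now, alpha):
    """Return the index of the task PBS would serve at time `now`.

    Each task is a dict with 'arrival' (arrival time) and 'attained'
    (service received so far).  Priority: p_i = log t_i - alpha*log x_i,
    where t_i = now - arrival is the sojourn time and x_i is attained
    service.  Assumes now > arrival for every task.  Sketch only: ties
    and the zero-attained-service case are handled crudely here.
    """
    best, best_p = None, -math.inf
    for i, task in enumerate(tasks):
        t = now - task['arrival']      # sojourn time t_i(now)
        x = task['attained']           # attained service x_i(now)
        if alpha > 0 and x == 0:
            return i                   # brand-new task: infinite priority
        p = math.log(t) - alpha * math.log(x) if x > 0 else math.log(t)
        if p > best_p:
            best, best_p = i, p
    return best
```

With α = 0 this degenerates to picking the largest sojourn time (FCFS); with large α the term −α log xi dominates and the task with the least attained service wins (LAS).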

SLIDE 11

PBS: Priority-based Blind Scheduling (cont’d)

Why PBS?

Tunable: Parameter α can be changed from 0 to ∞.

Emulates well-known policies: Pi = ti/[xi]α

α = 0: First-come first-serve (FCFS), Pi = ti
α → ∞: Least attained service (LAS), Pi ∼ 1/xi, a.k.a. Foreground-Background Processor-Sharing (FBPS)
α = 1: Greatest Attained Slowdown (GAS), Pi = ti/xi, closely emulates Processor-Sharing (PS)
Other values of α: hybrid policies

Blind: uses only past information (ti, xi).
Simple: easy to implement.
Dimensionless: independent of the time-unit scale (minute, second).
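The limiting cases can be sanity-checked numerically with the power form Pi = ti/xi^α (a toy check with made-up numbers, not the authors' code):

```python
def pbs_P(t, x, alpha):
    """Power-form PBS priority P_i = t_i / x_i**alpha."""
    return t / x ** alpha

# Two tasks: an old one with lots of attained service, a young one with little.
old = dict(t=10.0, x=8.0)     # arrived early, served a lot
young = dict(t=2.0, x=0.5)    # arrived late, served little

# alpha = 0 reduces to FCFS: priority is the sojourn time alone.
assert pbs_P(**old, alpha=0) > pbs_P(**young, alpha=0)

# Large alpha approaches LAS: the least-attained-service task wins.
assert pbs_P(**old, alpha=8) < pbs_P(**young, alpha=8)

# alpha = 1 is GAS: priority equals the task's current slowdown t/x.
assert pbs_P(**old, alpha=1) == 10.0 / 8.0
```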

SLIDE 12

Behavior of PBS

An example

Four tasks in 4 colors.
Arrival times: 0s, 1s, 3s, 5s.
Service times: 4.5s, 2.5s, 3s, 2s.

How to read the graphs

X-axis: time.
Y-axis: CPU utilization per task.
Area: service received.

SLIDE 13

Properties

SLIDE 14

Properties of PBS for 0 < α < ∞

Some properties of PBS proved in the paper

A new task immediately receives service after arrival:

a small CPU fraction for α < 1, a large CPU fraction for α > 1.

Seniority: earlier tasks get more attained service.
Time-shared: the CPU may be shared by two or more tasks.

Hospitality: a new task always gets a CPU share.

Convergence: converges to PS in the long run for long jobs;

converges to DPS with an offset in the log formula.

No starvation: priority values of temporarily blocked tasks increase without bound, so each eventually becomes the highest-priority task.

For α close to 0 (FCFS) or ∞ (LAS), tasks may still be blocked for a long time.

SLIDE 15

PBS Tunability: A Graphical Conclusion

PBS is monotonic in many aspects

Guidelines for tuning α manually.

SLIDE 16

Implementation

SLIDE 17

Implementation in Linux Kernel

CPU utilization measurement

Discrete-time implementation in Linux 2.6.15.
50ms moving average of measured CPU utilization per task.
Measurement results are close to simulation results; the only difference is roughness on small time scales.

SLIDE 18

Emulating Existing Linux Scheduler

A small tweak

Add a bonus priority γ to the current task in order to limit context switches. With α = 2 and γ = 0.07, PBS closely matches the Linux native scheduler.
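The tweak amounts to adding γ, in log space, to the running task's priority, so a waiting task must beat the incumbent by a margin of γ before it preempts. A sketch under that reading (helper names and task layout are mine, not the kernel patch):

```python
import math

def pbs_pick_with_bonus(tasks, now, alpha, gamma, running):
    """Pick a task under PBS with a bonus `gamma` (in log space) added
    to the currently running task, discouraging context switches.

    `tasks` maps task id -> (arrival, attained); `running` is the id of
    the task on the CPU, or None.  Assumes now > arrival and attained > 0.
    """
    def prio(tid):
        arrival, attained = tasks[tid]
        p = math.log(now - arrival) - alpha * math.log(attained)
        if tid == running:
            p += gamma  # bonus keeps the incumbent on the CPU a bit longer
        return p
    return max(tasks, key=prio)
```

With γ = 0 the incumbent is preempted as soon as any waiter's priority edges past it; a small positive γ suppresses such marginal switches.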

SLIDE 19

Experimental model

A closed model

A fixed number of users. Each user submits a task after thinking. Exponentially distributed thinking time. Response time of every task is measured.

SLIDE 20

Experimental Results (Set A)

Computational tasks with almost deterministic CPU usage. About 3-second processing for each task. 8 users, 25s average thinking time.

For this workload, small α works best: PBS (α < 0.7) outperforms Linux and Round-robin.

SLIDE 21

Experimental Results (Set B) (1/2)

Apache web server 2.0, dynamic pages with heavy processing. Overloaded with 30 users, 10s average thinking time. Processing time is heavy-tailed.

For this workload, large α works best: PBS (α > 2) outperforms Linux and Round-robin.

Conclusion

Different α’s are better for different workloads.

SLIDE 22

Experimental Results (Set B) (2/2)

Apache web server 2.0, dynamic pages with heavy processing. Overloaded with 30 users, 10s average thinking time. Processing time is heavy-tailed.

SLIDE 23

Data center

SLIDE 24

Data center fabric: A giant switch

[Diagram: the data-center fabric abstracted as one giant switch, with hosts H1–H9 on the TX side and H1–H9 on the RX side]

SLIDE 25

Transport in Data Centers

Objective: minimize average flow completion time (FCT).

DC transport = flow scheduling on a giant switch, subject to ingress & egress capacity constraints.

[Diagram: hosts H1–H9 on the TX and RX sides of the fabric]

SLIDE 26

pFabric

pFabric: Minimal Near-Optimal Datacenter Transport (Alizadeh et al. Sigcomm 2013)

Goal: Complete Flows Quickly

Requires scheduling flows such that:

High throughput for large flows Fabric latency (no queuing delays) for small flows

Prior work: use rate control to schedule flows; this vastly improves performance, but is complex.

SLIDE 27

pFabric in one slide

pFabric Packets

Packets carry a single priority number, e.g., prio = remaining flow size

pFabric Switches

Very small buffers (20–30KB for a 10Gbps fabric).
Send highest-priority packets / drop lowest-priority packets.

pFabric Hosts

Send/retransmit aggressively.
Minimal rate control: just prevent congestion collapse.
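The per-port behavior described above can be sketched as follows (a toy model, not the pFabric implementation; since prio = remaining flow size, a smaller number means higher priority):

```python
class PFabricPort:
    """Sketch of a pFabric switch port's small per-port "bag" of packets.

    Each packet carries prio = remaining flow size, so a SMALLER prio
    number means HIGHER priority.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.bag = []  # (prio, packet) pairs, unordered

    def enqueue(self, prio, pkt):
        self.bag.append((prio, pkt))
        if len(self.bag) > self.capacity:
            # Priority dropping: drop the lowest-priority packet,
            # i.e. the one with the LARGEST remaining flow size.
            worst = max(self.bag, key=lambda e: e[0])
            self.bag.remove(worst)

    def dequeue(self):
        # Priority scheduling: send the highest-priority packet first,
        # i.e. the one with the SMALLEST remaining flow size.
        if not self.bag:
            return None
        best = min(self.bag, key=lambda e: e[0])
        self.bag.remove(best)
        return best[1]
```

The buffer stays tiny because anything from a large flow is the first casualty on overflow, while short flows cut straight to the front.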

SLIDE 28

pFabric Switch

Switch port holds a small "bag" of packets (per-port), with prio = remaining flow size.

Ø Priority scheduling: send the highest-priority packet first
Ø Priority dropping: drop the lowest-priority packets first

[Diagram: a switch port holding a small bag of prioritized packets, with hosts H1–H9 attached]

SLIDE 29

pFabric results

SLIDE 30

Summary of results

SJF and SRPT achieve nearly equal performance (for large flows, SRPT is ≈ 15% better than SJF).

LAS (BytesSent) for DataMining is nearly as good as optimal/SRPT; for WebSearch, performance breaks down at high load: many jobs of similar sizes keep getting preempted until the new job "catches up", and then it starts again.

Tradeoffs

SJF/SRPT work across workload distributions, but require job-size information.

Job size is often not available ahead of time.

LAS requires no job-size information, but doesn't work well with non-heavy-tailed job-size distributions.
PBS can achieve a balance with the right α.

SLIDE 31

Homa: Practical Low Latency Datacenter Transport (Sigcomm 2018)

  • No persistent congestion in the core

Congestion At The Edge

[Diagram: Big Switch core, TOR switches, servers; congestion occurs at the edge]

SLIDE 32

Approach and Performance

Schedule messages in shortest-remaining-first order (SRPT).
Near-optimal average latency & good tail latency for short messages.

Key Ideas

Receiver-driven congestion control and packet scheduling: reduces buffer occupancy & improves latency.
Network priorities dynamically assigned by receivers: short messages bypass queues.
Controlled overcommitment of the receiver's downlink: avoids bandwidth waste, leading to high bandwidth utilization.

SLIDE 33

Homa mechanism

SRPT to schedule packets

  • Homa receivers schedule incoming packets: one grant per packet
  • Problem: 1 RTT additional latency for scheduling (size unknown)
  • Solution: Transmit 1 RTT of packets per message blindly

[Timeline diagram: the sender transmits unscheduled data packets, the receiver returns grants, and scheduled data packets follow]

PBS for Homa

No need to know flow sizes.
The first packet of a flow naturally has high priority (unless α = 0).
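The "natural high priority" claim follows from the power form: as attained service x → 0, P = t/x^α → ∞ for any α > 0, so a flow's first packets outrank everything. A tiny check with made-up numbers (illustrative helper, not Homa or PBS code):

```python
def pbs_P(t, x, alpha):
    """Power-form PBS priority P = t / x**alpha (illustrative helper)."""
    return t / x ** alpha

# A brand-new flow (tiny attained service, just arrived) vs an old flow
# that has already received lots of service.
new_flow = pbs_P(t=0.001, x=1e-6, alpha=1.0)
old_flow = pbs_P(t=100.0, x=50.0, alpha=1.0)
assert new_flow > old_flow   # the new flow outranks the old one

# With alpha = 0, PBS degenerates to FCFS and the advantage disappears.
assert pbs_P(t=0.001, x=1e-6, alpha=0.0) < pbs_P(t=100.0, x=50.0, alpha=0.0)
```

This is exactly the behavior Homa wants for blind first-RTT transmission: the first packets of a message are served ahead of long-running flows without any size information.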

SLIDE 34

Conclusion

SLIDE 35

Conclusion and Future Work

Contributions

We introduce a novel configurable policy, PBS.
By varying its single parameter, we can tune for various performance and fairness requirements.
We demonstrate the properties and advantages of PBS by analysis, simulation, implementation, and experiments.

Current/Future work

Closed form of the mean response time in M/G/1 for any α.
Design an automatic mechanism to dynamically adapt α to the workload.
Implement PBS in data-center scheduling/transport systems.
Extend PBS to multi-core systems.

SLIDE 36

The End

