Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam - PowerPoint PPT Presentation

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam Elahi 1 , 3 Andrea Marin 2 Sabina Rossi 2 Carey Williamson 3 1 Mount Royal University, Canada 2 Universit` a Ca’Foscari Venezia, Italy 3 University of Calgary, Canada 1

Talk overview Introduction and Contribution The queueing model and its solution Case study Conclusion 2

Introduction and Contribution 3

Size based scheduling • Goal: serve first the smallest jobs to improve the expected response time • Assumptions: • Do we know the job size at the arrival epoch? ( S hortest R emaining P rocessing T ime) • Do we know the distribution of the job size? (Gittin’s policy) • Is the job size distribution heavy tailed? (Least Attained Service (LAS), Multilevel queues) 4

How do multilevel queues work? Example: two levels with P rocessor S haring (PS) server 5

Frequency scaling • Goal: Energy saving • Main idea: When there are few jobs in the system, we can reduce the processor speed • We increase the expected response time w.r.t. constant speed only for the lucky jobs • The service time is directly proportional to the service speed f • The power consumption depends on the service speed as f α , were 2 ≤ α ≤ 3 • Linear frequency scaling: The server speed is proportional to the number of jobs in the system 6

Related work (short list) • Policies independent of the received service • (George, Harrison): FCFS queues and frequency scaling • (Wierman et al.): M/G/1/PS queues with frequency scaling • Policies with known job size • (Bansal et al.; Andrew et al.): Worst case analysis of SRPT with frequency scaling • (Andrew at al.; Elahi, Williamson): Unfairness in SRPT with frequency scaling • (Lassila; Aalto): LAS with sleeping servers 7

Contribution • We study a two-level queue with PS discipline and linear speed scaling on the low-priority jobs (PS+IS) • We give a numerical solution of the queueing system and validate it with discrete event simulation • We study the behaviour of the model with job size distributions obtained by monitoring TCP flows of a data centre 8

The queueing model and its solution 9

Graphical representation Processor Sharing (PS) X Jobs with size < a Poisson( λ ) a X − a Jobs with size ≥ a Infinite Server (IS) Note: The IS system works only when the PS is idle 10

Job size distribution • We use Generalized Hyperexponential (GH) distributions K � p k µ k e − p k µ k f X ( x ) = k =1 where � K k =1 p k = 1, p k ∈ R , µ k ∈ R + , f X ( x ) > 0 for all x ∈ R + • GH distributions are dense in the domain of the distributions • They can approximate any distribution arbitrary well 11

Analysis of the queue: sketch • We can see the system consisting of two queues: • High priority one which is M/G/1/PS whose job sizes are truncated at a • Low priority one which works during the idle periods of the PS which is a M B /G/ ∞ queue • The arrival process is Poisson with intensity λ • The batch size is the number of jobs that crossed the threshold during a busy period of the PS level • The generating function of the batch size distribution has not an explicit form but has a characteristic equation (Kleinrock) • The solution of the IS queue requires the distribution of the batch size 12

Computation of the batch size distribution • We invert the generating function with the Lattice-Poisson algorithm by Abate and Whitt • The evaluation of the generating function is obtained with a fixed point algorithm whose convergence is proved by resorting to Banach’s contraction mapping fixed point theorem • The accuracy of the numerical procedure is validated in low and heavy-load by comparing the first two moments of the distribution (which can be computed explicitly for GH distributions from the characteristic equation) with those obtained by the numerical inversion 13

Computation of the power consumption • We resort to the literature for the power consumed by the PS queue • We provide a numerical solution for the IS queue and integer values of the exponent α • The power consumption is derived from the second ( α = 2) and third ( α = 3) moments of the occupancy distribution in the IS queue 14

Case study 15

Dataset • TCP flows monitored at the data centre of the Universit` a Ca’ Foscari Venezia in November 2019 • Fitting with PH-Fit into an acyclic phase-type distribution • Transformation of the acyclic phase-type distribution into a GH 10 -1 10 0 1 10 -2 0.9 10 -1 10 -3 0.8 0.7 10 -4 10 -2 0.6 10 -5 0.5 10 -3 10 -6 0.4 0.3 10 -7 10 -4 0.2 10 -8 0.1 10 -9 0 10 -5 10 -1 10 0 10 1 10 2 10 3 10 4 10 5 10 -1 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 8 10 -1 10 0 10 1 10 2 10 3 10 4 10 5 (a) Probability density (b) Empirical and (c) Complementary function in log-log analytical cumulative cumulative density scale. density function in function in log-log log-linear scale. scale. 16

PS+IS vs. PS: Comparison of the expected response time • PS queue has speed 1 and IS has speed f < 1 4 4 3.5 3.5 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 0 5 10 15 0 5 10 15 10 4 10 4 (a) Expected response time: (b) Expected response time: ρ PS = 0 . 85. ρ PS = 0 . 92. 17

PS+IS vs. PS: Comparison of the power consumption 1 1.3 1.25 0.95 1.2 1.15 0.9 1.1 0.85 1.05 1 0.8 0.95 0.9 0.75 0.85 0.7 0.8 0 0.5 1 1.5 2 2.5 3 3.5 4 0 0.5 1 1.5 2 2.5 3 3.5 4 10 4 10 4 (a) Power consumption: (b) Power consumption: ρ PS = 0 . 85 when 0 ≤ a ≤ 4 · 10 4 . ρ PS = 0 . 92 when 0 ≤ a ≤ 4 · 10 4 . 18

PS+IS vs. PS: Slowdown 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 (a) Slowdown of PS+IS (b) Slowdown of PS+IS conditioned to the job size x with conditioned to the job size x with ρ PS = 0 . 85. ρ PS = 0 . 92. 19

PS+IS vs. PS: Comparison of the expected response times with same power consumption 2 1 2 1 1.8 0.9 1.8 0.9 1.6 0.8 1.6 0.8 1.4 0.7 1.4 0.7 1.2 0.6 1.2 0.6 1 0.5 1 0.5 0.8 0.4 0.8 0.4 0.6 0.3 0.6 0.3 0.4 0.2 0.4 0.2 0.2 0.1 0.2 0.1 0 0 0 0 0 5 10 15 0 5 10 15 10 4 10 4 (a) Comparison of the expected (b) Comparison of the expected response time with ρ PS = 0 . 70 and response time with ρ PS = 0 . 70 and f = 0 . 10. f = 0 . 15. 20

Simulation • Simulation has been used to cross validate the numerical results • Simulation allows the investigation of other characteristics of the system such as the distribution of the system speed Speed of PS+IS with ; PS =0.70, a = 2 " 10 3 Speed of PS+IS with ; PS =0.85, a = 2 " 10 3 0.45 0.45 f = 0 : 05 f = 0 : 05 0.4 0.4 f = 0 : 10 f = 0 : 10 0.35 f = 0 : 15 0.35 f = 0 : 15 0.3 0.3 Probability Probability 0.25 0.25 0.2 0.2 0.15 0.15 0.1 0.1 0.05 0.05 0 0 0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3 Speed Speed (a) System speed: ρ PS = 0 . 70. (b) System speed: ρ PS = 0 . 85. 21

Conclusion 22

Conclusion • We have introduced a two-level queueing system (PS+IS) with linear speed scaling for the low-priority level • A numerical solution procedure has been proposed and its accuracy has been validated with discrete event simulation • Experiments on real-world job size distributions have been carried out • We showed that the model-driven configuration of the PS+IS system is crucial for obtaining the benefits of the speed scaling without compromising the slowdown of the system too much 23

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam - PowerPoint PPT Presentation

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam Elahi 1 , 3 Andrea Marin 2 Sabina Rossi 2 Carey Williamson 3 1 Mount Royal University, Canada 2 Universit` a CaFoscari Venezia, Italy 3 University of Calgary, Canada 1 Talk

Stacks, Queues, and Priority Queues Inf 2B: Heaps and Priority Queues Stacks, queues, and priority

CS200: Queues n Prichard Ch. 8 CS200 - Queues 1 Queues n First In First Out (FIFO) structure n

Frequency Decomposition The base frequency or the fundamental frequency is the lowest frequency.

Outline Scaling Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software

Figure 1a: Multilevel Visualization of Market Power Cases in Tree Format Figure 1b: Multilevel

Agenda 1. More on multilevel R formulas 2. Generalized Multilevel Models 3. GMLM in R 1 More

Multilevel Krylov Methods Deflation Deflation, DD, MG Reinhard Nabben Multilevel Krylov

Floor Chris Campbell, PE Senior Fire Engineer Reddit.com mealsharedotorg Stairwell Queues 2

CS171 Introduction to Computer Science II Priority Queues and Binary Heap Priority Queues and

CSCI 104 Priority Queues / Heaps Mark Redekopp David Kempe Sandra Batista 2 PRIORITY QUEUES

CS200: Priority Queues, Heaps Prichard Ch. 12 CS200 - Tables and Priority Queues 1 Priority

Priority Queues (Heaps) October 11, 2016 CMPE 250 Priority Queues October 11, 2016 1 / 29

Priority Queues 3rd October 2019 Priority Queues Binary heaps Leftist heaps

Analysis of Scaling Algorithms for Matrix & Operator Scaling Contents Scaling Algorithms

Time-Frequency Analysis Time Frequency Analysis in Visual Signal Yetmen Wang AnCAD, Inc.

Agenda Motivation Multipath approaches Modeling Experiments and results

3Q FY19 Financial Results 11 July 2019 Disclaimer This presentation is for information only and

C S 3 5 4 R e p r e s e n t i n g Ma p s : T o p o l o g i c a l

CS 574: Randomized Algorithms Lecture 4. Occupancy Problems, Balls in Bins September 3, 2015

Autoscaling Effects in Speed Scaling Systems Carey Williamson Department of Computer Science

2Q & 1HFY20/21 Financial Results 27 October 2020 Important Notice This presentation shall be

Typical DO Sag Curve David Reckhow CEE 577 #16 2 1 CEE 577 Lecture #16 10/23/2017

PV Systems - Components and Concepts Maximum Power Point Tracking (MPPT) Week 7.3 Arno Smets

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam - PowerPoint PPT Presentation

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam Elahi 1 , 3 Andrea Marin 2 Sabina Rossi 2 Carey Williamson 3 1 Mount Royal University, Canada 2 Universit` a CaFoscari Venezia, Italy 3 University of Calgary, Canada 1 Talk

Stacks, Queues, and Priority Queues Inf 2B: Heaps and Priority Queues Stacks, queues, and priority

CS200: Queues n Prichard Ch. 8 CS200 - Queues 1 Queues n First In First Out (FIFO) structure n

Frequency Decomposition The base frequency or the fundamental frequency is the lowest frequency.

Outline Scaling Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software

Figure 1a: Multilevel Visualization of Market Power Cases in Tree Format Figure 1b: Multilevel

Agenda 1. More on multilevel R formulas 2. Generalized Multilevel Models 3. GMLM in R 1 More

Multilevel Krylov Methods Deflation Deflation, DD, MG Reinhard Nabben Multilevel Krylov

Floor Chris Campbell, PE Senior Fire Engineer Reddit.com mealsharedotorg Stairwell Queues 2

CS171 Introduction to Computer Science II Priority Queues and Binary Heap Priority Queues and

CSCI 104 Priority Queues / Heaps Mark Redekopp David Kempe Sandra Batista 2 PRIORITY QUEUES

CS200: Priority Queues, Heaps Prichard Ch. 12 CS200 - Tables and Priority Queues 1 Priority

Priority Queues (Heaps) October 11, 2016 CMPE 250 Priority Queues October 11, 2016 1 / 29

Priority Queues 3rd October 2019 Priority Queues Binary heaps Leftist heaps

Analysis of Scaling Algorithms for Matrix &amp; Operator Scaling Contents Scaling Algorithms

Time-Frequency Analysis Time Frequency Analysis in Visual Signal Yetmen Wang AnCAD, Inc.

Agenda Motivation Multipath approaches Modeling Experiments and results

3Q FY19 Financial Results 11 July 2019 Disclaimer This presentation is for information only and

C S 3 5 4 R e p r e s e n t i n g Ma p s : T o p o l o g i c a l

CS 574: Randomized Algorithms Lecture 4. Occupancy Problems, Balls in Bins September 3, 2015

Autoscaling Effects in Speed Scaling Systems Carey Williamson Department of Computer Science

2Q &amp; 1HFY20/21 Financial Results 27 October 2020 Important Notice This presentation shall be

Typical DO Sag Curve David Reckhow CEE 577 #16 2 1 CEE 577 Lecture #16 10/23/2017

PV Systems - Components and Concepts Maximum Power Point Tracking (MPPT) Week 7.3 Arno Smets

Analysis of Scaling Algorithms for Matrix & Operator Scaling Contents Scaling Algorithms

2Q & 1HFY20/21 Financial Results 27 October 2020 Important Notice This presentation shall be