frequency scaling in multilevel queues
play

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam - PowerPoint PPT Presentation

Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam Elahi 1 , 3 Andrea Marin 2 Sabina Rossi 2 Carey Williamson 3 1 Mount Royal University, Canada 2 Universit` a CaFoscari Venezia, Italy 3 University of Calgary, Canada 1 Talk


  1. Frequency Scaling in Multilevel Queues IFIP Performance 2020 Maryam Elahi 1 , 3 Andrea Marin 2 Sabina Rossi 2 Carey Williamson 3 1 Mount Royal University, Canada 2 Universit` a Ca’Foscari Venezia, Italy 3 University of Calgary, Canada 1

  2. Talk overview Introduction and Contribution The queueing model and its solution Case study Conclusion 2

  3. Introduction and Contribution 3

  4. Size based scheduling • Goal: serve first the smallest jobs to improve the expected response time • Assumptions: • Do we know the job size at the arrival epoch? ( S hortest R emaining P rocessing T ime) • Do we know the distribution of the job size? (Gittin’s policy) • Is the job size distribution heavy tailed? (Least Attained Service (LAS), Multilevel queues) 4

  5. How do multilevel queues work? Example: two levels with P rocessor S haring (PS) server 5

  6. Frequency scaling • Goal: Energy saving • Main idea: When there are few jobs in the system, we can reduce the processor speed • We increase the expected response time w.r.t. constant speed only for the lucky jobs • The service time is directly proportional to the service speed f • The power consumption depends on the service speed as f α , were 2 ≤ α ≤ 3 • Linear frequency scaling: The server speed is proportional to the number of jobs in the system 6

  7. Related work (short list) • Policies independent of the received service • (George, Harrison): FCFS queues and frequency scaling • (Wierman et al.): M/G/1/PS queues with frequency scaling • Policies with known job size • (Bansal et al.; Andrew et al.): Worst case analysis of SRPT with frequency scaling • (Andrew at al.; Elahi, Williamson): Unfairness in SRPT with frequency scaling • (Lassila; Aalto): LAS with sleeping servers 7

  8. Contribution • We study a two-level queue with PS discipline and linear speed scaling on the low-priority jobs (PS+IS) • We give a numerical solution of the queueing system and validate it with discrete event simulation • We study the behaviour of the model with job size distributions obtained by monitoring TCP flows of a data centre 8

  9. The queueing model and its solution 9

  10. Graphical representation Processor Sharing (PS) X Jobs with size < a Poisson( λ ) a X − a Jobs with size ≥ a Infinite Server (IS) Note: The IS system works only when the PS is idle 10

  11. Job size distribution • We use Generalized Hyperexponential (GH) distributions K � p k µ k e − p k µ k f X ( x ) = k =1 where � K k =1 p k = 1, p k ∈ R , µ k ∈ R + , f X ( x ) > 0 for all x ∈ R + • GH distributions are dense in the domain of the distributions • They can approximate any distribution arbitrary well 11

  12. Analysis of the queue: sketch • We can see the system consisting of two queues: • High priority one which is M/G/1/PS whose job sizes are truncated at a • Low priority one which works during the idle periods of the PS which is a M B /G/ ∞ queue • The arrival process is Poisson with intensity λ • The batch size is the number of jobs that crossed the threshold during a busy period of the PS level • The generating function of the batch size distribution has not an explicit form but has a characteristic equation (Kleinrock) • The solution of the IS queue requires the distribution of the batch size 12

  13. Computation of the batch size distribution • We invert the generating function with the Lattice-Poisson algorithm by Abate and Whitt • The evaluation of the generating function is obtained with a fixed point algorithm whose convergence is proved by resorting to Banach’s contraction mapping fixed point theorem • The accuracy of the numerical procedure is validated in low and heavy-load by comparing the first two moments of the distribution (which can be computed explicitly for GH distributions from the characteristic equation) with those obtained by the numerical inversion 13

  14. Computation of the power consumption • We resort to the literature for the power consumed by the PS queue • We provide a numerical solution for the IS queue and integer values of the exponent α • The power consumption is derived from the second ( α = 2) and third ( α = 3) moments of the occupancy distribution in the IS queue 14

  15. Case study 15

  16. Dataset • TCP flows monitored at the data centre of the Universit` a Ca’ Foscari Venezia in November 2019 • Fitting with PH-Fit into an acyclic phase-type distribution • Transformation of the acyclic phase-type distribution into a GH 10 -1 10 0 1 10 -2 0.9 10 -1 10 -3 0.8 0.7 10 -4 10 -2 0.6 10 -5 0.5 10 -3 10 -6 0.4 0.3 10 -7 10 -4 0.2 10 -8 0.1 10 -9 0 10 -5 10 -1 10 0 10 1 10 2 10 3 10 4 10 5 10 -1 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 8 10 -1 10 0 10 1 10 2 10 3 10 4 10 5 (a) Probability density (b) Empirical and (c) Complementary function in log-log analytical cumulative cumulative density scale. density function in function in log-log log-linear scale. scale. 16

  17. PS+IS vs. PS: Comparison of the expected response time • PS queue has speed 1 and IS has speed f < 1 4 4 3.5 3.5 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 0 5 10 15 0 5 10 15 10 4 10 4 (a) Expected response time: (b) Expected response time: ρ PS = 0 . 85. ρ PS = 0 . 92. 17

  18. PS+IS vs. PS: Comparison of the power consumption 1 1.3 1.25 0.95 1.2 1.15 0.9 1.1 0.85 1.05 1 0.8 0.95 0.9 0.75 0.85 0.7 0.8 0 0.5 1 1.5 2 2.5 3 3.5 4 0 0.5 1 1.5 2 2.5 3 3.5 4 10 4 10 4 (a) Power consumption: (b) Power consumption: ρ PS = 0 . 85 when 0 ≤ a ≤ 4 · 10 4 . ρ PS = 0 . 92 when 0 ≤ a ≤ 4 · 10 4 . 18

  19. PS+IS vs. PS: Slowdown 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 (a) Slowdown of PS+IS (b) Slowdown of PS+IS conditioned to the job size x with conditioned to the job size x with ρ PS = 0 . 85. ρ PS = 0 . 92. 19

  20. PS+IS vs. PS: Comparison of the expected response times with same power consumption 2 1 2 1 1.8 0.9 1.8 0.9 1.6 0.8 1.6 0.8 1.4 0.7 1.4 0.7 1.2 0.6 1.2 0.6 1 0.5 1 0.5 0.8 0.4 0.8 0.4 0.6 0.3 0.6 0.3 0.4 0.2 0.4 0.2 0.2 0.1 0.2 0.1 0 0 0 0 0 5 10 15 0 5 10 15 10 4 10 4 (a) Comparison of the expected (b) Comparison of the expected response time with ρ PS = 0 . 70 and response time with ρ PS = 0 . 70 and f = 0 . 10. f = 0 . 15. 20

  21. Simulation • Simulation has been used to cross validate the numerical results • Simulation allows the investigation of other characteristics of the system such as the distribution of the system speed Speed of PS+IS with ; PS =0.70, a = 2 " 10 3 Speed of PS+IS with ; PS =0.85, a = 2 " 10 3 0.45 0.45 f = 0 : 05 f = 0 : 05 0.4 0.4 f = 0 : 10 f = 0 : 10 0.35 f = 0 : 15 0.35 f = 0 : 15 0.3 0.3 Probability Probability 0.25 0.25 0.2 0.2 0.15 0.15 0.1 0.1 0.05 0.05 0 0 0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3 Speed Speed (a) System speed: ρ PS = 0 . 70. (b) System speed: ρ PS = 0 . 85. 21

  22. Conclusion 22

  23. Conclusion • We have introduced a two-level queueing system (PS+IS) with linear speed scaling for the low-priority level • A numerical solution procedure has been proposed and its accuracy has been validated with discrete event simulation • Experiments on real-world job size distributions have been carried out • We showed that the model-driven configuration of the PS+IS system is crucial for obtaining the benefits of the speed scaling without compromising the slowdown of the system too much 23

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend