a receding horizon approach for the runtime management of
play

A Receding Horizon Approach for the Runtime Management of IaaS - PowerPoint PPT Presentation

A Receding Horizon Approach for the Runtime Management of IaaS Cloud Systems www.modaclouds.eu Danilo Ardagna, Michele Ciavotta {danilo.ardagna, michele.ciavotta}@polimi.it Politecnico di Milano Riccardo Lancellotti


  1. A Receding Horizon Approach for the Runtime Management of IaaS Cloud Systems www.modaclouds.eu Danilo Ardagna, Michele Ciavotta {danilo.ardagna, michele.ciavotta}@polimi.it Politecnico di Milano Riccardo Lancellotti riccardo.lancellotti@unimore.it Università di Modena e Reggio Emilia

  2. Agenda ∗ Introduction ∗ Problem ∗ Problem statement and design assumption ∗ Receding Horizon algorithm ∗ Experimental Analysis ∗ Conclusions 2

  3. Introduction The advent of Cloud Computing changed dramatically the ICT industry ∗ Google, Amazon, Microsoft, Salesforce, Oracle, SAP, SoftLayer, Rackspace etc… ∗ Cost-efgective solutions ∗ Computational power ∗ Reliability ∗ Auto-scaling New business paradigms appeared on the market ∗ IaaS, PaaS, SaaS ∗ But also DaaS, BDaaS, HDaaS, etc… 3

  4. Introduction: challenges The growing popularity of Cloud Computing opens new challenges ∗ Vendor lock-in ∗ Design for Quality of Service (QoS) guarantees ∗ Managing the lifecycle of a Cloud application ∗ Managing Elasticity ∗ Resource Provisioning ∗ Self-adaptation 4

  5. Introduction : resource provisioning Resource Provisioning: mechanism for leasing and releasing virtual cloud resources to guarantee adequate QoS … it requires management solutions that support ∗ Performance prediction, ∗ Monitoring of Service Level Agreements (SLA), ∗ Adaptive re-confjguration actions. T ools currently supplied by IaaS providers, are often too basic and inadequate for ∗ Highly variable workload, ∗ Applications with a dynamic behavior characterized by uncertainty. 5

  6. Introduction: our approach Proposal: a fast and efgective Capacity Allocation technique ∗ based on the Receding Horizon control strategy ∗ integrated within MODAClouds runtime platform ∗ that minimizes the execution costs of a Cloud application, ∗ guaranteeing QoS constraints expressed in terms of average response time 6

  7. Agenda ∗ Introduction ∗ Problem ∗ Problem statement and design assumption ∗ Receding Horizon algorithm ∗ Experimental Analysis ∗ Conclusions 7

  8. Problem: design assumptions Perspective of a Software-as-a-Service (SaaS) provider hosting his/her applications on an Infrastructure-as-a-Service (IaaS) provider Applications are single tier hosted in virtual machines (VMs) that are dynamically instantiated by the IaaS provider Each VM hosts a single WS application Multiple homogeneous VMs implementing the same WS application can run in parallel

  9. Problem: design assumptions Each WS class hosted in a VM is modeled as an M/G/1 queue in tandem with a delay center SLA based on the average response time: every WS class has to provide a response time lower than a threshold

  10. Problem: design assumptions IaaS providers charge software providers on an hourly basis ∗ reserved VMs ( time-unit cost) ∗ on demand VMs ( time-unit cost ) Time management: ∗ Time slots: (5, 10 min) ∗ Time window: ( 1-5 ) ∗ Charging interval: (60 min)

  11. Problem: formulation Time unit costs Time management Freely available VMs Workload prediction CA plan

  12. Problem: formulation The CA problem can be formulated as: Total cost Subject to the conditions: Response time limited number of reserved VMs 12

  13. Receding Horizon Algorithm 1 1 First slot configuration ( r k , d k ) Solve Optimization In a nutshell, the Capacity Allocation problem Optimizer Model is solved for every time slot in but only Optimal solution the actions concerning the fjrst forthcoming time slot are enacted. IaaS Receding Horizon controller Interface Clock Predicted workload Update Model Parameters 1 n w ( b k , . . . , b ⇤ ⇤ ) k Cloud Monitoring Application Platform 13

  14. Receding Horizon Algorithm 14

  15. Agenda ∗ Introduction ∗ Problem ∗ Problem statement and design assumption ∗ Receding Horizon algorithm ∗ Experimental Analysis ∗ Conclusions 15

  16. Experimental Analysis Scalability: ∗ Large set of randomly generated instances ∗ Daily distribution of requests from real log traces Comparison with state of the art approaches: ∗ Heuristic ∗ Oracle with perfect knowledge of the future Time scale analysis: ∗ SLA violations 16

  17. Experiment Design Workload prediction ∗ Incoming workload has been obtained for traces of a very large dynamic web-based system ∗ Difgerent workload for each WS class ∗ Prediction obtained by adding white noise to each sample ∗ Noise proportional to the arrival rate ∗ Inaccuracy increases with the time slot Performance parameters ∗ Service rate ∗ Queueing delay ∗ Reserved instances Instance cost ∗ Randomly generated considering prices currently charged by common IaaS providers 17

  18. Experiment Design T raffjc profjles: ∗ Normal workload with low noise ∗ Normal workload with high noise ∗ Spiky workload with low noise ∗ Spiky workload with high noise The difgerent levels of noise corresponds to: 18

  19. Scalability The analysis demonstrated that our approach scales almost linearly with respect to the number of request classes. Systems up to 160 classes and 5 time slots can be solved in less than 200 sec. 19

  20. Cost – Normal traffjc • 10 minutes time 530 scale 480 • Low noise level 430 S-t Algorithm 380 ] t[$ Heu (40,50) os Heu (50,60) C 330 Heu (60,80) Oracle 280 230 180 1 2 3 4 T w Costs comparison

  21. Cost – Spiky traffjc • 5 minutes time 880 scale 780 • Low noise level 680 S-t Algorithm 580 ] t[$ Heu (40,50) s o Heu (50,60) C 480 Heu (60,80) Oracle 380 280 180 1 2 3 4 5 T w Costs comparison 21

  22. Time scale analysis 7000" Goal: evaluate the impact of time scale on the proposed receding horizon algorithm 6000" Analyses have been supported by a discrete event 5000" simulator based on the Omnet++ framework created on purpose. )' s m '( 4000" ∗ able to capture the time-varying performance degradation IME 'T due to resource contention via Random Environments (REs) E S N O P 3000" S E Performance indicators considered: R ∗ SLA violation (the percentage of time slots where the 2000" average response time exceeds the SLA thresholds ∗ Dropped request (the percentage of requests dropped as a 1000" result of the fjnite queue length) 0" 22 0" " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " 10" " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " " "" " " " " " " " " " " " " " " " " " " " " " " 20 T IME '(m in )'

  23. Time scale analysis The values are related to a 24 hours analysis with low noise and averaged over 10 executions. A control time granularity of 5 minutes tends to provide better performance if compared to granularity of 10 minutes both in terms of SLA violations and in terms of dropped requests. 23

  24. Agenda ∗ Introduction ∗ Problem ∗ Problem statement and design assumption ∗ Receding Horizon algorithm ∗ Experimental Analysis ∗ Conclusions 24

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend