response time optimized distributed cloud resource
play

Response Time-Optimized Distributed Cloud Resource Allocation - PowerPoint PPT Presentation

Response Time-Optimized Distributed Cloud Resource Allocation Matthias Keller Holger Karl Computer Networks Group Universitt Paderborn Minimizing response times Latency-critical service Interactive, emergency service request t


  1. Response Time-Optimized Distributed Cloud Resource Allocation Matthias Keller Holger Karl Computer Networks Group Universität Paderborn

  2. Minimizing response times • Latency-critical service • Interactive, emergency service request t Response Time answer DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 21

  3. Minimizing response times • Latency-critical service • Interactive, emergency service request t Time to compute Response Time the answer answer DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 22

  4. Minimizing response times request t Queuing System: Time in Queue Response Time + Processing Time Time in System (TIS) answer DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 23

  5. Minimizing response times • Latency-critical service • Interactive, emergency service • Decision: Spend time on RTT or TIS request t Queuing System: Time in Queue Response Time + Processing Time Time in System (TIS) answer DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 24

  6. Minimizing response times • Latency-critical service • Interactive, emergency service • Decision: Spend time on RTT or TIS request Time in System (TIS) Response Time answer DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 25

  7. Example: RTT + TIS • Demand assignment • Facility Location Solution with RTT only 14 RTT solution 12 average response time 10 8 Better 6 4 2 0 0 10 20 30 40 50 arrival rate DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 26

  8. Example: RTT + TIS • Demand assignment • Facility Location Solution with RTT only 14 RTT solution 12 average response time RTT solution with TIS 10 8 Better 6 4 2 0 0 10 20 30 40 50 arrival rate DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 27

  9. Example: RTT + TIS • Demand assignment • Facility Location Solution with RTT only 14 RTT solution 12 average response time RTT solution with TIS 10 8 Better 6 Surprise at 4 runtime 2 0 0 10 20 30 40 50 arrival rate DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 28

  10. Example: RTT + TIS • Demand assignment • Facility Location Solution with RTT only • With RTT + TIS 14 RTT solution 12 average response time RTT solution with TIS 10 RTT+TIS solution 8 Better 6 4 2 0 0 10 20 30 40 50 arrival rate DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 29

  11. Goal Given • Network • Data centres Objective • Minimize response time Means • Allocation of n VMs at data centres DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 30

  12. Goal Given Characterise: • How does response time • Network depend on number n of VMs? • Data centres Objective • Minimize response time Means • Allocation of n VMs at data Response Time centres Optimal Solutions Number of VMs DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 31

  13. Two Approaches Accurate Solution • Mixed Integer Convex Problem • Convex TIS function for each data centre 10 time in system 8 6 4 2 0 0.00 0.30 0.60 0.90 utilization • Tough to solve – slow? DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 32

  14. Two Approaches Accurate Solution Approximate Solution • Mixed Integer Convex Problem • Reformulation: Mixed Integer Linear Problem • Convex TIS function for each • Linearization of TIS function data centre 10 10 original time in system time in system 8 8 uniform 6 6 4 4 2 2 0 0 0.00 0.30 0.60 0.90 0.00 0.30 0.60 0.90 utilization utilization • Accuracy? Speed? • Tough to solve – slow? DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 33

  15. Improve accuracy of linearization • Objective: 10 • Minimize the maximum difference original time in system 8 uniform • Control knobs 6 4 • Number of basepoints 2 • End point at asymptote 0 • Basepoint positions utilization 2.5 difference to org. Better 2.0 1.5 1.0 0.5 0.0 0.00 0.30 0.60 0.90 34 DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation

  16. Improve accuracy of linearization • Objective: 10 • Minimize the maximum difference original time in system 8 uniform • Control knobs 6 imamoto 4 • Number of basepoints 2 • End point at asymptote 0 • Basepoint positions utilization 2.5 difference to org. Better 2.0 • Evaluation in Paper 1.5 1.0 0.5 0.0 0.00 0.30 0.60 0.90 35 DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation

  17. Evaluation of both approaches Convex Problem Linear Problem • Reference Solution • Approximate Solution • Tough to solve – slow? • Accuracy? Speed? Linearization DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 36

  18. Evaluation of both approaches Convex Problem Linear Problem • Reference Solution • Approximate Solution • Tough to solve – slow? • Accuracy? Speed? Linearization Configurations • 6 topologies, 12 – 54 nodes • à 50 random demand realizations • 10 data centre fix DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 37

  19. Evaluation of both approaches Convex Problem Linear Problem • Reference Solution • Approximate Solution • Tough to solve – slow? • Accuracy? Speed? Linearization VM limit: 5 – 10 Configurations • 6 topologies, 12 – 54 nodes • à 50 random demand realizations • 10 data centre fix DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 38

  20. Results – Approximation Ratio approx. ratio = Resp.time Linear Resp.time Convex 1.16 Better 1.14 approximation ratio 1.12 1.10 1.08 1.06 1.04 1.02 1.00 dfn-bwin di-yuan norway atlanta zib54 ta2 topology large small DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 39

  21. Results – Approximation Ratio approx. ratio = Resp.time Linear Resp.time Convex 1.16 Better 1.14 approximation ratio 1.12 1.10 1.08 1.06 1.04 1.02 1.00 dfn-bwin di-yuan norway atlanta zib54 ta2 topology large small DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 40

  22. Results – Runtime Ratio runtime ratio = Runtime Linear Runtime Convex 10 0 Better 10 -1 runtime ratio 10 -2 10 -3 10 -4 dfn-bwin di-yuan atlanta norway zib54 ta2 topology large small DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 41

  23. Results – Runtime Ratio runtime ratio = Runtime Linear Runtime Convex 10 0 Better 10 -1 runtime ratio 10 -2 10 -3 Runtime Linear Runtime Convex 10 -4 2s 0:28h dfn-bwin di-yuan atlanta norway zib54 ta2 7s 1:03h 5s topology 1:15h large small DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 42

  24. Results – Runtime Ratio runtime ratio = Runtime Linear Runtime Convex 10 0 Better 10 -1 runtime ratio 10 -2 10 -3 Runtime Linear Runtime Convex 10 -4 2s 0:28h dfn-bwin di-yuan atlanta norway zib54 ta2 7s 1:03h 5s topology 1:15h large small DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 43

  25. Results – Optimal Solutions • More Resources: • Shorter time in queuing system • VMs at closer data centres 350 RTT 300 Response Time (ms) TIS 250 200 150 100 50 0 5 6 7 8 9 10 5 6 7 8 9 10 5 6 7 8 9 10 di-yuan norway dfn-bwin Resource Limit (#VM) DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 44

  26. Results – Optimal Solutions • More Resources: • Shorter time in queuing system • VMs at closer data centres 350 RTT 300 Response Time (ms) TIS 250 200 150 100 50 0 5 6 7 8 9 10 5 6 7 8 9 10 5 6 7 8 9 10 di-yuan norway dfn-bwin Resource Limit (#VM) DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 45

  27. In the paper… • Convex/Linear Problem Formulation • Facility Location Problem & queuing model • P-median facility location + convex cost function • P-median facility location + piecewise linear cost function • Piecewise Linear Function: Minimize maximal difference • Convexity Proof • Evaluation • Pareto optimal solutions • Compare linear/convex problem • Approx. Ratio • Runtime DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 46

  28. In conclusion… … adjust your latency-sensitive service: • Faster! • Adapt to demand fluctuations swiftly • Accurate! • With queuing delay – no surprises at runtime DCC 2014 Response Time-Optimized Distributed Cloud Resource Allocation 47

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend