handling flash deals with soft guarantee in hybrid cloud
play

Handling Flash Deals with Soft Guarantee in Hybrid Cloud Yipei Niu 1 - PowerPoint PPT Presentation

Handling Flash Deals with Soft Guarantee in Hybrid Cloud Yipei Niu 1 , Fangming Liu 1 , Xincai Fei 1 , Bo Li 2 Email: fmliu@hust.edu.cn 1 Huazhong University of Science & Technology 2 The Hong Kong University of Science & Technology 1


  1. Handling Flash Deals with Soft Guarantee in Hybrid Cloud Yipei Niu 1 , Fangming Liu 1 , Xincai Fei 1 , Bo Li 2 Email: fmliu@hust.edu.cn 1 Huazhong University of Science & Technology 2 The Hong Kong University of Science & Technology 1

  2. What are flash deals? n Amazon Prime Day q Prime Day is a one-day-only global shopping event q New deals are released as often as every five minutes n New iPhones pre-order q iPhone 6 preorders were slated to start at midnight n WeChat red envelope q WeChat has offered virtual red envelope containing virtual money that can be cashed out Flash deals offer benefits to subscribers within short time! q Only the first some persons would be able to share the envelope and hence the money 2

  3. Fast & Simple Simple Easy to get n q One click on mouse q Shake smartphone Straightforward business logic n q First some persons win Front-end Fast Limited profit Notification n q Discounted merchandise service q Newly released iPhone Worker q WeChat Red Envelope Short duration n Refresh every 5 minutes p Midnight on release day p Storage Spring Festival Gala p 3

  4. Yet crowded n Sales on Amazon’s Prime Day exceeded Black Friday in 2014 n The times of shaking phones reached a total of 11 billion and a peak of 810 million per minute n The pre-orders exceeded two million in the first 24 hours, making Apple Store unresponsive How to handle such fast, simple, and crowded flash deals? 4

  5. Is private cloud OK? n Private cloud q Dedicated datacenter or server cluster q Virtual resources provided by cloud providers Private cloud n Private cloud solution q Advantages n Requirement of security n Enhanced security q Protect confidential data n Ultimate control q Disadvantages n Limited capacity n Requirement of performance n Low scalability q Maximum uptime n Complex to operate q Fast page load time How to increase capacity and improve scalability? 5

  6. To buy or To rent? n Cost Increases linearly To buy: q Infrastructure n Unable to scale up or down based on workloads q Temporary use Low price n To rent: Auto scaling n + q Scalable capacity q Easy to operate Potentially unlimited n resources Hybrid cloud solution is a promising choice! 6

  7. Is hybrid cloud enough? n Revisit flash deals q Flash deals always bring benefits q Flash deals involve simple operations n Postpone serving requests q Incentive to wait longer to get benefits q Serve partial requests instantly q Postpone serving others n One example q Instead of waiting for the results returned from the application tier (1, 2, 3, 4, 5 in left) q Web servers send responses back to users (2 in right) q Guarantee the requests served asynchronously within deadline (3, 4 in right) 7

  8. Hybrid cloud with soft guarantee ü Obtaining the best performance ü Preventing cost from exceeding budget ü Assigning partial requests to the asynchronous process ü Distributing workloads between the private and public clouds n Problems q Without prior knowledge of requests q How to schedule requests q How to adjust the scale of public cloud 8

  9. Modeling flash deal applications n Single-tier Architecture [1][2][3][4] q Request arrival follows Poison process q Service time is generally distributed q Model the application as an M/G/1/PS queue q Response time in queue n Multi-tier Architecture [5][6][7] q Lemma 1. the arrival rate , when the queueing system is stable q Response time in queue 9

  10. Extending multi-tier with service degradation ... ... ... Interactive µ 1 µ K process ... K th tier 1 st tier Asynchronous process Message Asynchronous Web tier queue service n Service degradation q Each message binds to a series of tasks q Classify messages into different priority classes q Model the asynchronous process as a priority queue 10

  11. Evaluating response time n Private cloud – single tier is the number of requests assigned to the private cloud q during the t th time slot q Model flash deals in private cloud as single-tier architecture q Response time can be evaluated as n Public cloud – multi tier with soft guarantee is the number of requests assigned to the public cloud q during the t th time slot q Model flash deals in private cloud as multi-tier architecture q Interactive process q Asynchronous process n Hybrid cloud q Response time can be evaluated as 11

  12. Request scheduling problem ü Stage 2: service degradation ü Stage 1: workload distribution n Workload distribution n Service degradation q Distributing requests q Assigning partial requests between the private and to asynchronous process public clouds 12

  13. Capacity adjusting problem n We set a budget n The number of EC2 instances a tenant can boot is n The decision on leasing n EC2 instances is n Performance-Cost ratio of leasing n EC2 instances n Problem formulation Online NP hard problem Controlling cost PC ratio Capacity decision 13

  14. Capacity adjusting algorithm n Define a partial linear problem on n The partial linear problem n The corresponding dual problem represents the optimal solution to problem q n Decision on capacity adjustment 14

  15. Capacity adjusting algorithm n Two special cases q If q If n The algorithm becomes ineffective n Inspired by the existing literature [8][9], we make Assumption 1 n Summary of capacity adjusting algorithm Dual problem Dual problem Origin problem on [0, s] on [0, m] on [0, m] 15

  16. Optimality analysis n Q1: is the same to ? n Q2: is accurate enough as a substitute to ? n Q3: how much is the gap between and ? n Q4: how much is the gap between OPT and the algorithm? 16

  17. Evaluation n Real world trace q Online traffic in U.S. on Cyber Monday measured by Akamai n Testbed q A private cloud on two servers with OpenStack Mitaka q A public cloud 20 EC2 large type instances on AWS n Implementation q The web tier is deployed by an Apache HTTP server q Two Tomcat 9.0 servers as the application tier, q Use HttpClient 4.5.2 to generate requests q A Servlet querying records of a table from a MySQL database 17

  18. Evaluation n Obviously, scheduling more requests to the asynchronous process can reduce response time remarkably n The response time of the asynchronous process can be controlled within predefined deadline 18

  19. Evaluation n Compared with CEOA, CAA reduces response time by 15% and improves the PC ratio by 19% on average, respectively 19

  20. Conclusion n We proposed a solution for flash deal applications to withstand flash crowds in a hybrid cloud n Concerning scheduling requests, we achieved fast response time of the interactive process as well as guaranteed requests served in the asynchronous process within a predefined deadline n In terms of adjusting capacity, we tuned scale of the public cloud with the objectives of performance-cost ratio maximization as well as outsourcing cost minimization. n Compared with previous work, our solution reduced response time by 15% on average and effectively maintained cost within the budget.

  21. Reference No. Paper Source [1] Modeling differentiated services of multi-tier web applications MASCOTS06 EC ’01 [2] Preserving qos of e- commerce sites through self-tuning: A performance model approach [3] Autonomous resource provisioning for multi-service web applications WWW’10 [4] Provisioning Servers in the Application Tier for E-commerce Systems IWQoS’04 [5] Agile dynamic provisioning of multi-tier internet applications TAAS [6] Con- trolling quality of service in multi-tier web applications ICDCS’06 [7] An analytical model for multi-tier internet services and its applications SIGMETRICS’05 [8] The adwords problem: Online keyword matching with budgeted bidders under random EC’05 permutations [9] A dynamic near-optimal algorithm for online linear programming Operations research 21

  22. Q&A Thank You! “Cloud Datacenter & Green Computing” Research Group Huazhong University of Science & Technology http://grid.hust.edu.cn/fmliu/ fmliu@hust.edu.cn 22

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend