Handling Flash Deals with Soft Guarantee in Hybrid Cloud Yipei Niu 1 - - PowerPoint PPT Presentation

handling flash deals with soft guarantee in hybrid cloud
SMART_READER_LITE
LIVE PREVIEW

Handling Flash Deals with Soft Guarantee in Hybrid Cloud Yipei Niu 1 - - PowerPoint PPT Presentation

Handling Flash Deals with Soft Guarantee in Hybrid Cloud Yipei Niu 1 , Fangming Liu 1 , Xincai Fei 1 , Bo Li 2 Email: fmliu@hust.edu.cn 1 Huazhong University of Science & Technology 2 The Hong Kong University of Science & Technology 1


slide-1
SLIDE 1

Handling Flash Deals with Soft Guarantee in Hybrid Cloud

Yipei Niu1, Fangming Liu1, Xincai Fei1, Bo Li2 Email: fmliu@hust.edu.cn

1Huazhong University of Science & Technology 2The Hong Kong University of Science & Technology 1

slide-2
SLIDE 2

What are flash deals?

2

n Amazon Prime Day

q Prime Day is a one-day-only global shopping event q New deals are released as often as every five minutes

n New iPhones pre-order

q iPhone 6 preorders were slated to start at midnight

n WeChat red envelope

q WeChat has offered virtual red envelope containing

virtual money that can be cashed out

q Only the first some persons would be able to share the

envelope and hence the money

Flash deals offer benefits to subscribers within short time!

slide-3
SLIDE 3

Fast & Simple

3

Simple

n

Easy to get

q One click on mouse q Shake smartphone

n

Straightforward business logic

q First some persons win

Fast

n

Limited profit

q Discounted merchandise q Newly released iPhone q WeChat Red Envelope

n

Short duration

p

Refresh every 5 minutes

p

Midnight on release day

p

Spring Festival Gala

Front-end Storage Worker Notification service

slide-4
SLIDE 4

Yet crowded

4

How to handle such fast, simple, and crowded flash deals?

n Sales on Amazon’s Prime Day exceeded Black Friday

in 2014

n The times of shaking phones reached a total of 11

billion and a peak of 810 million per minute

n The pre-orders exceeded two million in the first 24

hours, making Apple Store unresponsive

slide-5
SLIDE 5

Is private cloud OK?

5

Private cloud

n Requirement of security

q Protect confidential data

n Private cloud

q Dedicated datacenter

  • r server cluster

q Virtual resources provided

by cloud providers

n Private cloud solution

q Advantages

n Enhanced security n Ultimate control

q Disadvantages

n Limited capacity n Low scalability n Complex to operate

n Requirement of performance

q Maximum uptime q Fast page load time

How to increase capacity and improve scalability?

slide-6
SLIDE 6

To buy or To rent?

+

n

Low price

n

Auto scaling

q Scalable capacity q Easy to operate n

Potentially unlimited resources

To rent: To buy:

n Cost Increases linearly q Infrastructure n Unable to scale up or

down based on workloads

q Temporary use

Hybrid cloud solution is a promising choice!

6

slide-7
SLIDE 7

Is hybrid cloud enough?

7

n Revisit flash deals

q Flash deals always bring benefits q Flash deals involve simple operations

n Postpone serving requests

q Incentive to wait longer to get benefits q Serve partial requests instantly q Postpone serving others

n One example

q Instead of waiting for the results returned from the

application tier (1, 2, 3, 4, 5 in left)

q Web servers send responses back to users (2 in right) q Guarantee the requests served asynchronously within

deadline (3, 4 in right)

slide-8
SLIDE 8

Hybrid cloud with soft guarantee

8

ü Assigning partial requests to the asynchronous process ü Distributing workloads between the private and public clouds ü Obtaining the best performance ü Preventing cost from exceeding budget

n Problems

q Without prior knowledge of requests q How to schedule requests q How to adjust the scale of public cloud

slide-9
SLIDE 9

Modeling flash deal applications

9

n Single-tier Architecture [1][2][3][4]

q Request arrival follows Poison process q Service time is generally distributed q Model the application as an M/G/1/PS queue q Response time in queue

n Multi-tier Architecture [5][6][7]

q Lemma 1. the arrival rate

, when the queueing system is stable

q Response time in queue

slide-10
SLIDE 10

Extending multi-tier with service degradation

10

Asynchronous service Web tier Message queue

... ... µ1 1st tier ... µK Kth tier ...

n Service degradation

q Each message binds to a series of tasks q Classify messages into different priority classes q Model the asynchronous process as a priority queue

Interactive process Asynchronous process

slide-11
SLIDE 11

Evaluating response time

11

n Private cloud – single tier

q

is the number of requests assigned to the private cloud during the tth time slot

q Model flash deals in private cloud as single-tier architecture q Response time can be evaluated as

n Public cloud – multi tier with soft guarantee

q

is the number of requests assigned to the public cloud during the tth time slot

q Model flash deals in private cloud as multi-tier architecture q Interactive process q Asynchronous process

n Hybrid cloud

q Response time can be evaluated as

slide-12
SLIDE 12

Request scheduling problem

12

n Workload distribution

q Distributing requests

between the private and public clouds

ü Stage 2: service degradation ü Stage 1: workload distribution

n Service degradation

q Assigning partial requests

to asynchronous process

slide-13
SLIDE 13

Capacity adjusting problem

13

n We set a budget n The number of EC2 instances a tenant can boot is n The decision on leasing n EC2 instances is n Performance-Cost ratio of leasing n EC2 instances n Problem formulation

PC ratio Capacity decision Controlling cost

Online problem NP hard

slide-14
SLIDE 14

Capacity adjusting algorithm

14

n Define a partial linear problem on n The partial linear problem n The corresponding dual problem

q

represents the optimal solution to problem

n Decision on capacity adjustment

slide-15
SLIDE 15

Capacity adjusting algorithm

15

n Two special cases

q If q If

n The algorithm becomes ineffective n Inspired by the existing literature [8][9], we make

Assumption 1

n Summary of capacity adjusting algorithm

Dual problem

  • n [0, s]

Dual problem

  • n [0, m]

Origin problem

  • n [0, m]
slide-16
SLIDE 16

Optimality analysis

16

n Q1: is the same to ? n Q2: is accurate enough as a substitute to ? n Q4: how much is the gap between OPT and the algorithm? n Q3: how much is the gap between and ?

slide-17
SLIDE 17

Evaluation

17

n Real world trace

q Online traffic in U.S. on

Cyber Monday measured by Akamai

n Testbed

q A private cloud on two servers with OpenStack Mitaka q A public cloud 20 EC2 large type instances on AWS

n Implementation

q The web tier is deployed by an Apache HTTP server q Two Tomcat 9.0 servers as the application tier, q Use HttpClient 4.5.2 to generate requests q A Servlet querying records of a table from a MySQL

database

slide-18
SLIDE 18

Evaluation

18

n Obviously, scheduling more requests to the asynchronous

process can reduce response time remarkably

n The response time of the asynchronous process can be

controlled within predefined deadline

slide-19
SLIDE 19

Evaluation

19

n Compared with CEOA, CAA reduces response time by 15%

and improves the PC ratio by 19% on average, respectively

slide-20
SLIDE 20

Conclusion

n We proposed a solution for flash deal applications to

withstand flash crowds in a hybrid cloud

n Concerning scheduling requests, we achieved fast

response time of the interactive process as well as guaranteed requests served in the asynchronous process within a predefined deadline

n In terms of adjusting capacity, we tuned scale of the

public cloud with the objectives of performance-cost ratio maximization as well as outsourcing cost minimization.

n Compared with previous work, our solution reduced

response time by 15% on average and effectively maintained cost within the budget.

slide-21
SLIDE 21

Reference

21

No. Paper Source [1]

Modeling differentiated services of multi-tier web applications MASCOTS06

[2]

Preserving qos of e- commerce sites through self-tuning: A performance model approach EC ’01

[3]

Autonomous resource provisioning for multi-service web applications WWW’10

[4]

Provisioning Servers in the Application Tier for E-commerce Systems IWQoS’04

[5]

Agile dynamic provisioning of multi-tier internet applications TAAS

[6]

Con- trolling quality of service in multi-tier web applications ICDCS’06

[7]

An analytical model for multi-tier internet services and its applications SIGMETRICS’05

[8]

The adwords problem: Online keyword matching with budgeted bidders under random permutations EC’05

[9]

A dynamic near-optimal algorithm for online linear programming Operations research

slide-22
SLIDE 22

Q&A Thank You!

“Cloud Datacenter & Green Computing” Research Group Huazhong University of Science & Technology

http://grid.hust.edu.cn/fmliu/ fmliu@hust.edu.cn

22