H YBRID C LOUD R ESOURCE P ROVISIONING P OLICY IN THE P RESENCE OF R - PowerPoint PPT Presentation

H YBRID C LOUD R ESOURCE P ROVISIONING P OLICY IN THE P RESENCE OF R ESOURCE F AILURES Bahman Javadi University of Western Sydney, Australia Jemal Abawajy Deakin University, Australia 1 Richard O. Sinnott The University of Melbourne, Australia The 4 th IEEE International Conference on Cloud Computing Technology and Science Taiwan, December 2012

IEEE CloudCom 2012 A GENDA ¢ Introduction ¢ System Context ¢ Hybrid Cloud Architecture ¢ Proposed Provisioning Policies ¢ Performance Evaluation ¢ Simulation Results ¢ Conclusions 2

IEEE CloudCom 2012 I NTRODUCTION ¢ Hybrid Cloud Systems  Public Clouds  Private Clouds ¢ Resource Provisioning in Hybrid Cloud  Users’ QoS (i.e., deadline)  Resource failures ¢ Taking into account  Workload model à workflows in a scientific project  Failure correlations à real failure traces ¢ Knowledge-free approach : not any information about the failure model 3

IEEE CloudCom 2012 S YSTEM C ONTEXT ¢ Our policies are proposed in the context of the Australian Urban Research Infrastructure Network (AURIN) project  An e-Infrastructure supporting research in urban and built environment research disciplines  Web Portal Application (portlet-based) ¢ A lab in a browser (http://portal.aurin.org.au) ¢ Access to the federated data source ¢ Web Feature Service (WFS) ¢ Workflow environment based on Object Modeling System (OMS) ¢ NeCTAR NSP and Research Cloud 4

IEEE CloudCom 2012 T HE AURIN ARCHITECTURE 5

IEEE CloudCom 2012 H YBRID C LOUD A RCHITECTURE ¢ Based on InterGrid components ¢ Using a Gateway (IGG) as the broker InterGrid Gateway Management & Monitoring JMX Communication Module Message-Passing Scheduler Persistence DB Java Derby (Provisioning Policies & Peering) Virtual Machine Manager Local Grid IaaS Emulator Resources Middleware Provider IGG 6

IEEE CloudCom 2012 W ORKLOAD M ODEL ¢ Workflows in the AURIN project  Potentially large number of resources over a short period of time.  Several tasks that are sensitive to communication networks and resource failures ( tightly coupled ) ¢ User Requests  Type of virtual machine;  Number of virtual machines;  Estimated duration of the request;  Deadline for the request. 7

IEEE CloudCom 2012 F AILURES IN U SER R EQUESTS ¢ Resource failure is inevitable  Redundant components in public Clouds ¢ highly reliable service  Leads to service failure in private Clouds ¢ Correlation in Failures à overlapped failures  Spatial  Temporal 8

IEEE CloudCom 2012 F AILURES IN U SER R EQUESTS ( CONT .) ¢ The sequence of overlapped failures H = { F i | F i = ( E 1 , ..., E n ) , T s ( E i +1 ) ≤ T e ( E i ) } ¢ Downtime of the service X D = ( max { T e ( F i ) } − min { T s ( F i ) } ) 8 F i 2 H 9

IEEE CloudCom 2012 P ROPOSED P OLICIES ¢ Size-based Strategy  Spatial correlation : multiple failures occur on different nodes within a short time interval  Strategy: sends wider requests to more reliable public Cloud systems  Mean number of VMs per request ¢ P 1 : probability of one VM ¢ P 2 : probability of power of two VMs requests is given as follows: S = P 1 + 2 d k e ( P 2 ) + 2 k (1 − ( P 1 + P 2 )) ¢ Request size: two-stage uniform distribution ( l,m,h,q ) k = ql + m + (1 − q ) h 2 10

IEEE CloudCom 2012 P ROPOSED P OLICIES ( CONT .) ¢ Time-based strategy  Temporal correlation: the failure rate is time- dependent and some periodic failure patterns can be observed in different time-scales  Request duration: are long tailed . • The mean request duration Lognormal distribution in a • parallel production system T = e µ + σ 2 2 11

IEEE CloudCom 2012 P ROPOSED P OLICIES ( CONT .) ¢ Area-based strategy  Making a compromise between the size-based and time-based strategy  The mean area of the requests A = T · S  This strategy sends long and wide requests to the public Cloud,  It would be more conservative than a size-based strategy and less conservative than a time-based strategy. 12

IEEE CloudCom 2012 S CHEDULING A LGORITHMS ¢ Scheduling the request across private and public Cloud resources ¢ Two well-know algorithms where requests are allowed to leap forward in the queue  Conservative backfilling  Selective backfilling XFactor = W i + T i T i ¢ VM Checkpointing  VM stops working for the unavailability period  The request is started from where it left off when the node becomes available again 13

IEEE CloudCom 2012 P ERFORMANCE E VALUATION ¢ CloudSim Simulator ¢ Performance Metrics  Deadline violation rate  Slowdown M Slowdown = 1 W i + max ( T i , bound ) X max ( T i , bound ) M i =1  Cloud Cost on EC2 Cost pl = ( H pl + M pl · H u ) C n + ( M pl · B in ) C x  Workload Model ¢ Parallel jobs model of a multi-cluster system (i.e., DAS-2) Input Parameters Distribution/Value Inter-arrival time Weibull ( α = 23 . 375 , 0 . 2 ≤ β ≤ 0 . 3 ) No. of VMs Loguniform ( l = 0 . 8 , m, h = log 2 N s , q = 0 . 9 ) Request duration Lognormal ( 2 . 5 ≤ µ ≤ 3 . 5 , σ = 1 . 7 ) P 1 0.02 P 2 0.78 14

IEEE CloudCom 2012 P ERFORMANCE E VALUATION ( CONT .) ¢ Failures from Failure Trace Archive (FTA)  Grid’5000 traces ¢ 18-month ¢ 800 events/node ¢ Average availability: 22.26 hours ¢ Average unavailability: 10.22 hours ¢ Synthetic Deadline ( st i + ( f · ta i ) , if [ st i + ( f · ta i )] < ct i d i = ct i , otherwise  f : stringency factor  f >1 is normal deadline (e.g., f =1.3) ¢ N s = N c = 64 15

IEEE CloudCom 2012 S IMULATION R ESULTS ¢ Violation rate Request arrival rate Request size 16 Request duration

IEEE CloudCom 2012 S IMULATION R ESULTS ( CONT .) ¢ Slowdown Request size Request arrival rate 17 Request duration

IEEE CloudCom 2012 S IMULATION R ESULTS ( CONT .) ¢ Cloud Cost on EC2 Request arrival rate Request size 18 Request duration

IEEE CloudCom 2012 C ONCLUSIONS ¢ QoS-based resource provisioning in a failure- prone hybrid Cloud system ¢ Three different flexible brokering strategies based on failure correlation and workload model ¢ Knowledge free approach ¢ Using time-based strategy (high load),  20% violation rate  ~1200 USD per month on EC2 ¢ Future Work  Use a set of real workflow applications from the AURIN project and run real experiments. 19

IEEE CloudCom 2012 20

H YBRID C LOUD R ESOURCE P ROVISIONING P OLICY IN THE P RESENCE OF R - PowerPoint PPT Presentation

H YBRID C LOUD R ESOURCE P ROVISIONING P OLICY IN THE P RESENCE OF R ESOURCE F AILURES Bahman Javadi University of Western Sydney, Australia Jemal Abawajy Deakin University, Australia 1 Richard O. Sinnott The University of Melbourne,

Hello! My name is... Buffy Automatic TV series Naming of Characters in TV Video , by M.

Im Improving g Res esource av availa labi bili lity ty i in CER ERN C N Clo loud ud

O PEN L EARNING S EMINAR P ROVISIONING @ U NISA P RINCIPLES OF O PEN L EARNING O PEN L EARNING P

CIENCE P P OLICY OLICY AND AND I I NTERNA TIONAL NTERNATIONAL S CIENTIFIC CIENTIFIC C C OLLABORA

UNIVERSAL ACCESS TO CARE WORK GROUP 2018 1 LPRO : L EGISLATIVE P OLICY AND R ESEARCH O FFICE

Loud Voices in the China Field Loud Voices in the China Field A recent debate in Eurasian

C loud A pp P rofiler: T elco C loud A pplications T racing and M onitoring CTPD Project By:

T YPE -G UIDED W ORST -C ASE I NPUT G ENERATION Di Wang , Jan Hoffmann Carnegie Mellon

Resource Efficiency in the Cloud I N C LOUD C OMPUTING Neeraj Kulkarni, Feng Qi, Glyfina Fernando

I N C LOUD C OMPUTING Christina Delimitrou Stanford University Defense May 26 th

T-90 Tank Column near Luhansk T-90 SighPngs in Ukraine

A A novel hyb ybrid distributed-ro routing and SDN N solution for traffic engineering ANR

7.-9. September 2011 Hamburg Ernst M. Mutke Technical Director HMK Supercomputing GmbH Agenda

A N EW A PPROACH OF Z ONAL H YBRID RANS-LES B ASED ON A T WO - EQUATION k M ODEL [2] L ARS D

F ROM S TO HA P HASE 1 Writing the stochastic -calculus code. G =? b k b . G b k p p .

UNIVERSAL ACCESS TO CARE WORK GROUP 2018 1 LPRO : L EGISLATIVE P OLICY AND R ESEARCH O FFICE

Agenda Identifying students for additional support Making decisions based on data

Formulations Workshop Outcomes Pediatric Formulation Development: Challenges of Today and

Training, Education, & Staffing: Focus on low & m iddle-incom e countries ( I ndia as a

Statistical Data Mining Definitions Population, Sample, Statistic Simple Statistics

P4P : Provider Portal for P2P Applications Richard Alimi, Doug Pasko, Laird Popkin, Ye Wang,

Supervised Practice Program F LO R I D A B O A R D O F B A R E X A M I N E R S Supervised

Looking through the Higgs portal with exotic Higgs decays Jessie Shelton University of Illinois,

PCORI Engagement Webinar: Emerging Approaches for Structuring Multi-Stakeholder Engagement in

Sambuz

Useful Links

Newsletter

Mail Us

H YBRID C LOUD R ESOURCE P ROVISIONING P OLICY IN THE P RESENCE OF R - PowerPoint PPT Presentation

H YBRID C LOUD R ESOURCE P ROVISIONING P OLICY IN THE P RESENCE OF R ESOURCE F AILURES Bahman Javadi University of Western Sydney, Australia Jemal Abawajy Deakin University, Australia 1 Richard O. Sinnott The University of Melbourne,

Hello! My name is... Buffy Automatic TV series Naming of Characters in TV Video , by M.

Im Improving g Res esource av availa labi bili lity ty i in CER ERN C N Clo loud ud

O PEN L EARNING S EMINAR P ROVISIONING @ U NISA P RINCIPLES OF O PEN L EARNING O PEN L EARNING P

CIENCE P P OLICY OLICY AND AND I I NTERNA TIONAL NTERNATIONAL S CIENTIFIC CIENTIFIC C C OLLABORA

UNIVERSAL ACCESS TO CARE WORK GROUP 2018 1 LPRO : L EGISLATIVE P OLICY AND R ESEARCH O FFICE

Loud Voices in the China Field Loud Voices in the China Field A recent debate in Eurasian

C loud A pp P rofiler: T elco C loud A pplications T racing and M onitoring CTPD Project By:

T YPE -G UIDED W ORST -C ASE I NPUT G ENERATION Di Wang , Jan Hoffmann Carnegie Mellon

Resource Efficiency in the Cloud I N C LOUD C OMPUTING Neeraj Kulkarni, Feng Qi, Glyfina Fernando

I N C LOUD C OMPUTING Christina Delimitrou Stanford University Defense May 26 th

T-90 Tank Column near Luhansk T-90 SighPngs in Ukraine

A A novel hyb ybrid distributed-ro routing and SDN N solution for traffic engineering ANR

7.-9. September 2011 Hamburg Ernst M. Mutke Technical Director HMK Supercomputing GmbH Agenda

A N EW A PPROACH OF Z ONAL H YBRID RANS-LES B ASED ON A T WO - EQUATION k M ODEL [2] L ARS D

F ROM S TO HA P HASE 1 Writing the stochastic -calculus code. G =? b k b . G b k p p .

UNIVERSAL ACCESS TO CARE WORK GROUP 2018 1 LPRO : L EGISLATIVE P OLICY AND R ESEARCH O FFICE

Agenda Identifying students for additional support Making decisions based on data

Formulations Workshop Outcomes Pediatric Formulation Development: Challenges of Today and

Training, Education, &amp; Staffing: Focus on low &amp; m iddle-incom e countries ( I ndia as a

Statistical Data Mining Definitions Population, Sample, Statistic Simple Statistics

P4P : Provider Portal for P2P Applications Richard Alimi, Doug Pasko, Laird Popkin, Ye Wang,

Supervised Practice Program F LO R I D A B O A R D O F B A R E X A M I N E R S Supervised

Looking through the Higgs portal with exotic Higgs decays Jessie Shelton University of Illinois,

PCORI Engagement Webinar: Emerging Approaches for Structuring Multi-Stakeholder Engagement in

Sambuz

Useful Links

Newsletter

Mail Us

Training, Education, & Staffing: Focus on low & m iddle-incom e countries ( I ndia as a