An Adaptive Technique to Model Virtual Machine Behavior for - - PowerPoint PPT Presentation

an adaptive technique to model virtual machine behavior
SMART_READER_LITE
LIVE PREVIEW

An Adaptive Technique to Model Virtual Machine Behavior for - - PowerPoint PPT Presentation

An Adaptive Technique to Model Virtual Machine Behavior for Scalable Cloud Monitoring C. Canali R. Lancellotti University of Modena and Reggio Emilia Department of Engineering Enzo Ferrari ISCC'14, 24-26 Jun. 2014, Madeira 1


slide-1
SLIDE 1

ISCC'14, 24-26 Jun. 2014, Madeira 1

An Adaptive Technique to Model Virtual Machine Behavior for Scalable Cloud Monitoring

  • C. Canali
  • R. Lancellotti

University of Modena and Reggio Emilia Department of Engineering “Enzo Ferrari”

slide-2
SLIDE 2

ISCC'14, 24-26 Jun. 2014, Madeira 2

Challenge: monitoring

  • Large data centers (> 105 VMs)

huge amount of data →

  • Point of view: IaaS provider

monitoring supporting → infrastructure management

  • VM can be anything

treat VM as black boxes →

  • → Scalability issues

VM VM VM VM VM VM VM VM VM VM VM VM

slide-3
SLIDE 3

ISCC'14, 24-26 Jun. 2014, Madeira 3

Challenge: monitoring

  • Current approach: reduce amount of

data in a uniform way

– Reduce sampling frequency – Reduce number of metrics considered

  • → Reduced monitoring effectiveness

– Less information available for management

  • Solution: Exploit VM similarity

VM VM VM VM VM VM VM VM VM VM VM VM VM VM VM VM

CL1 CL2

slide-4
SLIDE 4

ISCC'14, 24-26 Jun. 2014, Madeira 4

Improving monitoring scalability

  • Group similar VMs together
  • Detailed monitoring of cluster

representatives

  • Reduced monitoring of other VMs
  • → Data collection reduced by one order of

magnitude

VM VM VM VM VM VM VM VM

CL1 CL2

slide-5
SLIDE 5

ISCC'14, 24-26 Jun. 2014, Madeira 5

Challenge: fast identification

  • VM behavior model built starting with

time series of resource usage on VMs

  • Long time series to characterize

VM behavior

→ Highly accurate clustering

  • Clustering accuracy decreased by shorter

time series

→ problems coping with Cloud dynamic behavior

  • Need to combine fast and accurate

identification of VM behavior

slide-6
SLIDE 6

ISCC'14, 24-26 Jun. 2014, Madeira 6

Reference scenario

  • IaaS, medium-long term commitment

– Amazon Reserved instances, private cloud

  • Reactive VM relocation

– Local manager

  • Periodic global consolidation

– Global optimization

slide-7
SLIDE 7

ISCC'14, 24-26 Jun. 2014, Madeira 7

Our proposal: adaptive approach

  • Observation:

– Some VMs are easily identified as belonging

to a cluster even after short observation

– Other VMs require more detail to build a

reliable behavior model

  • Proposal:

– Cluster as fast as possible VMs clearly

belonging to a cluster

– Postpone clustering of VMs when not sure

  • Adoption of fuzzy logic perspective

– Introduce decree of belonging of VM to

clusters to rete reliability of clustering result

– Gray area of uncertain clustering

slide-8
SLIDE 8

ISCC'14, 24-26 Jun. 2014, Madeira 8

Adaptive algorithm

  • Adaptive identification
  • f time series length
  • When clustering is not

ambiguous (white area)

– VM behavior

model is OK

– No update

required

  • When clustering is

ambiguous (gray area)

– Need to improve VM

behavior model

– Further observation

required

slide-9
SLIDE 9

ISCC'14, 24-26 Jun. 2014, Madeira 9

Definition of Gray Area

  • Feature space: k-dimensional space

– Each VM described by a feature vector (point

in feature space)

– Each cluster has a centroid described as a

point in the feature space

  • For each VM n:

– Vector of distances from the cluster centroids

  • Definition of gray area

– A VM is in gray area iif exists a couple of

clusters i, j such that 1−ε< di

n

d j

n< 1

1−ε ,0<ε<1

Dn={d1,

n d2, n …,dC n }

slide-10
SLIDE 10

ISCC'14, 24-26 Jun. 2014, Madeira 10

Definition of Gray Area

  • Higher epsilon

wider gray area →

  • Problem: definition of right value of epsilon

– Open problem, still

working on that...

– Experimental results

suggest ε=0.33 as a rule of thumb

slide-11
SLIDE 11

December 6th, 2013 - DEIB - PoliMi 11

Case study

  • Datacenter supporting a e-health Web

application

– Web server and DBMS – 110 VMs – 11 metrics for each VM, – Sampling frequency: 5 min

  • Goal: separate Web servers and DBMS

– Clustering accuracy – % of VM in gray area

  • 2 VM behavior model approaches

– PCA-based – Bhattacharyya distance-based

slide-12
SLIDE 12

ISCC'14, 24-26 Jun. 2014, Madeira 12

Experimental results

Time series length: 1 day, PCA-based clustering

slide-13
SLIDE 13

ISCC'14, 24-26 Jun. 2014, Madeira 13

Experimental results

  • Validating the choice of epsilon

– For

ε ≥ 0.33 the accuracy is 100% (absence of mis-classified VMs)

– The size of the gray area depends on the

clustering algorithms

PCA-based Bhattacharyya distance-based

slide-14
SLIDE 14

ISCC'14, 24-26 Jun. 2014, Madeira 14

Experimental results

slide-15
SLIDE 15

ISCC'14, 24-26 Jun. 2014, Madeira 15

Conclusion

  • Experimental results are encouraging

– Can achieve high clustering purity – Can provide accurate clustering even with

very short time series

– Works with different clustering algorithms – Adaptive approach to select the time series

length

  • This is not a crystal ball

– But may be a useful tool

to improve monitoring and management of cloud data centers

slide-16
SLIDE 16

ISCC'14, 24-26 Jun. 2014, Madeira 16

On-going works

  • Adaptive selection of the

ε parameter

  • Evaluation with time-series < 24 h
  • Comparison with other fuzzy clustering

algorithms

  • Additional experiments with different

workloads (help appreciated)

slide-17
SLIDE 17

ISCC'14, 24-26 Jun. 2014, Madeira 17

An Adaptive Technique to Model Virtual Machine Behavior for Scalable Cloud Monitoring

  • C. Canali
  • R. Lancellotti

University of Modena and Reggio Emilia Department of Engineering “Enzo Ferrari”