

SLIDE 1

December 6th, 2013 - DEIB - PoliMi 1

Automatic clustering of similar VM to improve the scalability of monitoring and management in IaaS cloud infrastructures

  • C. Canali
  • R. Lancellotti

University of Modena and Reggio Emilia Department of Engineering “Enzo Ferrari”

SLIDE 2

WEBLab

  • WEBLab: Web Engineering and Benchmarking Lab
  • Contributing to
    – DIEF - Department of Engineering “Enzo Ferrari” (not only automotive)
    – CRIS - Research Center on Security
  • Research interests
    – Distributed systems
    – Cloud computing
    – Performance / scalability issues
    – Monitoring in distributed systems
    – Security in networked / cloud systems
    – ...

SLIDE 3

Agenda

  • Background and motivation
    – IaaS Cloud
    – Reference scenario
    – Traditional approach vs. clustering
    – Impact on monitoring and management
  • Clustering based on metric correlation
    – Theoretical model(s)
    – Experimental evaluation
  • Clustering based on Bhattacharyya distance
    – Theoretical model(s)
    – Experimental evaluation
  • Conclusion and future work
SLIDE 4

Cloud computing

SLIDE 5

Cloud computing

  • Cloud computing AKA utility computing
  • Access to resources and services:
    – Multiple customers → same provider
    – Leveraging economies of scale
    – No initial cost (pay per use)
    – Exploit virtualization technologies
  • Multiple cloud paradigms:
    – SaaS
    – PaaS
    – IaaS

NOTE: We may still have long-term commitments (e.g. reserved instances)

SLIDE 6

Challenges: monitoring

  • Large data centers (> 10^5 VMs) → huge amount of data
  • Multiple data centers → geographic data exchange
  • VM can be anything → treat VMs as black boxes
  • → Scalability issues

SLIDE 7

Challenges: monitoring

  • Current approach → reduce the amount of data in a uniform way:
    – Reduce sampling frequency
    – Reduce number of metrics considered
  • → Reduced monitoring effectiveness
    – Less information available to take management decisions

SLIDE 8

Challenges: management

  • Large data centers → large optimization problems
    – Too many variables
    – Too many bounds
    – Like a huge multi-dimensional Tetris
  • VM can be anything → treat VMs as black boxes → difficult search for complementary workloads
  • → Scalability issues

SLIDE 9

Challenges: management

  • Current approach → reduce the number of bounds:
    – Assume VM resource utilization is constant over long periods (e.g. day/night)
    – Reduce number of metrics considered
    – Consider only nominal resource utilization
    – Rely on hierarchical management
  • → Reduced management effectiveness
    – No support for fine-grained management
    – Sub-optimal management decisions

SLIDE 10


Exploiting VM similarity

  • No information on VM behavior is used to improve scalability
  • Proposal: automatically cluster VMs with similar behavior
  • Requirements:
    – No human intervention
    – No models for VM classes
    – No crystal ball

SLIDE 11

Improving monitoring scalability

  • Group similar VMs together
  • Elect a few (e.g., 3) cluster representatives
    – Support for Byzantine failures in representatives
  • Detailed monitoring of cluster representatives
  • Reduced monitoring of other VMs

SLIDE 12

Improving monitoring scalability

  • Numeric example
  • Every VM as a black box:
    – 1000 VMs, K metrics, 1 sample/5 min
    → 288·10^3·K samples/day
  • With clustering:
    – 15 clusters, ~67 VMs per cluster
    – 3 representatives per cluster
    → 45 VMs, K metrics, 1 sample/5 min
    – Non-representatives:
    → 955 VMs, K metrics, 1 sample/6 hours
    → in total 16.8·10^3·K samples/day
  • Data collected reduced by about 17:1
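The arithmetic above can be reproduced in a few lines. A minimal pure-Python sketch (function and constant names are illustrative, not from the slides):

```python
# Data-volume arithmetic for the numeric example above.
# K metrics per VM: counts below are per metric (multiply by K).

SAMPLES_PER_DAY_FAST = 24 * 60 // 5   # 1 sample / 5 min -> 288 samples/day
SAMPLES_PER_DAY_SLOW = 24 // 6        # 1 sample / 6 h   -> 4 samples/day

def samples_black_box(n_vms):
    """Every VM monitored in detail."""
    return n_vms * SAMPLES_PER_DAY_FAST

def samples_clustered(n_vms, n_clusters, reps_per_cluster):
    """Detailed monitoring only for the cluster representatives."""
    reps = n_clusters * reps_per_cluster
    return reps * SAMPLES_PER_DAY_FAST + (n_vms - reps) * SAMPLES_PER_DAY_SLOW

full = samples_black_box(1000)            # 288000 (x K)
reduced = samples_clustered(1000, 15, 3)  # 16780  (x K)
ratio = full / reduced                    # roughly 17:1
```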
SLIDE 13

Improving management scalability

  • Server placement and consolidation
  • Build a small consolidation solution
  • Replicate solution as a building block

[Diagram: global problem → building block solution → residual problem solution]

SLIDE 14

Reference scenario

  • IaaS with long-term commitment
    – Amazon Reserved Instances, private cloud
  • Reactive VM relocation
    – Local manager
  • Periodic global consolidation
    – Global optimization

SLIDE 15

Proposed methodology

  • Methodology:
    – Define a quantitative model for VM behavior
    – Cluster similar VMs together
  • Elect a few (e.g., 3) cluster representatives
  • Fine-grained monitoring of cluster representatives
  • Reduced monitoring applied to other VMs
    – Reduced number of metrics
    – Lower sampling frequency

[Diagram: Data samples (time series) → Extract quantitative model of VM behavior → Quantitative model → Clustering → Clustering solution]

SLIDE 16

Design choices

  • How to represent VM behavior?
  • Use correlation between metrics
    – Possible enhancement: use PCA
  • Use probability distribution of metrics
    – Use histograms & Bhattacharyya distance
    – May need to select which information is “useful”
    – Must merge heterogeneous information from multiple metrics
    – May exploit ensemble techniques to provide robust performance
    – Possible enhancement: use histogram smoothing

SLIDE 17

Design choices

  • How to perform clustering?
  • Use k-means
    – When VM behavior is represented as a feature vector
  • Use spectral clustering
    – When VM behavior can be used to compute a distance/similarity between VMs

SLIDE 18

Agenda

  • Background and motivation
    – IaaS Cloud
    – Reference scenario
    – Traditional approach vs. clustering
    – Impact on monitoring and management
  • Clustering based on metric correlation
    – Theoretical model(s)
    – Experimental evaluation
  • Clustering based on Bhattacharyya distance
    – Theoretical model(s)
    – Experimental evaluation
  • Conclusion and future work
SLIDE 19

Theoretical model

  • Extraction of a quantitative model of VM behavior
    – Input: time series of metrics describing the behavior of VM n (X1, ..., Xm)
    – Compute the correlation matrix S_n for each VM n
    – Output: feature vectors V_n

NOTE: We exploit the symmetry of matrix S_n to remove redundant information
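The model-extraction step can be sketched in pure Python: compute pairwise Pearson correlations among the m metric time series of one VM and keep only the upper triangle of S_n, since its symmetry makes the lower half redundant. Names and sample data are illustrative:

```python
# Build a per-VM feature vector from the upper triangle of the
# metric-correlation matrix S_n (pure Python, no external deps).
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def feature_vector(series):
    """series: list of m time series -> upper-triangle correlations."""
    m = len(series)
    return [pearson(series[i], series[j])
            for i in range(m) for j in range(i + 1, m)]

# Example: 3 metrics -> feature vector of length 3*(3-1)/2 = 3
cpu  = [1, 2, 3, 4, 5]
net  = [2, 4, 6, 8, 10]   # perfectly correlated with cpu
disk = [5, 4, 3, 2, 1]    # anti-correlated with cpu
v = feature_vector([cpu, net, disk])   # approx [1.0, -1.0, -1.0]
```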

SLIDE 20

Theoretical model

  • Clustering of VMs
    – Input: feature vectors V_n
    – Clustering based on the k-means algorithm
    – Output: clustering solution
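The clustering step can be sketched as a compact Lloyd's k-means, assuming naive deterministic initialization for reproducibility; a real run would use random restarts, and this is not the slides' own implementation:

```python
# Minimal Lloyd's k-means over feature vectors (pure Python sketch).
def kmeans(points, k, iters=20):
    dims = len(points[0])
    centroids = [list(p) for p in points[:k]]  # naive init: first k points
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        for i, p in enumerate(points):
            labels[i] = min(range(k), key=lambda c: sum(
                (p[d] - centroids[c][d]) ** 2 for d in range(dims)))
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centroids[c] = [sum(p[d] for p in members) / len(members)
                                for d in range(dims)]
    return labels

# Two obvious groups of correlation-based feature vectors:
vs = [[1.0, 0.9], [0.9, 1.0], [-0.8, -0.9], [-0.9, -1.0]]
labels = kmeans(vs, 2)   # the two groups end up in different clusters
```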

SLIDE 21

Case study

  • Data center supporting an e-health Web application
    – Web servers and DBMS
    – 110 VMs
    – 11 metrics for each VM
    – Sampling frequency: 5 min
  • Goal: separate Web servers and DBMS
    – Main metric: purity of clustering
  • Three types of analyses
    – Impact of time series length
    – Impact of filtering techniques
    – Impact of number of nodes
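Purity, the main quality metric here, credits each cluster with its majority ground-truth class and divides by the total number of VMs. A minimal sketch with illustrative labels:

```python
# Purity of a clustering solution against ground-truth classes.
from collections import Counter

def purity(clusters, truth):
    """clusters, truth: same-length label lists -> purity in (0, 1]."""
    total = 0
    for c in set(clusters):
        members = [t for cl, t in zip(clusters, truth) if cl == c]
        total += Counter(members).most_common(1)[0][1]  # majority class size
    return total / len(truth)

truth    = ["web", "web", "web", "db", "db", "db"]
clusters = [0, 0, 1, 1, 1, 1]     # one web VM misplaced
p = purity(clusters, truth)       # (2 + 3) / 6
```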

SLIDE 22

Impact of time series length

  • Reduction of available data → reduction in the purity of clustering
  • Purity > 0.7 for time series > 20 days

SLIDE 23

Impact of filtering techniques

  • Application of data filtering:
    – Remove idle periods in time series
  • Data filtering improves performance
    – Removal of periods providing limited information
  • Purity > 0.8 even for 5-day time series

SLIDE 24

Impact of number of nodes

  Number of VMs   Purity   Clustering time [s]
       10          1.00          49.7
       30          0.86          59.5
       50          0.84          68.6
       70          0.84          78.0
       90          0.83          88.3
      110          0.84          95.3

  • Purity is not adversely affected by the number of VMs
    – Purity ~ 0.85 for 30-110 VMs

SLIDE 25

Proposed enhancement

  • The clustering time grows
    – Linearly with the number of VMs
    – Quadratically with the number of metrics
  • → Potential scalability issue
  • Can we reduce the number of metrics?
  • Can we reduce the quadratic relationship?
SLIDE 26

Proposed enhancement

  • The clustering time grows
    – Linearly with the number of VMs
    – Quadratically with the number of metrics
  • → Potential scalability issue
  • Can we reduce the number of metrics?
    → NO: clustering purity is heavily affected
  • Can we reduce the quadratic relationship?
    → YES: we can exploit PCA techniques

SLIDE 27

Reducing number of metrics

SLIDE 28

PCA-based technique

SLIDE 29

PCA-based technique

  • Building the feature vector:
SLIDE 30

How many principal components?

  • Use of a scree plot
  • 1 component captures ~60% of variance
  • → good enough for us
SLIDE 31

Performance evaluation

SLIDE 32

Performance evaluation

SLIDE 33

Agenda

  • Background and motivation
    – IaaS Cloud
    – Reference scenario
    – Traditional approach vs. clustering
    – Impact on monitoring and management
  • Clustering based on metric correlation
    – Theoretical model(s)
    – Experimental evaluation
  • Clustering based on Bhattacharyya distance
    – Theoretical model(s)
    – Experimental evaluation
  • Conclusion and future work
SLIDE 34

Modeling VM behavior

  • Model based on the probability distribution of resource usage
    – Multiple resources (metrics) considered
  • Histogram for every metric, every VM
    – Normalized histogram (∑h = 1)
    – B: number of buckets (critical)

[Diagram: Data samples → Histograms (VM behavior) → Similarity (distance matrix) → Clustering → Clustering solution]

SLIDE 35

Defining VM similarity

  • Use of the Bhattacharyya distance
    – Determine a distance matrix for each couple of VMs, for each metric m:

  D_m(n1, n2) = −ln( ∑_i √( h_{n1,i} · h_{n2,i} ) )
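The per-metric distance D_m translates directly into pure Python, assuming the two normalized histograms share the same B buckets:

```python
# Bhattacharyya distance between two normalized histograms.
from math import log, sqrt

def bhattacharyya(h1, h2):
    """h1, h2: normalized histograms (each sums to 1) over the same buckets."""
    bc = sum(sqrt(a * b) for a, b in zip(h1, h2))  # Bhattacharyya coefficient
    return -log(bc)

identical = bhattacharyya([0.2, 0.5, 0.3], [0.2, 0.5, 0.3])   # -> 0.0
far       = bhattacharyya([0.9, 0.1, 0.0], [0.0, 0.1, 0.9])   # large distance
```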

SLIDE 36

Merging multi-metric information

  • For each metric we have a different

distance information

  • How to merge the contribution of

each metric?

  • Two solutions:

– Euclidean distance merging – Solve separate clustering problems

and merge clustering solutions (clustering ensemble)

VM behavior Hist Data samp. Similarity

  • Dist. Mat.

Clustering Clust. solution

SLIDE 37

Merging multi-metric information

[Diagram: two merging strategies, Euclidean distance merging vs. clustering ensemble]

SLIDE 38

Euclidean distance merging

  • For each couple of VMs n1, n2:

  D(n1, n2) = √( ∑_m a_m · D_m(n1, n2)² )

  • Open problems:
    – Is it correct to consider every metric together?
    – Is there a way to select the right metrics?
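The merging formula can be sketched with the weights a_m exposed, so that setting a_m = 0 drops a metric and a_m = 1 keeps it; names are illustrative:

```python
# Euclidean merging of per-metric Bhattacharyya distances for one VM pair.
from math import sqrt

def merge_distances(per_metric, weights=None):
    """per_metric: list of D_m values; weights: optional a_m coefficients."""
    if weights is None:
        weights = [1.0] * len(per_metric)
    return sqrt(sum(a * d * d for a, d in zip(weights, per_metric)))

d = merge_distances([0.3, 0.4])              # sqrt(0.09 + 0.16) = 0.5
d_sel = merge_distances([0.3, 0.4], [1, 0])  # metric 2 dropped -> 0.3
```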

SLIDE 39

Choosing the right metrics

  • With Euclidean merging, multiple metrics determine the final distance matrix
  • Not every metric provides significant information
  • Proposal to identify relevant metrics:
    – Consider autocorrelation: ACF decreasing rapidly → random variations
    – Consider the coefficient of variation (CV):
      CV » 1 → spiky and noisy behavior
      CV « 1 → little information provided
  • → Merge information from metrics with
    – ACF decreasing slowly
    – CV ~ 1
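The two selection statistics can be sketched as follows; these are the standard definitions rather than the slides' exact code, and the cutoffs (CV around 1, slowly decaying ACF) remain a tuning choice:

```python
# Coefficient of variation and lag-k autocorrelation of a time series.
from math import sqrt

def cv(x):
    """Coefficient of variation: std / mean (population std)."""
    n = len(x)
    mean = sum(x) / n
    std = sqrt(sum((v - mean) ** 2 for v in x) / n)
    return std / mean

def acf(x, lag):
    """Sample autocorrelation at the given lag."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x)
    num = sum((x[i] - mean) * (x[i + lag] - mean) for i in range(n - lag))
    return num / var

flat  = [10, 10, 10, 10, 10, 10]   # CV = 0: little information, discard
trend = [1, 2, 3, 4, 5, 6, 7, 8]   # ACF decays slowly: keep this metric
cv_flat = cv(flat)
acf_trend = acf(trend, 1)
```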

SLIDE 40

Clustering ensemble

  • Two-step process
  • For every metric m:
    – Compute the Bhattacharyya distance matrix
    – Compute a clustering solution
  • Compute the co-occurrence matrix A
    – For each couple of VMs, count the number of times they are in the same cluster
  • Clustering using matrix A as affinity
  • OK to consider every metric?
    – The quorum-based approach ensures good robustness of results

SLIDE 41

Clustering ensemble: example

Clustering solutions (3 metrics):
         Metric 1   Metric 2   Metric 3
  VM1      CL1        CL2        CL2
  VM2      CL1        CL2        CL1
  VM3      CL2        CL1        CL1
  VM4      CL2        CL1        CL1

Co-occurrence matrix A:
         VM1   VM2   VM3   VM4
  VM1     3     2     0     0
  VM2     2     3     1     1
  VM3     0     1     3     3
  VM4     0     1     3     3
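The co-occurrence step for this example can be sketched in a few lines of pure Python; it reproduces the matrix A from the three per-metric clustering solutions:

```python
# Build the co-occurrence matrix A: A[i][j] counts how many per-metric
# clustering solutions place VM i and VM j in the same cluster.
labels_per_metric = [            # rows: metrics 1-3; columns: VM1-VM4
    ["CL1", "CL1", "CL2", "CL2"],
    ["CL2", "CL2", "CL1", "CL1"],
    ["CL2", "CL1", "CL1", "CL1"],
]

n = 4
A = [[sum(sol[i] == sol[j] for sol in labels_per_metric)
      for j in range(n)] for i in range(n)]
# A[0][1] == 2 (VM1 and VM2 agree on metrics 1 and 2), A[2][3] == 3
```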

SLIDE 42

SLIDE 43

Clustering algorithm

  • Use of a spectral clustering algorithm
    – Input: square, symmetric distance/affinity matrix
    – Output: cluster ID for every VM
  • Additional feature:
    – The number of clusters can be automatically determined through spectral gap analysis

SLIDE 44

Case study

  • IaaS cloud supporting e-health
    – Web servers and DBMS
    – 110 VMs
    – 10 metrics for each VM
    – Sampling frequency: 5 min
    – Euclidean merging of metrics
  • Goal: separate Web servers and DBMS
    – Main metric: purity of clustering
  • Three types of analyses
    – Impact of time series length
    – Impact of metric selection
    – Impact of histogram characteristics

SLIDE 45

Impact of time series length

NOTE: We consider Euclidean distance merging

SLIDE 46

Impact of metric selection

[Chart: clustering purity for single metrics: network I/O, memory paging, number of processes]

SLIDE 47

Impact of metric selection

SLIDE 48

Impact of histogram characteristics

SLIDE 49

Histogram smoothing

  • The Bhattacharyya distance is affected by quantization errors in histograms
    → sensitivity to histogram characteristics
  • Proposal: Gaussian smoothing of histograms before computing the Bhattacharyya distance
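A minimal sketch of the proposed smoothing: convolve each histogram with a small Gaussian kernel (truncated at the histogram edges) and renormalize so the buckets still sum to 1. The kernel width sigma and radius are assumed tuning parameters, not values from the slides:

```python
# Gaussian smoothing of a normalized histogram (pure Python sketch).
from math import exp

def smooth(hist, sigma=1.0, radius=2):
    kernel = [exp(-(k * k) / (2 * sigma * sigma))
              for k in range(-radius, radius + 1)]
    out = []
    for i in range(len(hist)):
        acc = w = 0.0
        for k in range(-radius, radius + 1):
            j = i + k
            if 0 <= j < len(hist):          # truncate at the edges
                acc += hist[j] * kernel[k + radius]
                w += kernel[k + radius]
        out.append(acc / w)
    total = sum(out)
    return [v / total for v in out]         # renormalize: sum stays 1

spiky = [0.0, 1.0, 0.0, 0.0, 0.0]
s = smooth(spiky)   # mass spread over neighbouring buckets, still sums to 1
```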

SLIDE 50

Histogram smoothing

  • No smoothing: Bhattacharyya distance 0.496
  • Smoothing: Bhattacharyya distance 0.297
  • → Reduction by 40%

SLIDE 51

Effect of histogram smoothing

SLIDE 52

Clustering Ensemble

  • Overall goal:
    – Reduce sensitivity to histogram characteristics (number of histogram buckets)
    – No need to select significant metrics
    – No smoothing required
  • Potential drawback:
    – Higher computational cost

SLIDE 53

Clustering Ensemble

SLIDE 54

Clustering Ensemble

  • Major stability improvement
  • Almost insensitive to the number of histogram buckets

[Chart: purity vs. number of buckets, clustering ensemble vs. Euclidean merging]

SLIDE 55

Clustering Ensemble

  • Significant performance penalty:
    – 1 clustering for each metric
    – Typically uses more metrics than Euclidean merging

[Chart: clustering time, clustering ensemble vs. Euclidean merging]

SLIDE 56

Agenda

  • Background and motivation
    – IaaS Cloud
    – Reference scenario
    – Traditional approach vs. clustering
    – Impact on monitoring and management
  • Clustering based on metric correlation
    – Theoretical model(s)
    – Experimental evaluation
  • Clustering based on Bhattacharyya distance
    – Theoretical model(s)
    – Experimental evaluation
  • Conclusion and future work
SLIDE 57

Conclusion and future work

  • Scalability in IaaS cloud systems is an open issue
  • Proposal and analysis of multiple methodologies to improve scalability through clustering of similar VMs:
    – Representing VM behavior using correlation
    – Reduction of correlation data with PCA
    – Representing VM behavior with histograms
    – Euclidean merging of distances
    – Metric selection
    – Histogram smoothing
    – Clustering ensemble

SLIDE 58

Conclusion and future work

  • Experimental results are encouraging
    – Can achieve high clustering purity
    – Can provide accurate clustering even with very short time series
    – Can provide stable results
    – Time for clustering is acceptable
  • This is not a crystal ball
    – But it may be a useful tool to improve monitoring and management of cloud data centers

SLIDE 59

Conclusion and future work

  • Future research directions:
    – Evaluate different models for VM behavior
    – Application of clustering to improve scalability of data center management

SLIDE 60

References

  • Claudia Canali, Riccardo Lancellotti, "Automatic Virtual Machine Clustering based on Bhattacharyya Distance for Multi-Cloud Systems", Proc. of 1st International Workshop on Multi-cloud Applications and Federated Clouds (Multi-Cloud'13), Prague, April 2013
  • Claudia Canali, Riccardo Lancellotti, "Automated Clustering of Virtual Machines based on Correlation of Resource Usage", Journal of Communications Software and Systems (JCOMSS), Vol. 8, No. 4, Dec. 2012
  • Claudia Canali, Riccardo Lancellotti, "Automated Clustering of VMs for Scalable Cloud Monitoring and Management", Proc. of 20th International Conference on Software, Telecommunications and Computer Networks (SOFTCOM'12), Split, Croatia, 11-13 Sept. 2012
  • Claudia Canali, Riccardo Lancellotti, "Exploiting Ensemble Techniques for Automatic Virtual Machine Clustering in Cloud Systems", to appear in Automated Software Engineering
  • Claudia Canali, Riccardo Lancellotti, "Improving Scalability of Cloud Monitoring through PCA-based Clustering of Virtual Machines", to appear in Journal of Computer Science and Technology

SLIDE 61

Automatic clustering of similar VM to improve the scalability of monitoring and management in IaaS cloud infrastructures

  • C. Canali
  • R. Lancellotti

University of Modena and Reggio Emilia Department of Engineering “Enzo Ferrari”