Empirical Evaluation of Workload Forecasting Techniques for Predictive Cloud Resource Scaling
In Kee Kim, Wei Wang, Yanjun (Jane) Qi, and Marty Humphrey Computer Science @ University of Virginia
Empirical Evaluation of Workload Forecasting Techniques for - - PowerPoint PPT Presentation
Empirical Evaluation of Workload Forecasting Techniques for Predictive Cloud Resource Scaling In Kee Kim , Wei Wang, Yanjun (Jane) Qi, and Marty Humphrey Computer Science @ University of Virginia Motivation Cloud Resource Scaling Approach
In Kee Kim, Wei Wang, Yanjun (Jane) Qi, and Marty Humphrey Computer Science @ University of Virginia
Reactive Auto Scaling
[AWS, Google, Azure, etc.] Autoscaling based on Resource Utilization: CPU, Memory, Network-I/O…
2
Reactive Auto Scaling
[AWS, Google, Azure, etc.] Autoscaling based on Resource Utilization: CPU, Memory, Network-I/O…
Resource Demand Time
3
Reactive Auto Scaling
[AWS, Google, Azure, etc.] Autoscaling based on Resource Utilization: CPU, Memory, Network-I/O…
Number of Instances Resource Demand Time
4
Reactive Auto Scaling
[AWS, Google, Azure, etc.] Autoscaling based on Resource Utilization: CPU, Memory, Network-I/O…
Number of Instances Resource Demand Time
Scaling Delays Scaling Delays
5
Reactive Auto Scaling
[AWS, Google, Azure, etc.] Autoscaling based on Resource Utilization: CPU, Memory, Network-I/O…
Predictive Resource Scaling
Resource Scaling based on forecasting:
Number of Instances Resource Demand Time
Scaling Delays Scaling Delays
6
Reactive Auto Scaling
[AWS, Google, Azure, etc.] Autoscaling based on Resource Utilization: CPU, Memory, Network-I/O…
Predictive Resource Scaling
Resource Scaling based on forecasting:
Number of Instances Resource Demand Time
Scaling Delays Scaling Delays
7
Predictive Resource Management Engine
Cloud Infrastructure Resource Scaling Workload Predictor
Workload
8
Predictive Resource Management Engine
Resource Scaling Workload Predictor
Regression? Machine Learning? Time Series?
Workload Cloud Infrastructure
9
10
X X X = 4K cases
Realistic Workload Collection of WL Predictor Resource Manager Public Clouds
Naive Regression Time Series Non-temporal Resource Scaling Job Scheduling VM Control
. . .
24 WL patterns 21 Predictors 4 Policies 2 Configs.
11
12
Mean-based Recent-mean
(kNN)
Global Model
(Linear, Quad, Cubic)
Local Model
(Linear, Quad, Cubic)
Smoothing
(WMA, EMA, DES)
Box-Jenkins
( AR, ARMA, ARIMA)
SVMs
(Linear, Gaussian)
Ensemble
(RF, GBM, Exts)
Decision Tree
13
t
Compute
Inactivity Period
t
Compute
t
Compute
t
Compute
On and Off (Batch/Scientific) Growing (Emerging Service) Random/Unpredictable (Media) Cyclic Bursting (E-Commerce)
14
Job Portal
Job (Duration, Deadline)
Workload Repository
Predictor for Scaling-Out Predictor for Scaling-In
Samples for Prediction Prediction Result
Predictive Scaling Module
Job Arrival Info
Predictive Scaler
Predictive Scaling Decision
Job
Job Queue J J J J J J J J Job Exe Job Exe Job Exe
Cloud Resource Management System
Resource Management Module
(e.g. job scheduling, VM scaling, and management) +/- VMs, Job Assign.
Cloud Infrastructure (e.g. AWS, Azure) Workload
Predictor for Scaling-Out Predictor for Scaling-In
Predictive Scaler
15
Workload Repository
16
17
Overall Prediction Accuracy Overall Prediction Overhead (10K Jobs)
18
Overall Prediction Accuracy Overall Prediction Overhead (10K Jobs)
Average: 0.6360 SVMs: 0.37 -- 0.4 (42% less than average)
19
Overall Prediction Accuracy Overall Prediction Overhead (10K Jobs)
kNN: 0.5s for 10K Jobs ARMA: 6032s
20
Workload Rank Predictor MAPE Workload Rank Predictor MAPE
Growing
1
0.28
On/Off
1
0.22 2 AR 0.29 2 ARMA 0.30 3 ARMA 0.30 3
0.44 Avg.
Avg.
Bursty
1 ARIMA 0.38
Random
1
0.45 2 Brown’s DES 0.41 2
0.46 3
0.43 3
0.46 Avg.
Avg.
21
+ Reactive Scaling-In) -- Baseline
+ Predictive Scaling-In)
22
0.0 0.2 0.4 0.6 0.8 1.0 1.2 PR RP PP
Cost DL Miss Rate
0.0 0.2 0.4 0.6 0.8 1.0 1.2 PR RP PP
Cost DL Miss Rate
Baseline (RR)
(a) Hourly Pricing Model (b) Minutely Pricing Model
23
0.0 0.2 0.4 0.6 0.8 1.0 1.2 PR RP PP
Cost DL Miss Rate
0.0 0.2 0.4 0.6 0.8 1.0 1.2 PR RP PP
Cost DL Miss Rate
Baseline (RR)
(a) Hourly Pricing Model (b) Minutely Pricing Model
24
37% 58% 67% 87%
cost DL Miss
0.0 0.2 0.4 0.6 0.8 1.0 1.2 PR RP PP
Cost DL Miss Rate
0.0 0.2 0.4 0.6 0.8 1.0 1.2 PR RP PP
Cost DL Miss Rate
Baseline (RR)
Cost: No Improvement (a) Hourly Pricing Model (b) Minutely Pricing Model
25
60% 72%
DL Miss
0.00 0.20 0.40 0.60 0.80 1.00 1.20 Lin-SVM (Best) Average Gau-SVM (Best) Average BRDES (Best) Average Lin-SVM (Best) Average Growing On/Off Bursty Random
Hourly Pricing Model
0.00 0.20 0.40 0.60 0.80 1.00 1.20 AR (Best) Average ARMA (Best) Average BRDES (Best) Average Gau-SVM (Best) Average Growing On/Off Bursty Random
Minutely Pricing Model
Workloads Top 1 Top 2 Top 3
Growing Linear SVM AR ARMA On/Off Gaussian SVM ARMA Linear SVM Bursty ARIMA Brown’s DES Linear SVM Random Gaussian SVM Linear Regression Linear SVM
26
0.00 0.20 0.40 0.60 0.80 1.00 1.20 Lin-SVM (Best) Average Gau-SVM (Best) Average BRDES (Best) Average Lin-SVM (Best) Average Growing On/Off Bursty Random
Hourly Pricing Model
0.00 0.20 0.40 0.60 0.80 1.00 1.20 AR (Best) Average ARMA (Best) Average BRDES (Best) Average Gau-SVM (Best) Average Growing On/Off Bursty Random
Minutely Pricing Model
Workloads Top 1 Top 2 Top 3
Growing Linear SVM AR ARMA On/Off Gaussian SVM ARMA Linear SVM Bursty ARIMA Brown’s DES Linear SVM Random Gaussian SVM Linear Regression Linear SVM
27
0.00 0.20 0.40 0.60 0.80 1.00 1.20 Lin-SVM (Best) Average Gau-SVM (Best) Average BRDES (Best) Average Lin-SVM (Best) Average Growing On/Off Bursty Random
Hourly Pricing Model
0.00 0.20 0.40 0.60 0.80 1.00 1.20 AR (Best) Average ARMA (Best) Average BRDES (Best) Average Gau-SVM (Best) Average Growing On/Off Bursty Random
Minutely Pricing Model
Workloads Top 1 Top 2 Top 3
Growing Linear SVM AR ARMA On/Off Gaussian SVM ARMA Linear SVM Bursty ARIMA Brown’s DES Linear SVM Random Gaussian SVM Linear Regression Linear SVM
28
0.00 0.20 0.40 0.60 0.80 1.00 1.20 Lin-SVM (Best) Average Gau-SVM (Best) Average BRDES (Best) Average Lin-SVM (Best) Average Growing On/Off Bursty Random
Hourly Pricing Model
0.00 0.20 0.40 0.60 0.80 1.00 1.20 AR (Best) Average ARMA (Best) Average BRDES (Best) Average Gau-SVM (Best) Average Growing On/Off Bursty Random
Minutely Pricing Model
Workloads Top 1 Top 2 Top 3
Growing Linear SVM AR ARMA On/Off Gaussian SVM ARMA Linear SVM Bursty ARIMA Brown’s DES Linear SVM Random Gaussian SVM Linear Regression Linear SVM
29
(statistically) accurate predictors.
30
31