2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Opprentice: Towards Practical and Automatic Anomaly Detection - - PowerPoint PPT Presentation
Opprentice: Towards Practical and Automatic Anomaly Detection - - PowerPoint PPT Presentation
Opprentice: Towards Practical and Automatic Anomaly Detection Through Machine Learning Dapeng Liu , Youjian Zhao, Haowen Xu, Yongqian Sun, Dan Pei, Jiao Luo, Xiaowei Jing, Mei Feng 2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn) KPIs and
KPIs and Anomaly Detection
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
KPIs (Key Performance Indicators): A set of performance measures that evaluate the service quality
Page views (PV) of Baidu 1
KPIs and Anomaly Detection
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
KPIs (Key Performance Indicators): A set of performance measures that evaluate the service quality
Page views (PV) of Baidu
KPI anomalous (unexpected) behaviors Potential failures, bugs, attacks...
2
KPIs and Anomaly Detection
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
KPIs (Key Performance Indicators): A set of performance measures that evaluate the service quality
Page views (PV) of Baidu
KPI anomalous (unexpected) behaviors Potential failures, bugs, attacks... Anomaly detection matters: Find anomalous behaviors of the KPI curve
Diagnose and fix it Avoid further influences and revenue losses
3
KPIs and Anomaly Detection
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
KPIs (Key Performance Indicators): A set of performance measures that evaluate the service quality
Page views (PV) of Baidu
KPI anomalous (unexpected) behaviors Potential failures, bugs, attacks, etc. Anomaly detection matters: Find anomalous behaviors of the KPI curve
Diagnose and fix it Avoid further influences and revenue losses
4
IMC’ 15 Dissecting UbuntuOne: Autopsy of a Global-scale Personal Cloud Back-end IMC’ 15 The Dark Menace: Characterizing Network-based Attacks in the Cloud
How to Build the Anomaly Detection System
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Domain experts (Operators)
- Responsible for the KPIs
- Knowing the KPI behaviors well
Developers
- Building the detection system
- Knowing several anomaly detectors
Simple threshold … Historical Average Wavelet Holt-Winters
5
How to Build the Anomaly Detection System
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators Developers
Describe anomalies
In practice, it is more complex
6
How to Build the Anomaly Detection System
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators Developers
Describe anomalies
Wavelet Moving Average Holt-Winters
… Select detectors & Tune parameters Detection System
In practice, it is more complex
7
How to Build the Anomaly Detection System
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators Developers
Describe anomalies
Wavelet Moving Average Holt-Winters
… Select detectors & Tune parameters Detection System Anomalies
In practice, it is more complex
8
How to Build the Anomaly Detection System
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators Developers
Describe anomalies
Wavelet Moving Average Holt-Winters
… Select detectors & Tune parameters Detection System Anomalies
In practice, it is more complex
9
How to Build the Anomaly Detection System
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators Developers
Describe anomalies
Wavelet Moving Average Holt-Winters
… Select detectors & Tune parameters Detection System Anomalies
Challenges
Selecting and combining suitable detectors are tricky Detectors are not intuitive to tune
2. 3.
Operators have difficulties to precisely and formally define anomalies in advance
1.
10
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
(Operators’ apprentice)
A More Natural Way
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
OP
Opprentice
PV
13
Design Goal
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators
Label Accuracy preference (Precision & recall) Provide Anomaly Detection Opprentice
14
Design Goal
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Operators
Label Accuracy preference (Precision & recall) Provide Anomaly Detection Opprentice
vs.
15
Background and Motivation Key Ideas Results Conclusion
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Outline
16
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Detector model:
17
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
severity =
|𝑤𝑏𝑚𝑣𝑓−𝜈| 𝜏
𝑤𝑏𝑚𝑣𝑓 For example
Detector model:
Historical Average
18
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
severity =
|𝑤𝑏𝑚𝑣𝑓−𝜈| 𝜏
𝑤𝑏𝑚𝑣𝑓 For example
Detector model:
Historical Average sThld
1
19
Anomaly feature
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
severity =
|𝑤𝑏𝑚𝑣𝑓−𝜈| 𝜏
𝑤𝑏𝑚𝑣𝑓 For example
Detector model:
Historical Average sThld
1
20
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Detector Configurations
Time series decomposition HW 0.2 0.2 0.2 HW 0.5 0.7 0.7 Differencing-last day Differencing-last season WMA-WIN30 Differencing-last slot Historical average-4 season EWMA-0,7
Extract features KPI data
(Detectors with different parameters)
21
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Detector Configurations
Time series decomposition HW 0.2 0.2 0.2 HW 0.5 0.7 0.7 Differencing-last day Differencing-last season WMA-WIN30 Differencing-last slot Historical average-4 season EWMA-0,7
Extract features KPI data
(Detectors with different parameters)
22
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Classification in the feature space (Supervised machine learning)
23
Key Ideas
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Classification in the feature space (Supervised machine learning)
Operators
24
Labeling overhead
– Solution: an effective labeling tool
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Address Challenges of Designing Opprentice
25
Labeling overhead
– Solution: an effective labeling tool
Incomplete anomaly types in the historical data
– Solution: incremental re-training with new data
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Address Challenges of Designing Opprentice
26
Labeling overhead
– Solution: an effective labeling tool
Incomplete anomaly types in the historical data
– Solution: incremental re-training with new data
Class imbalance problem
– Solution: adjusting classification threshold (cThld) based on the preference
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Address Challenges of Designing Opprentice
27
Labeling overhead
– Solution: an effective labeling tool
Incomplete anomaly types in the historical data
– Solution: incremental re-training with new data
Class imbalance problem
– Solution: adjusting classification threshold (cThld) based on the preference
Irrelevant and redundant features
– Solution: random forests
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Address Challenges of Designing Opprentice
28
Design Overview
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Training a classifier
See the paper for full details
29
Design Overview
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Training a classifier Detecting anomalies
30
See the paper for full details
Background and Motivation Key Ideas Results Conclusion
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Outline
31
Evaluation
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
32
Evaluation
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
33
Random forests vs. Basic Detectors and Static Combinations
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
basic detector basic detector basic detector
Random forest
34
Evaluation
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
35
Random Forests vs. Other Learning Algorithms
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
(The order of features is based on mutual information)
36
Evaluation
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
37
See the paper for full details
Opprentice as a whole
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Opprentice achieves
40% 23% 110%
more points inside the preference regions than 5-Fold cross-validation
38
Oracle mode (best case) Opprentice 5-Fold
Opprentice as a whole
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Opprentice achieves
40% 23% 110%
more points inside the preference regions than 5-Fold cross-validation
39
Oracle mode (best case) Opprentice 5-Fold
Conclusion
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)
Opprentice is an automatic and accurate machine learning framework for KPI anomaly detection
Opprentice bridges the gap in applying complex detectors in practice
The idea of Opprentice
i.e., using machine learning to model the domain knowledge
could be a very promising way to automate other service managements
Opprentice Defining anomalies Selecting detectors Tuning detectors
40
2015/12/3 Dapeng Liu (liudp10@mails.tsinghua.edu.cn)