iForest: Interpreting Random Forests via Visual Analytics
Xun Zhao, Ya Yanhong Wu, Dik Lun Lee, Weiwei Cui
iForest: Interpreting Random Forests via Visual Analytics Xun Zhao, - - PowerPoint PPT Presentation
iForest: Interpreting Random Forests via Visual Analytics Xun Zhao, Ya Yanhong Wu , Dik Lun Lee, Weiwei Cui Background Random Forest Fraud Detection Medical Diagnosis Churn Prediction 1 Icons created by Anatolii Babii, Atif Arshad, and
Xun Zhao, Ya Yanhong Wu, Dik Lun Lee, Weiwei Cui
1
Fraud Detection Medical Diagnosis Churn Prediction
Icons created by Anatolii Babii, Atif Arshad, and Dinosoft Labs from the Noun Project.
2
3
4
5
6
7
8
9
Source: https://xkcd.com/1838/
10
Reveal the relationships between features and predictions
Icons created by Melvin, alrigel, and Dinosoft Labs from the Noun Project.
Uncover the underlying working mechanisms Provide case-based reasoning
11
12
Data Overview Feature View Decision Path View
14
Provide case-based reasoning
Data Overview Feature View Decision Path View
15
True False True True Positive False Negative False False Positive True Negative Predicted Values Actual Values
16
Positive Negative Panning & Zooming each circle represents a data item Default View
17
Data Overview Feature View Decision Path View
Reveal the relationships between features and predictions
18
each cell illustrates the statistics and importance of a feature
19
Feature A (numerical)
high
20
Feature A (numerical) x = 60
high
21
Feature A (numerical) Split point distribution
22
Feature B (ordinal)
high high
23
Data Overview Feature View Decision Path View
Uncover the underlying working mechanisms
24
25
Positive Negative ration between positive and negative decision paths each circle represents a decision path lasso to select a specific set of paths for exploration
26
Feature Cell: Summarize the feature ranges of the selected paths vertical bar: feature value of the current data item pixel-based bar chart: feature range summary
27
A < 0.5 C > 1.5 C < 3.5
Decision Path I:
C > 2.5 A < 0.5
Decision Path II:
Layer 1 (root) Layer 2 Layer 3
28
A < 0.5 C > 1.5 C < 3.5
Decision Path I:
C > 2.5 A < 0.5
Decision Path II:
Layer 1 (root) Layer 2 Layer 3
29
A < 0.5 C > 1.5 C < 3.5
Decision Path I:
C > 2.5 A < 0.5
Decision Path II:
Layer 1 (root) Layer 2 Layer 3
30
A < 0.5 C > 1.5 C < 3.5
Decision Path I:
C > 2.5 A < 0.5
Decision Path II:
Layer 1 (root) Layer 2 Layer 3
31
Leaf Node Leaf Node
32
34
5 10 15 20 25 30 Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Task 8 Task 9 Task 10
Task Completion Time (seconds)
35
iForest: Interpreting Random Forests via Visual Analytics
Yanhong Wu Email: yanwu@visa.com URL: http://yhwu.me