XGBOOST: A SCALABLE TREE BOOSTING SYSTEM
ADVISOR: JIA-LING KOH SPEAKER: YIN-HSIANG LIAO 2018/04/17, FROM KDD 2016
XGBOOST: A SCALABLE TREE BOOSTING SYSTEM ADVISOR: JIA-LING KOH - - PowerPoint PPT Presentation
XGBOOST: A SCALABLE TREE BOOSTING SYSTEM ADVISOR: JIA-LING KOH SPEAKER: YIN-HSIANG LIAO 2018/04/17, FROM KDD 2016 Outline Introduction Method Experiment Conclusion 2 Introduction Regression tree CART (Gini) Boosting Ensemble method, an
ADVISOR: JIA-LING KOH SPEAKER: YIN-HSIANG LIAO 2018/04/17, FROM KDD 2016
2
3
4
5
6
7
Objective function
8
Objective function
9
Objective function
10
Objective function
T : number of leaf
11
Objective function
12
Objective function
13
Objective function
The larger the better, might be negative Greedy strategy
14
Objective function
15
Objective function
16
Split Finding
17
Split Finding
When to stop?
18
Split Finding
19
Split Finding
20
Split Finding
21
Split Finding
22
Split Finding
Sort criteria: Missing value last Learn the best direction (of the feature)
23
Split Finding
24
System Design
CSC format (compressed column) Ex: Different blocks can be distributed across machine, stored
25
System Design
26
System Design
27
System Design
Out-of-core computation: Block compression Ex: [0, 2, 2, 0, 1, 2] Block sharding A prefetch thread is assigned to each disk.
28
System Design
29
30
31
32
33
System Design