MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
Versions of Random Forests: Properties and Performances Choongsoon Bae
Google Inc. U.C.Berkeley
Versions of Random Forests: Properties and Performances Choongsoon - - PowerPoint PPT Presentation
M OTIVATION CART B AGGING R ANDOM F ORESTS P ERFORMANCES Versions of Random Forests: Properties and Performances Choongsoon Bae Google Inc. U.C.Berkeley March 26, 2009 Joint work with Peter Bickel M OTIVATION CART B AGGING R ANDOM F ORESTS
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
Google Inc. U.C.Berkeley
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
400 makes, models and vehicle types No Yes
Other makes and models No
Yes
Other makes and models No
Yes
Ford F-150
Honda Accord
Ford Taurus Taken from Critical Features of HIgh Performance Decision Trees Salford Systems
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
i
, . . . , X(d)
i
i = 1, . . . , n
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
i
, . . . , X(d)
i
i = 1, . . . , n
ˆ α1, ˆ β1, ˆ γ1 = argmin
(α1,β1,γ1)∈R3 n
∑
i=1
1
i
≤ γ1
i
> γ1
ˆ αd, ˆ βd, ˆ γd = argmin
(αd,βd,γd)∈R3 n
∑
i=1
1
i
≤ γd
i
> γd
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
i
, . . . , X(d)
i
i = 1, . . . , n
ˆ α1, ˆ β1, ˆ γ1 = argmin
(α1,β1,γ1)∈R3 n
∑
i=1
1
i
≤ γ1
i
> γ1
ˆ αd, ˆ βd, ˆ γd = argmin
(αd,βd,γd)∈R3 n
∑
i=1
1
i
≤ γd
i
> γd
ˆ t = argmin
j=1,...,d n
∑
i=1
1
αj1
i
≤ ˆ γj
βj1
i
> ˆ γj
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
i
, . . . , X(d)
i
i = 1, . . . , n
ˆ α1, ˆ β1, ˆ γ1 = argmin
(α1,β1,γ1)∈R3 n
∑
i=1
1
i
≤ γ1
i
> γ1
ˆ αd, ˆ βd, ˆ γd = argmin
(αd,βd,γd)∈R3 n
∑
i=1
1
i
≤ γd
i
> γd
ˆ t = argmin
j=1,...,d n
∑
i=1
1
αj1
i
≤ ˆ γj
βj1
i
> ˆ γj
t)
X(ˆ
t) > ˆ
γˆ
t
X(ˆ
t) ≤ ˆ
γˆ
t
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
X(3)
X(2)
X(4)
X(6)
X(2)
X(1)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
X(3)
X(2)
X(4)
X(6)
X(2)
X(1)
X(1)
X(5)
X(7)
X(4)
X(2)
X(4)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
X(3)
X(2)
X(4)
X(6)
X(2)
X(1)
X(1)
X(5)
X(7)
X(4)
X(2)
X(4)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
X(3)
X(6)
X(2)
X(1)
X(1)
X(5)
X(7)
X(4)
X(2)
X(4)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
X(3)
X(6)
X(2)
X(1)
X(1)
X(4)
X(2)
X(4)
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
X(3)
X(4)
X(1)
X(3)
X(6)
X(2)
X(1)
X(1)
X(4)
X(2)
X(4)
Majority
Majority
Majority
Majority
Majority
Majority
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
M
m=1
j M
m=1
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 CART vs Bagging (n=100,sigma=0.5,split=20, B=100) true CART Bagging Loess
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 2 4 6 8 10 12 CART vs Bagging (n=100,sigma=0.5,split=5, B=100) true CART Bagging Loess
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
M
i=1
M
i=1
i=j
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
M × d F.
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
n → 0, where k is the number of nodes, n is the
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
exp(β×A(k)) ∑M
k=1 exp(β×A(k)) for suitable β.
k=1 ˆ
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
n ∑n i=1 1(φ(Xi) = Yi): empirical loss function
n
n = arg min ˜ φ(k)
n ,k=1,...,K ˜
n ) + P(k, n), where
n(1 + log d) is a penalty term for some sufficiently
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
2L , where µ denotes the Lebesque
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
n) − L(φ∗)
n) − L(φ∗)
d ,
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
3 2d log M
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
5 10 15 20 25 30 35 40 0.89 0.9 0.91 0.92 0.93 0.94 0.95 0.96 0.97 0.98 Random feature subset size (F) Accuracy Accuracy of Random Forests RF RFu RFl CART DAWRF PRF
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
100 200 300 400 500 600 0.95 0.955 0.96 0.965 0.97 0.975 0.98 0.985 0.99 Terminal node size Accuracy Accuracy of Random Forests RF30 RF30u RF30l CART RF Purely RF DAWRT
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.8 0.85 0.9 qn Accyracy Accuracy of BNN BNN BNNl BNNu CARTF
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
2 4 6 8 10 12 14 16 18 20 0.915 0.92 0.925 0.93 0.935 0.94 0.945 0.95 0.955 0.96 0.965 Random feature subset size(F) Accyracy Accuracy of Random Forests RFwob RFwob
u
RFwob
l
DAWRF CART RF PRF
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
100 200 300 400 500 600 0.82 0.84 0.86 0.88 0.9 0.92 0.94 0.96 Terminal node size Accyracy Accuracy of Random Forests RF5 RF5u RF5l CART RF Purely RF DAWRT BNN
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES
MOTIVATION CART BAGGING RANDOM FORESTS PERFORMANCES