Selective sampling algorithms for cost-sensitive multiclass prediction
Alekh Agarwal
Microsoft Research
Alekh Agarwal Selective sampling for multiclass prediction
Why active learning?
Standard setting: receive randomly sampled examples
[Figure: $P(y = 1 \mid w, x)$ plotted against $w^T x$.]
[Equations partially lost in extraction. Surviving terms: $\sum_{i=1} \exp(v_i)$, $\sum_{j=1} \exp(v_j)$, $\sum_{j=1} \exp(x^T W_j)$.]
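The surviving normalizer, a sum over $j$ of $\exp(x^T W_j)$, is consistent with a multinomial logit (softmax) model, although the full formula did not survive on the slide. The following is a minimal sketch under that assumption; the function name `class_probabilities` and the layout of `W` as one weight vector per class are illustrative choices, not taken from the talk.

```python
import math

def class_probabilities(x, W):
    """Softmax probabilities P(y = i | W, x), assuming a multinomial logit model.

    x: list of d feature values.
    W: list of K weight vectors, each of length d (hypothetical layout).
    """
    # Linear score x^T W_i for each class i.
    scores = [sum(xk * wk for xk, wk in zip(x, w)) for w in W]
    # Subtract the max score before exponentiating for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    # Normalizer: sum_j exp(x^T W_j), up to the common factor exp(-m).
    z = sum(exps)
    return [e / z for e in exps]

# Three classes with scores 0, 1, and 2 for this x.
probs = class_probabilities([1.0, 2.0], [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
```

The returned values sum to one, and a class with a larger linear score receives a larger probability.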
$S^{x}_{W}(i) = -\sum^{K} \ldots$ [remainder lost in extraction]
$\ldots_{i=1}\, S^{x}_{W^*}(i)$ [left-hand side lost in extraction]
[Slide equations lost in extraction. Surviving symbols: $W_t(i)$, $W \in \mathcal{W}$, $F$.]
[Slide equations lost in extraction. Surviving symbol: $M_t^{-1}$.]
[Leading terms lost in extraction.] $\{\ldots\; S^{x_t}_{W^*}(y^*_t) - S^{x_t}_{W^*}(y'_t) \le \epsilon\}$.
$\dfrac{u\, d^2 K}{\ell\, \epsilon^2}$ [surrounding expression lost in extraction]
$y^*_t = \arg\max_{i=1,\ldots,K} S^{x_t}_{W^*}(i)$,
$y'_t = \arg\max_{i \ne y^*_t} S^{x_t}_{W^*}(i)$,
$\hat{y}_t = \arg\max_{i=1,\ldots,K} S^{x_t}_{W_t}(i)$,
$y''_t = \arg\max_{i \ne \hat{y}_t} S^{x_t}_{W_t}(i)$.

$S^{x_t}_{W_t}(\hat{y}_t) - S^{x_t}_{W_t}(y''_t) \le 2\eta_{\mathrm{DGS}} \|x_t\|_{M_t^{-1}}$

$|S^{x_t}_{W_t}(i) - S^{x_t}_{W^*}(i)| \le \eta_{\mathrm{DGS}} \|x_t\|_{M_t^{-1}}$

[A chain of relations among $S^{x_t}_{W_t}(y''_t)$, $S^{x_t}_{W_t}(\hat{y}_t)$, $S^{x_t}_{W^*}(\hat{y}_t)$, and $S^{x_t}_{W^*}(y''_t)$ followed; the relation symbols were lost in extraction.]
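The condition above compares the gap between the two highest estimated scores with a confidence width that scales with $x_t$ and $M_t^{-1}$. The following is a minimal sketch of such a margin-based query test, assuming the per-class scores and $M_t^{-1}$ are already available. The names `should_query` and `eta` and the list-of-rows matrix layout are hypothetical, and reading the garbled width term as the norm $\sqrt{x^T M^{-1} x}$ is itself an assumption.

```python
import math

def should_query(scores, x, M_inv, eta):
    """Return True when the top-two score gap is within the confidence width.

    scores: estimated per-class scores at the current example (K >= 2 values).
    x: feature vector (list of d floats).
    M_inv: d x d inverse matrix as a list of rows (assumed precomputed).
    eta: confidence scaling constant (hypothetical parameter).
    """
    # Best and second-best estimated scores.
    ranked = sorted(scores, reverse=True)
    gap = ranked[0] - ranked[1]
    # Norm of x weighted by M_inv: sqrt(x^T M_inv x).
    Mx = [sum(m_ij * xj for m_ij, xj in zip(row, x)) for row in M_inv]
    width = math.sqrt(sum(xi * v for xi, v in zip(x, Mx)))
    return gap <= 2.0 * eta * width

identity = [[1.0, 0.0], [0.0, 1.0]]
# Gap of about 0.05 against a width of 0.2: the label would be requested.
ambiguous = should_query([0.50, 0.45, 0.05], [1.0, 0.0], identity, eta=0.1)
# Gap of about 0.85 against a width of 0.2: the prediction is trusted.
confident = should_query([0.90, 0.05, 0.05], [1.0, 0.0], identity, eta=0.1)
```

In this sketch a label is requested only when the best and second-best classes cannot be separated at the current confidence level, which is the general idea behind margin-based selective sampling.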
[Partially lost in extraction. Surviving terms: $\epsilon > 0$, $\dfrac{u\, d}{\ell\, \epsilon} \log \dfrac{1}{\ldots}$, $\dfrac{u\, d^2 K}{\ell\, \epsilon^2}$.]
[Figure: $P(y \mid x, w^*)$ plotted against $x^T w^*$, with $2\epsilon_0$ marked.]
[Partially lost in extraction. Surviving terms: $\dfrac{u\, d^2 K}{\ell}$, $N_T$, a factor or exponent of 2, and $T$.]
[Figures: Regret versus number of queries. x-axis: number of queries; y-axis: regret ratio. Methods compared: Passive, Random, BBQ, DGS.]