Linking losses for density ratio and class-probability estimation
Aditya Krishna Menon Cheng Soon Ong
NICTA and The Australian National University
0 / 34
Linking losses for density ratio and class-probability estimation - - PowerPoint PPT Presentation
Linking losses for density ratio and class-probability estimation Aditya Krishna Menon Cheng Soon Ong NICTA and The Australian National University 0 / 34 Linking losses for density ratio and class-probability estimation Aditya Krishna Menon
NICTA and The Australian National University
0 / 34
Data61 and The Australian National University
0 / 34
1 / 34
1 / 34
−5 5 0.5 1 1.5 2 p q
2 / 34
−5 5 0.5 1 1.5 2 p q r
2 / 34
3 / 34
3 / 34
3 / 34
Ranking Bregman
4 / 34
Ranking Bregman
4 / 34
Ranking Bregman
4 / 34
Ranking Bregman
4 / 34
5 / 34
2 and
Class conditionals
−3 −2 −1 1 2 3 0.2 0.4 0.6 0.8 1 6 / 34
2 and
Class conditionals Marginal and class-probability function
−3 −2 −1 1 2 3 0.2 0.4 0.6 0.8 1 −3 −2 −1 1 2 3 0.2 0.4 0.6 0.8 6 / 34
7 / 34
−3 −2 −1 1 2 3 1 2 3 4 5 6 v ℓ(y, v)
7 / 34
−3 −2 −1 1 2 3 1 2 3 4 5 6 v ℓ(y, v)
5" 10" 1" 20" 10" 6" 4" 30"
7 / 34
8 / 34
Class-probability estimation (CPE)
class-probability function
0.6"
8 / 34
Class-probability estimation (CPE)
class-probability function
0.6"
Density ratio estimation (DRE)
class-conditional density ratio
−5 5 0.5 1 1.5 2 p q r 8 / 34
s∈S
s∈RX L(s;D,ℓ) = Ψ◦η
9 / 34
s∈S
s∈RX L(s;D,ℓ) = Ψ◦η
9 / 34
−3 −2 −1 1 2 3 1 2 3 4 5 6 v ℓ(y, v)
Logistic loss Ψ−1 : v → σ(v)
−3 −2 −1 1 2 3 1 2 3 4 5 6 v ℓ(y, v)
Exponential loss Ψ−1 : v → σ(2v)
−3 −2 −1 1 2 3 1 2 3 4 5 6 v ℓ(y, v)
Square hinge loss Ψ−1 : v → min(max(0,(v+1)/2),1)
10 / 34
s∈S
s∈S
11 / 34
12 / 34
12 / 34
13 / 34
s∈S
s∈S
14 / 34
s∈S
s∈S
14 / 34
s∈S
s∈S
14 / 34
15 / 34
15 / 34
16 / 34
ℓ′(−1,v)
16 / 34
ℓ′(−1,v)
dr (v).
16 / 34
ℓ′(−1,v)
dr (v).
16 / 34
17 / 34
17 / 34
17 / 34
17 / 34
17 / 34
18 / 34
18 / 34
19 / 34
20 / 34
20 / 34
20 / 34
s∗∈RX L(s∗;D,ℓ)
21 / 34
s∗∈RX L(s∗;D,ℓ)
21 / 34
s∗∈RX L(s∗;D,ℓ)
21 / 34
s∗∈RX L(s∗;D,ℓ)
21 / 34
1+z
22 / 34
1+z
22 / 34
1+z
22 / 34
1+z
22 / 34
y (x−z)·f ′′(z)dz.
1+x y 1+y
23 / 34
u 1+u, with dz = du (1+u)2,
x
y
24 / 34
u 1+u, with dz = du (1+u)2,
x
y
x
y (x−u)·f ′′
24 / 34
u 1+u, with dz = du (1+u)2,
x
y
x
y (x−u)·f ′′
24 / 34
u 1+u, with dz = du (1+u)2,
x
y
x
y (x−u)·f ′′
24 / 34
dr (x),Ψ−1 dr (y)
25 / 34
dr (x),Ψ−1 dr (y)
dr (x) = η
25 / 34
dr (x),Ψ−1 dr (y)
dr (x) = η
25 / 34
Bregman
26 / 34
Bregman
26 / 34
27 / 34
28 / 34
28 / 34
28 / 34
0.2 0.4 0.6 0.8 1 2 4 6 8 10 c w(c)
Logistic loss w(c) =
1 2·c·(1−c)
0.2 0.4 0.6 0.8 1 2 4 6 8 10 c w(c)
Exponential loss w(c) =
1 4·c3/2·(1−c)3/2
0.2 0.4 0.6 0.8 1 2 4 6 8 10 c w(c)
Square hinge loss w(c) = 2
29 / 34
ℓ(−1,v) = 1 2 ·v2 and ℓ(1,v) = −v.
30 / 34
31 / 34
Bregman Ranking
32 / 34
33 / 34
1Drop by the poster for more (Paper ID 152)
34 / 34