Learning from Corrupted Binary Labels via Class-Probability Estimation
Aditya Krishna Menon Brendan van Rooyen Cheng Soon Ong Robert C. Williamson xxx
National ICT Australia and The Australian National University
1 / 57
Learning from Corrupted Binary Labels via Class-Probability - - PowerPoint PPT Presentation
Learning from Corrupted Binary Labels via Class-Probability Estimation Aditya Krishna Menon Brendan van Rooyen Cheng Soon Ong Robert C. Williamson xxx National ICT Australia and The Australian National University 1 / 57 Learning from binary
National ICT Australia and The Australian National University
1 / 57
2 / 57
3 / 57
4 / 57
5 / 57
6 / 57
S ⇠ Dn
7 / 57
S ⇠ Dn S ⇠ Dn
8 / 57
9 / 57
10 / 57
11 / 57
12 / 57
13 / 57
14 / 57
15 / 57
S ⇠ Dn S ⇠ Dn
16 / 57
S ⇠ Dn S ⇠ Dn
17 / 57
18 / 57
19 / 57
20 / 57
21 / 57
Q(x) +α
Q(x) +(1β)
22 / 57
π·η(x) π·η(x)+(1π)·π
23 / 57
nature corruptor class-prob estimator classifier
D D ˆ η
Kernel logistic regression
24 / 57
nature corruptor class-prob estimator classifier
D D ˆ η
Kernel logistic regression
25 / 57
26 / 57
2
I (Lipton et al., 2014, Koyejo et al., 2014) 27 / 57
2
I (Lipton et al., 2014, Koyejo et al., 2014)
28 / 57
29 / 57
noise
nature corruptor class-prob estimator classifier
D D ˆ η ˆ α, ˆ β, ˆ π
sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t))
30 / 57
noise
nature corruptor class-prob estimator classifier
D D ˆ η ˆ α, ˆ β, ˆ π
Kernel logistic regression sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t))
31 / 57
noise
nature corruptor class-prob estimator classifier
D D ˆ η ˆ α, ˆ β, ˆ π
Kernel logistic regression sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t))
32 / 57
noise estimator nature corruptor class-prob estimator classifier
D D ˆ η ˆ α, ˆ β, ˆ π
Kernel logistic regression sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t))
33 / 57
34 / 57
x2X η(x) = 0
x2X η(x) = 1
35 / 57
x2X η(x) = 0
x2X η(x) = 1
36 / 57
x2X η(x)
x2X η(x)
37 / 57
38 / 57
noise estimator nature corruptor class-prob estimator classifier
D D ˆ η ˆ η ˆ α, ˆ β, ˆ π
Range of ˆ η Kernel logistic regression sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t)) 39 / 57
noise estimator nature corruptor class-prob estimator classifier
D D ˆ η ˆ η ˆ α, ˆ β, ˆ π
Range of ˆ η Kernel logistic regression sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t)) 40 / 57
41 / 57
42 / 57
43 / 57
44 / 57
45 / 57
BER(f) = (1α β)1 ·regretD BER(f).
BER(f) ! 0 by class-probability estimation
46 / 57
47 / 57
I because of eigenvector interpretation
I because of nature of φα,β,π
49 / 57
50 / 57
I corrupted data used for training and validation
51 / 57
0.1 0.2 0.3 0.4 0.49 −0.3 −0.2 −0.1 0.1 Ground−truth noise Bias of Estimate segment
Mean Median
0.1 0.2 0.3 0.4 0.49 −0.2 −0.125 −0.05 0.025 0.1 Ground−truth noise Bias of Estimate spambase
Mean Median
0.1 0.2 0.3 0.4 0.49 −0.04 −0.025 −0.01 0.005 0.02 Ground−truth noise Bias of Estimate mnist
Mean Median
52 / 57
53 / 57
54 / 57
noise estimator nature corruptor class-prob estimator classifier
D D ˆ η ˆ η ˆ α, ˆ β, ˆ π
Range of ˆ η Omit for BER Kernel logistic regression sign( ˆ η(x)φ ˆ
α, ˆ β, ˆ π(t))
55 / 57
I c.f. non-robustness of convex surrogate minimisation 56 / 57
1Drop by the poster for more (Paper ID 69)
57 / 57