Summary
◮ Overfitting arises when we evaluate and train on the same data. ◮ We can bound error of a fixed function with Hoeffding’s inequality. ◮ Next lecture we’ll get a version sensitive to function class size.
41 / 61
Summary Overfitting arises when we evaluate and train on the same - - PowerPoint PPT Presentation
Summary Overfitting arises when we evaluate and train on the same data. We can bound error of a fixed function with Hoeffdings inequality. Next lecture well get a version sensitive to function class size. 41 / 61 Part 3. . .
41 / 61
10
10 10
1
C
0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08
misclassification rate
train test 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
. 2
8
. . . 0.400 0.400 . 4 0.800 0.800 0.800 1 . 2 1 . 2
C = 1.481101 , λ = 0.006752 ; train 0.010000 , test 0.060000
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
8 0.000 . . 0.080 0.080 . 1 6 0.160 . 2 4 0.240
C = 0.040000 , λ = 0.250000 ; train 0.050000 , test 0.076000
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
. 6
. 6
8
0.000 0.000 . 0.800 0.800 . 8 0.800 1.600 1.600 2.400
C = 20.480000 , λ = 0.000488 ; train 0.000000 , test 0.076000
42 / 61
i=1(2Zi − 1).
200 400 600 800 1000 60 40 20 20 40 60 80 43 / 61
i=1(2Zi − 1).
200 400 600 800 1000 60 40 20 20 40 60 80
43 / 61
n
n
i=1,
44 / 61
i=1 where Zi := 1[h(Xi) = Yi].
i=1, we can’t guarantee independence of Zi.
45 / 61
i=1 are independent for each
j∈{1,...,k} EZ1,j − 1
n
i=1,
46 / 61
i=1 are independent for each
j∈{1,...,k} EZ1,j − 1
n
i=1,
46 / 61
47 / 61
47 / 61
47 / 61
47 / 61
k
k
47 / 61
k
k
k
k
47 / 61
48 / 61
i=1.
48 / 61
i=1.
48 / 61
i=1.
48 / 61
i=1.
48 / 61
i=1.
48 / 61
i=1,
i=1,
49 / 61
50 / 61
50 / 61
50 / 61
50 / 61
50 / 61
51 / 61
51 / 61
51 / 61
51 / 61
51 / 61
51 / 61
51 / 61
Tx + b ≥ 0] : a ∈ Rd, b ∈ R
52 / 61
Tx + b ≥ 0] : a ∈ Rd, b ∈ R
52 / 61
Tx + b ≥ 0] : a ∈ Rd, b ∈ R
52 / 61
Tx + b ≥ 0] : a ∈ Rd, b ∈ R
52 / 61
Tx + b ≥ 0] : a ∈ Rd, b ∈ R
52 / 61
Tx + b ≥ 0] : a ∈ Rd, b ∈ R
52 / 61
53 / 61
53 / 61
53 / 61
53 / 61
53 / 61
53 / 61
54 / 61
54 / 61
54 / 61
54 / 61
54 / 61
55 / 61
55 / 61
f∈F
n
2).
56 / 61
f∈F
n
2).
56 / 61
f∈F
n
2).
56 / 61
f∈F
n
2).
56 / 61
f∈F
n
2).
56 / 61
57 / 61
57 / 61
57 / 61
57 / 61
57 / 61
57 / 61
57 / 61
f∈F
n
2).
58 / 61
f∈F
n
2).
Tw : w ≤ W}) ≤ RW
58 / 61
f∈F
n
2).
Tw : w ≤ W}) ≤ RW
58 / 61
59 / 61
59 / 61
60 / 61
61 / 61