Learning From Data Lecture 6 Bounding The Growth Function
Bounding the Growth Function Models are either Good or Bad The VC Bound - replacing |H| with mH(N)
- M. Magdon-Ismail
CSCI 4100/6100
Learning From Data Lecture 6 Bounding The Growth Function Bounding - - PowerPoint PPT Presentation
Learning From Data Lecture 6 Bounding The Growth Function Bounding the Growth Function Models are either Good or Bad The VC Bound - replacing |H| with m H ( N ) M. Magdon-Ismail CSCI 4100/6100 recap: The Growth Function m H ( N ) A new
Bounding the Growth Function Models are either Good or Bad The VC Bound - replacing |H| with mH(N)
CSCI 4100/6100
recap: The Growth Function mH(N)
A new measure for the diversity of a hypothesis set. H(x1, . . . , xN) = {(h(x1), . . . , h(xN))} The dichotomies (N-tuples) H implements on x1, . . . , xN.
H H viewed through D
x1,...,xN |H(x1, . . . , xN)|.
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 2 /31
Example growth functions − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 3 /31
Quiz I − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 4 /31
Answer − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 5 /31
Quiz II − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 6 /31
Answer − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 7 /31
Quiz III − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 8 /31
Answer − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 9 /31
Quiz IV − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 10 /31
Answer − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 11 /31
Combinatorial puzzle again − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 12 /31
Can’t add a 6th dichotomy − →
A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 13 /31
B(N, K) − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 14 /31
B(4, 3) − →
A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 15 /31
Two kinds of dichotomys − →
A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 16 /31
Reorder the dichotomys − →
β: prefix appears twice
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 17 /31
Bound for α + β − →
A list on 3 points, with no 3 shattered (why?)
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 18 /31
Bound for β − →
If 2 points are shattered, then using the mirror di- chotomies you shatter 3 points (why?)
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 19 /31
Combine the bounds − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 20 /31
Simple boundary cases − →
(why?)
(why?)
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 21 /31
Getting B(3, 2) − →
ց ↓
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 22 /31
Filling the table − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 23 /31
B(N, k) ≤
k−1
N i
→
k−1
Proof: (Induction on N.)
1
k−1
N
i
Lemma. N
k
N
k−1
N+1
k
B(N + 1, k) ≤ B(N, k) + B(N, k − 1) ≤
k−1
N
i
k−2
N
i
k−1
N
i
k−1
N
i−1
1 +
k−1
N
i
N
i−1
1 +
k−1
N+1
i
=
k−1
N+1
i
A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 24 /31
mH(N) ≤ B(N, k) − →
Consider any k points. They cannot be shattered (otherwise k woud not be a break point). B(N, k) is largest such list. mH(N) ≤ B(N, k)
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 25 /31
Once broken, forever polynomial − →
Once bitten, twice shy . . . Once Broken, Forever Polynomial
k−1
k−1
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 26 /31
There’s good, bad, no ugly − →
N log mH(N) the good H the bad H the ugly H N mH(N) 1 2 3 4 5 · · · 2-D perceptron 2 4 8 14 · · · · · · ≤ N3 + 1 1-D pos. ray 2 3 4 5 · · · · · · ≤ N1 + 1 2-D pos. rectangles 2 4 8 16 < 25 · · · ≤ N4 + 1
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 27 /31
We have a bound on mH; next: |H| ← mH? − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 28 /31
Ghost ‘test’ set D′ represents Eout − →
Age Income Age Income
Age Income
in Eout Probability distribution
in
Ein E′
in
E′
in is like a test error on N new points.
Ein deviates from Eout implies Ein deviates from E′
in.
Ein and E′
in have the same distribution.
P[(E′
in(g), Ein(g)) “deviate”] ≥ 1 2 P [(Eout(g), Ein(g)) “deviate”]
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 29 /31
D ∪ D′ = ⇒ mH(2N) − →
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 30 /31
The VC-Bound − →
N log 4mH(2N) δ
c A M L Creator: Malik Magdon-Ismail
Bounding the Growth Function: 31 /31