BREAK POINTS BREAK POINTS Matthieu R Bloch Tuesday, March 3, 2020 - - PowerPoint PPT Presentation

break points break points
SMART_READER_LITE
LIVE PREVIEW

BREAK POINTS BREAK POINTS Matthieu R Bloch Tuesday, March 3, 2020 - - PowerPoint PPT Presentation

BREAK POINTS BREAK POINTS Matthieu R Bloch Tuesday, March 3, 2020 1 LOGISTICS LOGISTICS TAs and Office hours Tuesday: Dr. Bloch (College of Architecture Cafe) - 11am - 11:55am Tuesday: TJ (VL C449 Cubicle D) - 1:30pm - 2:45pm Thursday:


slide-1
SLIDE 1

Matthieu R Bloch Tuesday, March 3, 2020

BREAK POINTS BREAK POINTS

1

slide-2
SLIDE 2

LOGISTICS LOGISTICS

TAs and Office hours Tuesday: Dr. Bloch (College of Architecture Cafe) - 11am - 11:55am Tuesday: TJ (VL C449 Cubicle D) - 1:30pm - 2:45pm Thursday: Hossein (VL C449 Cubicle B): 10:45pm - 12:00pm Friday: Brighton (TSRB 523a) - 12pm-1:15pm Projects Proposals due Friday March 13, 2020 Midterm Thursday March 5th Sample midterm posted (do not share) Open notes but not open electronics (no space on the desks :()

2

slide-3
SLIDE 3

RECAP: DICHOTOMY RECAP: DICHOTOMY

The union bound naturally depends on the size of , but this is not what matters We will consider the number of hypotheses that lead to distinct labeling for a dataset

  • Definition. (Dichotomy)

For a dataset and set of hypotheses , the set of dichotomies generated by

  • n

is By definition and in general

  • Definition. (Growth function)

For a set of hypotheses , the growth function of is The growth function does not depend on the datapoints and

H D ≜ {xi}N

i=1

H H D H({ ) ≜ {{h( ) : h ∈ H} xi}N

i=1

xi }N

i=1

|H({ )| ≤ xi}N

i=1

2N |H({ )| ≪ |H| xi}N

i=1

H H (N) ≜ |H({ )| mH max

{xi}N

i=1

xi}N

i=1

{xi}N

i=1

(N) ≤ mH 2N

3

slide-4
SLIDE 4

RECAP: EXAMPLES RECAP: EXAMPLES

Positive rays: Positive intervals:

H ≜ {h : R → {±1} : x ↦ sgn (x − a)|a ∈ R} = N + 1 mH H ≜ {h : R → {±1} : x ↦ 1{x ∈ [a; b]} − 1{x ∉ [a; b]}|a < b ∈ R} = ( ) + 1 = + N + 1 mH N + 1 2 1 2 N 2 1 2

4

slide-5
SLIDE 5

EXAMPLES EXAMPLES

Convex sets: because we can generate all dichotomies

  • Definition. (Shattering)

If can generate all dichotomies on , we say that shatters

H ≜ {h : → {±1}|{x ∈ : h(x) = +1} is convex} R2 R2 (N) = mH 2N H {xi}N

i=1

H {xi}N

i=1

5

slide-6
SLIDE 6

6

slide-7
SLIDE 7

EXAMPLES EXAMPLES

Linear classifiers: The growth function is a worst case measure, hence 4 points cannot always be shattered and

H ≜ {h : → {±1} : x ↦ sgn ( x + b)|w ∈ , b ∈ R} R2 w⊺ R2 (3) = 8 mH (4) = 14 < mH 24

7

slide-8
SLIDE 8

8

slide-9
SLIDE 9

WISHFUL THINKING WISHFUL THINKING

The growth function can sometimes be smaller than What if we could use instead of in our analysis? We know that with probability at least What if we could show that with probability at least Our ability to generalize then depend on the behavior of the growth function If is exponential in , we can’t say much If is polynomial in , then for large enough

mH |H| mH |H| 1 − δ R( ) ≤ ( ) + h∗ R ˆN h∗ log 1 2N 2|H| δ − − − − − − − − − − √ 1 − δ R( ) ≤ ( ) + ? h∗ R ˆN h∗ log 1 2N 2 (N) mH δ − − − − − − − − − − − − − √ (N) mH N (N) mH N R( ) ≈ ( ) h∗ R ˆN h∗ N

9

slide-10
SLIDE 10

10

slide-11
SLIDE 11

BREAK POINTS BREAK POINTS

  • Definition. (Break point)

If not data set of size can be shattered by , then is a break point for If is a break point then Examples Positive rays: break point Postive Intervals: break point Convex sets: break point Linear classifiers: break point We will spend some time proving the following result Proposition. If there exists any break point for , then is polynomial in If there is no break point for , then

k H k H k (k) < mH 2k k = 2 k = 3 k = ∞ k = 4 H (N) mH N H (N) = mH 2N

11

slide-12
SLIDE 12

BREAK POINTS BREAK POINTS

Definition. Assume has break point . is the maximum number of dichotomies of points such that no subset of size can be shattered by the dichotomies. is a purely combinatorial quantity, does not depend on Example. Assume has break point . How many dichotomies can we generate of set of size 3? By definition, if is a break point for , then

H k B(N, k) N k B(N, k) H H 2 k H (N) ≤ B(N, k) mH

12

slide-13
SLIDE 13

13

slide-14
SLIDE 14

SAUER’S LEMMA SAUER’S LEMMA

Lemma. , for , Lemma (Sauer's lemma) is polynomial in

  • f degree

Conclusion: if has a break point, is polynomial in

B(N, 1) = 1 B(1, k) = 2 k > 1 ∀k > 1 B(N, k) ≤ B(N − 1, k) + B(N − 1, k − 1) B(N, k) ≤ ( ) ∑

i=0 k−1

N i B(N, k) N k − 1 H (N) mH N

14

slide-15
SLIDE 15

15

slide-16
SLIDE 16

16

slide-17
SLIDE 17

17

 