PAC Learning and The VC Dimension
Rectangle Game

From An Introduction to Computational Learning Theory by Kearns and Vazirani.

Fix a rectangle (unknown to you).

Draw points from some fixed unknown distribution.
You are told the points and whether each is in or out.
You propose a hypothesis.
Your hypothesis is tested on points drawn from the same distribution.
Goal

We want an algorithm that:
- With high probability chooses a hypothesis that is approximately correct.
Minimum Rectangle Learner

Choose the minimum-area rectangle h containing all the positive points.
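This learner fits in a few lines; a minimal sketch (the function and variable names are mine, not from the slides):

```python
# Minimal sketch of the minimum-rectangle learner: the hypothesis h is the
# smallest axis-aligned rectangle containing all positively labeled points.

def learn_rectangle(points, labels):
    """Return (xmin, xmax, ymin, ymax) of the tightest rectangle
    around the positively labeled points."""
    pos = [p for p, y in zip(points, labels) if y == 1]
    xs = [x for x, _ in pos]
    ys = [y for _, y in pos]
    return (min(xs), max(xs), min(ys), max(ys))

def predict(rect, point):
    """Classify a point as +1 exactly when it lies inside the rectangle."""
    xmin, xmax, ymin, ymax = rect
    x, y = point
    return 1 if (xmin <= x <= xmax and ymin <= y <= ymax) else -1

# Example: three positives inside the unknown target, two negatives outside.
train = [(1, 1), (2, 3), (3, 2), (5, 5), (0, 4)]
lbls = [1, 1, 1, -1, -1]
h = learn_rectangle(train, lbls)
print(h)                    # (1, 3, 1, 3)
print(predict(h, (2, 2)))   # 1
print(predict(h, (4, 4)))   # -1
```

Because every positive point must be inside the target rectangle R, this h is always contained in R, which is what the proof below relies on.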
How Good is this?

Derive a PAC bound for fixed:
- R : target rectangle
- D : data distribution
- ε : test error
- δ : probability of failure
- m : number of samples
Proof:

We want to show that, with high probability, the area between R and h, measured with respect to D, is bounded by ε.

Split this error region into four strips, one along each side of R; it suffices to show that each strip has mass at most ε/4.
Define T to be the strip, sweeping down from the top of R, that contains exactly ε/4 of the mass in D, and let T' be the actual strip between the top of R and the top of h.

p(T') > ε/4 = p(T) iff T' contains T, and T' contains T iff none of our m samples are from T.

So: what is the probability that all samples miss T?
The probability that all m samples miss T is (1 − ε/4)^m.

What is the probability that we miss any of the four strips? Use the union bound.
Union Bound: for any events A and B, p(A ∪ B) ≤ p(A) + p(B).
The probability that any of the four regions still has weight greater than ε/4 after m samples is at most 4(1 − ε/4)^m.

If we fix m such that 4(1 − ε/4)^m ≤ δ, then with probability 1 − δ we achieve an error rate of at most ε.
Extra Inequality

Using the common inequality (1 − x) ≤ e^(−x), we can show 4(1 − ε/4)^m ≤ 4e^(−εm/4). Requiring 4e^(−εm/4) ≤ δ obtains a lower bound on the samples: m ≥ (4/ε) ln(4/δ).
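The bound m ≥ (4/ε) ln(4/δ) is easy to evaluate numerically; a small sketch (the helper name is mine):

```python
import math

def rectangle_sample_bound(eps, delta):
    """Smallest integer m with m >= (4/eps) * ln(4/delta),
    which guarantees 4 * exp(-eps * m / 4) <= delta."""
    return math.ceil((4.0 / eps) * math.log(4.0 / delta))

m = rectangle_sample_bound(0.1, 0.05)    # eps = 0.1, delta = 0.05
print(m)                                 # 176
print(4 * (1 - 0.1 / 4) ** m <= 0.05)    # True: the pre-inequality bound holds too
```

Note the exponential dependence on δ is only logarithmic: halving δ adds a constant number of samples, while halving ε roughly doubles m.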
VC Dimension

Provides a measure of the complexity of a "hypothesis space", or the "power" of a "learning machine".

Higher VC dimension implies the ability to represent more complex functions.

The VC dimension is the maximum number of points that can be arranged so that f shatters them.

What does it mean to shatter?
Define: Shattering

A classifier f can shatter a set of points if and only if, for all truth assignments to those points, f gets zero training error.

Example: f(x, b) = sign(x·x − b)
Example Continued

What is the VC dimension of the classifier f(x, b) = sign(x·x − b)?
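The slide leaves the answer open; a brute-force check (searching thresholds b over a grid, which I'm assuming suffices for these points) suggests the answer is 1: one point can be shattered, but two cannot, because the label can only depend on the distance from the origin, and it flips monotonically with that distance.

```python
from itertools import product

def sign(v):
    return 1 if v > 0 else -1

def can_shatter(points, b_candidates):
    """Brute-force: can f(x, b) = sign(x.x - b) realize every labeling?"""
    for labeling in product([1, -1], repeat=len(points)):
        ok = any(all(sign(sum(c * c for c in x) - b) == y
                     for x, y in zip(points, labeling))
                 for b in b_candidates)
        if not ok:
            return False
    return True

bs = [r / 10 for r in range(-50, 51)]              # candidate thresholds
print(can_shatter([(1.0, 0.0)], bs))               # True: one point shattered
print(can_shatter([(1.0, 0.0), (0.0, 2.0)], bs))   # False: (+, -) is impossible
```

The failing labeling is the inner point positive and the outer point negative: that needs b above 4 and below 1 simultaneously.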
VC Dimension of 2D Half-Space

Conjecture: 3.
- Easy proof (lower bound): exhibit three non-collinear points and show that every labeling of them is realized by some half-space.
- Harder proof (upper bound): show that no set of four points can be shattered.
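Both directions can be checked numerically for small point sets; a brute-force sketch over a grid of (w, b) parameters (the grid is an assumption that happens to cover these particular cases):

```python
from itertools import product

def sign(v):
    return 1 if v > 0 else -1

def halfspace_shatters(points, params):
    """Can sign(w.x + b) realize every labeling of the given 2D points?"""
    for labeling in product([1, -1], repeat=len(points)):
        if not any(all(sign(w1 * x + w2 * y + b) == lab
                       for (x, y), lab in zip(points, labeling))
                   for (w1, w2, b) in params):
            return False
    return True

grid = [v / 2 for v in range(-4, 5)]        # -2.0 .. 2.0 in steps of 0.5
params = list(product(grid, grid, grid))

print(halfspace_shatters([(0, 0), (1, 0), (0, 1)], params))          # True
print(halfspace_shatters([(0, 0), (1, 0), (0, 1), (1, 1)], params))  # False
```

The four-point failure is the XOR labeling of the unit square's corners, which no line can separate.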
VC-Dim: Axis Aligned Rectangles

VC dimension conjecture: 4.
- Lower bound: four points arranged in a diamond can be shattered.
- Upper bound (more difficult): among any five points, labeling the four extreme points positive forces the bounding rectangle to contain the fifth, so no five points can be shattered.
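The rectangle case can be checked exactly rather than by grid search: a labeling is realizable by an axis-aligned rectangle iff the bounding box of its positive points excludes every negative point. A sketch of that check (the diamond point set is my choice of witness):

```python
from itertools import product

def rect_consistent(points, labeling):
    """Is the labeling realizable by some axis-aligned rectangle?
    Yes iff the bounding box of the positives contains no negative."""
    pos = [p for p, y in zip(points, labeling) if y == 1]
    if not pos:
        return True  # a tiny rectangle away from all points labels everything -1
    xmin = min(x for x, _ in pos); xmax = max(x for x, _ in pos)
    ymin = min(y for _, y in pos); ymax = max(y for _, y in pos)
    return not any(xmin <= x <= xmax and ymin <= y <= ymax
                   for (x, y), lab in zip(points, labeling) if lab == -1)

def rect_shatters(points):
    return all(rect_consistent(points, lab)
               for lab in product([1, -1], repeat=len(points)))

diamond = [(0, 1), (1, 0), (0, -1), (-1, 0)]
print(rect_shatters(diamond))             # True: 4 points shattered
print(rect_shatters(diamond + [(0, 0)]))  # False: the center ruins shattering
```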
General Half-Spaces in d Dimensions

What is the VC dimension of:
- f(x, {w, b}) = sign(w·x + b)
- x in R^d

Proof (lower bound):
- Pick the point locations {x_1, …, x_n}.
- An adversary gives assignments {y_1, …, y_n}, and you choose the weights {w_1, …, w_d} and b.
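One concrete instantiation of the lower bound (my choice of points and weights; the slide does not spell the construction out): take the d + 1 points {0, e_1, …, e_d}, and for any labels set b = y_0 / 2 and w_i = y_i − b, so that sign(w·e_i + b) = sign(y_i) and sign(b) = y_0.

```python
from itertools import product

def construct_halfspace(labels):
    """Given labels y_0, ..., y_d in {+1, -1} for the d + 1 points
    {0, e_1, ..., e_d}, build (w, b) realizing them:
    b = y_0 / 2 and w_i = y_i - b."""
    b = labels[0] / 2
    w = [y - b for y in labels[1:]]
    return w, b

def realizes(labels):
    w, b = construct_halfspace(labels)
    preds = [1 if b > 0 else -1]                    # prediction at the origin
    preds += [1 if wi + b > 0 else -1 for wi in w]  # prediction at each e_i
    return preds == list(labels)

d = 4
print(all(realizes(lab) for lab in product([1, -1], repeat=d + 1)))  # True
```

Since every labeling of these d + 1 points is realized, the VC dimension is at least d + 1.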
General Half-Spaces

Proof (upper bound): VC-Dim = d + 1
- Observe that, among any d + 2 points, the last point can always be expressed as an affine combination of the other d + 1.
- Such a combination yields a labeling of the d + 2 points that no half-space can realize, so no set of d + 2 points is shattered.