

SLIDE 1

Machine Learning

Computational Learning Theory: Shattering and VC Dimensions

Slides based on material from Dan Roth, Avrim Blum, Tom Mitchell and others

SLIDES 2-3

This lecture: Computational Learning Theory

  • The Theory of Generalization
  • Probably Approximately Correct (PAC) learning
  • Positive and negative learnability results
  • Agnostic Learning
  • Shattering and the VC dimension

SLIDES 4-7

Infinite Hypothesis Space

  • The previous analysis was restricted to finite hypothesis spaces
  • Some infinite hypothesis spaces are more expressive than others
    – E.g., rectangles vs. 17-sided convex polygons vs. general convex polygons
    – Linear threshold functions vs. combinations of LTUs
  • We need a measure of the expressiveness of an infinite hypothesis space other than its size
  • The Vapnik-Chervonenkis dimension (VC dimension) provides such a measure
    – “What is the expressive capacity of a set of functions?”
  • Analogous to |H|, there are bounds for sample complexity using VC(H)

SLIDES 8-16

Learning Rectangles

  • Assume the target concept is an axis-parallel rectangle in the X-Y plane
  • Points inside the rectangle are positive; points outside are negative
  • Given a sample of labeled points, will we be able to learn the target rectangle? Can we come close?

[Figure: an axis-parallel rectangle in the plane, with positive points scattered inside it and negative points outside]
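As an aside (the slides do not prescribe an algorithm here): a classic consistent learner for this class predicts positive exactly inside the tightest axis-parallel rectangle around the positive examples. Since that rectangle always sits inside the target, it never errs on negatives, and with enough samples the region it misses has small probability, which is the PAC sense of "coming close". A minimal Python sketch, with illustrative names:

    def learn_rectangle(examples):
        """examples: list of ((x, y), label) pairs with label in {+1, -1}.
        Returns the tightest axis-parallel rectangle (x_min, x_max, y_min, y_max)
        around the positive points, or None if there are none."""
        positives = [p for p, label in examples if label == +1]
        if not positives:
            return None
        xs = [x for x, _ in positives]
        ys = [y for _, y in positives]
        return (min(xs), max(xs), min(ys), max(ys))

    def predict(rect, point):
        """Label a point +1 iff it falls inside the learned rectangle."""
        if rect is None:
            return -1
        x_min, x_max, y_min, y_max = rect
        x, y = point
        return +1 if (x_min <= x <= x_max and y_min <= y <= y_max) else -1

    data = [((2, 3), +1), ((4, 5), +1), ((3, 4), +1), ((0, 0), -1), ((9, 9), -1)]
    rect = learn_rectangle(data)
    print(rect)                   # (2, 4, 3, 5)
    print(predict(rect, (3, 3)))  # +1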

SLIDES 17-21

Let’s think about expressivity of functions

Suppose we have two points. Can linear classifiers correctly classify any labeling of these points?

There are four ways to label two points, and it is possible to draw a line that separates positive and negative points in all four cases. We say that linear functions are expressive enough to shatter two points.

What about fourteen points?

SLIDES 22-31

Shattering

What about this labeling? This particular labeling of the fourteen points cannot be separated by any line.

Linear functions are not expressive enough to shatter fourteen points, because there is at least one labeling that cannot be separated by them. Of course, a more complex function could separate them.

SLIDES 32-33

Shattering

Definition: A set S of examples is shattered by a set of functions H if for every partition of the examples in S into positive and negative examples there is a function in H that gives exactly these labels to the examples

Intuition: A rich set of functions shatters large sets of points

Example 1: Hypothesis class of left-bounded intervals on the real axis: [0, a) for some real number a > 0

[Figure: points inside [0, a) are labeled positive; points outside the shaded region are labeled negative]

SLIDES 34-44

Left-bounded intervals

Example 1: Hypothesis class of left-bounded intervals on the real axis: [0, a) for some real number a > 0

If we have a set S with only one point:
  • If the point is labeled +, we can find an a that is to the right of that point; the hypothesis [0, a) correctly labels the point as positive
  • If the point is labeled −, we can find an a that is to the left of that point; the hypothesis [0, a) correctly labels the point as negative

Any set of one point can be shattered by the hypothesis class of left-bounded intervals.

Now consider a set S with two points. We can label the points such that no hypothesis in our class can match the labels: label the left point − and the right point +
  • Any a to the left of the + point gives a hypothesis that incorrectly labels that point as negative
  • Any a to the right of the − point gives a hypothesis that incorrectly labels that point as positive

SLIDES 45-47

Shattering

Definition: A set S of examples is shattered by a set of functions H if for every partition of the examples in S into positive and negative examples there is a function in H that gives exactly these labels to the examples

Intuition: A rich set of functions shatters large sets of points

Example 1: Hypothesis class of left-bounded intervals on the real axis: [0, a) for some real number a > 0
  • Sets with one point can be shattered. That is: given one point, for any labeling of the point, we can find a concept in this class that is consistent with it
  • Sets with two points cannot be shattered. That is: given two points, you can label them in such a way that no concept in this class will be consistent with their labeling
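As an editorial aside (not from the slides), both claims can be checked mechanically. Below is a minimal Python sketch of a brute-force shattering test; names are illustrative. On a fixed finite point set, only the position of the threshold relative to the points matters, so a small pool of candidate thresholds stands in for the infinite class.

    from itertools import product

    def is_shattered(points, hypotheses):
        """Return True iff every +/- labeling of `points` is realized by
        some hypothesis in the (finite) pool `hypotheses`."""
        achievable = {tuple(h(x) for x in points) for h in hypotheses}
        return all(labeling in achievable
                   for labeling in product([True, False], repeat=len(points)))

    # Left-bounded intervals [0, a): one candidate threshold around each
    # point realizes every behavior the full class can produce on that set.
    def left_interval(a):
        return lambda x: 0 <= x < a

    pool = [left_interval(a) for a in (0.5, 1.5, 2.5, 3.5)]

    print(is_shattered([1.0], pool))        # True: one point can be shattered
    print(is_shattered([1.0, 2.0], pool))   # False: the labeling (-, +) is unrealizable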

SLIDES 48-52

Shattering

Definition: A set S of examples is shattered by a set of functions H if for every partition of the examples in S into positive and negative examples there is a function in H that gives exactly these labels to the examples

Example 2: Hypothesis class is the set of intervals on the real axis: [a, b], for some real numbers b > a

[Figure: points inside [a, b] are labeled positive; points outside the shaded region are labeled negative]

All sets of one or two points can be shattered, but sets of three points cannot be shattered.

Proof? For any three points, consider the labeling + − + (in left-to-right order): any interval containing both + points must also contain the − point between them.
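The same brute-force test confirms Example 2, reusing is_shattered from the sketch above. The endpoint grid is an assumption; it just needs one candidate between and beyond each pair of points.

    # Intervals [a, b]: a small endpoint grid captures every behavior of
    # the infinite class on these fixed points.
    def interval(a, b):
        return lambda x: a <= x <= b

    grid = (0.5, 1.5, 2.5, 3.5)
    pool = [interval(a, b) for a in grid for b in grid if a <= b]

    print(is_shattered([1.0, 2.0], pool))        # True: two points can be shattered
    print(is_shattered([1.0, 2.0, 3.0], pool))   # False: (+, -, +) is unrealizable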

SLIDES 53-57

Shattering

Definition: A set S of examples is shattered by a set of functions H if for every partition of the examples in S into positive and negative examples there is a function in H that gives exactly these labels to the examples

Example 3: Half-spaces in the plane

Can one point be shattered? Two points? Three points? Can any three points be shattered?
  • Any three points that are not collinear can be shattered

[Figure: half-spaces on a plane shattering three points]

Can four points be shattered?
  • If three of them lie on the same line, label the two outside points + and the inner one −: no half-space can match this labeling
  • Otherwise, if one point lies inside the convex hull (a triangle) of the other three, label the hull points + and the inner one −
  • If all four points lie on the convex hull, label one diagonal pair + and the other pair − (the XOR pattern): no line separates the two pairs

Four points cannot be shattered!

[Figure: half-spaces on a plane failing to shatter four points]
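Half-space shattering can also be checked mechanically. A sketch, assuming SciPy is available and with illustrative names: a labeling is realizable by a half-space iff a small linear program is feasible, so we can test every labeling of a point set.

    from itertools import product
    import numpy as np
    from scipy.optimize import linprog

    def linearly_separable(points, labels):
        """Return True iff some half-space realizes the labeling, i.e. there
        exist (w, b) with label * (w . x + b) >= 1 for every point. This is
        a pure feasibility LP (the objective is constant), and it works in
        any dimension, not just the plane."""
        X = np.asarray(points, dtype=float)
        y = np.asarray(labels, dtype=float)
        n, d = X.shape
        # Variables z = (w_1, ..., w_d, b); constraints -y_i*(x_i . w + b) <= -1.
        A_ub = -y[:, None] * np.hstack([X, np.ones((n, 1))])
        res = linprog(c=np.zeros(d + 1), A_ub=A_ub, b_ub=-np.ones(n),
                      bounds=[(None, None)] * (d + 1))
        return res.success

    def shattered_by_half_spaces(points):
        return all(linearly_separable(points, labels)
                   for labels in product([+1, -1], repeat=len(points)))

    print(shattered_by_half_spaces([(0, 0), (1, 0), (0, 1)]))          # True
    print(shattered_by_half_spaces([(0, 0), (1, 0), (0, 1), (1, 1)]))  # False: XOR fails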

SLIDES 58-62

Shattering: The adversarial game

You: Hypothesis class H can shatter these d points.
Adversary: That’s what you think! Here is a labeling that will defeat you.
You: Aha! There is a function h ∈ H that correctly predicts your evil labeling.
Adversary: Argh! You win this round. But I’ll be back…

SLIDES 63-64

Some functions can shatter infinitely many points!

If arbitrarily large finite subsets of the instance space X can be shattered by a hypothesis space H, then H can shatter sets of unbounded size.

An unbiased hypothesis space H shatters the entire instance space X, i.e., it can induce every possible partition on the set of all possible instances.

The larger the subset of X that can be shattered, the more expressive the hypothesis space is, i.e., the less biased it is.

Intuition: A rich set of functions shatters large sets of points

SLIDES 65-67

Vapnik-Chervonenkis Dimension

Recall: A set S of examples is shattered by a set of functions H if for every partition of the examples in S into positive and negative examples there is a function in H that gives exactly these labels to the examples

Definition: The VC dimension of hypothesis space H over instance space X is the size of the largest finite subset of X that is shattered by H

  • If there exists any subset of size d that can be shattered, then VC(H) ≥ d
    – Even one such subset will do
  • If no subset of size d can be shattered, then VC(H) < d

SLIDE 68

What we have managed to prove

Concept class             VC dimension   Why?
Half intervals            1              There is a dataset of size 1 that can be shattered; no dataset of size 2 can be shattered
Intervals                 2              There is a dataset of size 2 that can be shattered; no dataset of size 3 can be shattered
Half-spaces in the plane  3              There is a dataset of size 3 that can be shattered; no dataset of size 4 can be shattered
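The lower bounds in this table can also be searched for mechanically. A sketch reusing is_shattered, left_interval, and interval from the earlier sketches; it brute-forces subsets of a candidate pool, so it can only certify lower bounds on VC(H):

    from itertools import combinations

    def vc_lower_bound(pool_points, hypotheses):
        """Size of the largest subset of `pool_points` shattered by the
        hypothesis pool. A subset of a shattered set is itself shattered,
        so we can stop at the first size with no shattered subset. Larger
        shattered sets may exist outside the candidate pool."""
        best = 0
        for d in range(1, len(pool_points) + 1):
            if any(is_shattered(list(s), hypotheses)
                   for s in combinations(pool_points, d)):
                best = d
            else:
                break
        return best

    points = [1.0, 2.0, 3.0, 4.0]
    grid = (0.5, 1.5, 2.5, 3.5, 4.5)
    print(vc_lower_bound(points, [left_interval(a) for a in grid]))                        # 1
    print(vc_lower_bound(points, [interval(a, b) for a in grid for b in grid if a <= b]))  # 2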

SLIDES 69-73

More VC dimensions

Concept class                            VC dimension
Linear threshold units in d dimensions   d + 1
Neural networks                          Number of parameters
1-nearest neighbor                       Infinite

What is the number of parameters needed to specify a linear threshold unit in d dimensions? d + 1.

Local minima in learning mean that neural networks may not find the best parameters.

Exercise: Try to prove the 1-nearest-neighbor result after we see nearest neighbors.

Intuition: A rich set of functions shatters large sets of points
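A sanity check for the first row, reusing linearly_separable from the half-space sketch: in d dimensions, the origin plus the d unit vectors can be shattered by linear threshold units, so VC ≥ d + 1. (The matching upper bound needs Radon's theorem and is not checked here.)

    from itertools import product

    def shatters_d_plus_one(d):
        """Check that the origin plus the d unit vectors are shattered by
        linear threshold units in d dimensions (so VC >= d + 1)."""
        points = [tuple(0.0 for _ in range(d))]
        points += [tuple(1.0 if j == i else 0.0 for j in range(d))
                   for i in range(d)]
        return all(linearly_separable(points, labels)
                   for labels in product([+1, -1], repeat=d + 1))

    print(shatters_d_plus_one(2))   # True
    print(shatters_d_plus_one(3))   # True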

SLIDE 74

Why VC dimension?

  • Remember sample complexity
    – Occam’s razor
    – Agnostic learning
  • Sample complexity in both cases depends on the log of the size of the hypothesis space
  • For infinite hypothesis spaces, the VC dimension behaves like log(|H|)

SLIDE 75

VC dimension and Occam’s razor for consistent learners

  • Using VC(H) as a measure of expressiveness, we have an Occam theorem for infinite hypothesis spaces
  • Given a sample D with m examples, find some h ∈ H that is consistent with all m examples. If

      m > (1/ε) · (8 · VC(H) · log(13/ε) + 4 · log(2/δ))

    then with probability at least 1 − δ, the hypothesis h has error less than ε.

That is, if m is polynomial we have a PAC learning algorithm; to be efficient, we need to produce the hypothesis h efficiently
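For concreteness, a small calculator for this bound (an editorial sketch; the slide does not fix the log base, and base 2 is the usual convention for this theorem):

    from math import ceil, log

    def occam_sample_bound(vc, epsilon, delta):
        """m > (1/eps) * (8 * VC(H) * log2(13/eps) + 4 * log2(2/delta))."""
        return ceil((8 * vc * log(13 / epsilon, 2) + 4 * log(2 / delta, 2)) / epsilon)

    # e.g. intervals (VC = 2), 10% error, 95% confidence:
    print(occam_sample_bound(vc=2, epsilon=0.1, delta=0.05))   # 1337 examples suffice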

SLIDE 76

VC dimension and Agnostic Learning

A similar statement holds in the agnostic setting as well. If we have m examples, then with probability 1 − δ, the true error of a hypothesis h with training error err_S(h) is bounded by

    err_D(h) ≤ err_S(h) + √[ ( VC(H) · (ln(2m/VC(H)) + 1) + ln(4/δ) ) / m ]

(Phew!)
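A small sketch of how this bound behaves (illustrative only): the square-root term shrinks like 1/√m and grows with VC(H).

    from math import log, sqrt

    def vc_generalization_gap(vc, m, delta):
        """The square-root term above: bound on err_D(h) - err_S(h)."""
        return sqrt((vc * (log(2 * m / vc) + 1) + log(4 / delta)) / m)

    # The gap shrinks as the sample grows:
    for m in (100, 1000, 10000):
        print(m, round(vc_generalization_gap(vc=3, m=m, delta=0.05), 3))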

SLIDE 77

Exercises

  • What is the VC dimension of axis-parallel rectangles (which we saw at the beginning of this lecture)?
  • Your homework asks you to compute the VC dimension of different classes of functions

SLIDE 78

PAC learning: What you need to know

  • What is PAC learning?
    – Remember: We care about generalization error, not training error
  • Finite hypothesis spaces
    – Connection between the size of the hypothesis space and sample complexity
    – Derive and understand the sample complexity bounds
    – Count the number of hypotheses in a hypothesis class
  • Infinite hypothesis classes
    – What is shattering and VC dimension?
    – How to find the VC dimension of simple concept classes?
    – Higher VC dimension ⇒ higher sample complexity

SLIDE 79

Computational Learning Theory

  • Probably Approximately Correct (PAC) learning
    – A general definition that assumes a fixed, but perhaps unknown, distribution
  • Occam’s razor for consistent learners in finite hypothesis spaces
    – Positive and negative learnability results in this setting
  • Agnostic Learning and the associated Occam razor
  • Shattering and the VC dimension
  • Many extensions to the theory exist
    – Noisy data, known data distributions, probabilistic models
    – One important extension: PAC-Bayes theory, which makes assumptions about the prior distribution over hypothesis spaces

SLIDES 80-81

Why computational learning theory?

  • Answers questions such as
    – What is learnability? How good is my class of functions?
    – Is a concept learnable? How many examples do I need?
  • Mistake bounds imply PAC-learnability
  • Raises interesting theoretical questions
    – If a concept class is weakly learnable (i.e., there is a learning algorithm that can produce a classifier that does slightly better than chance), does this mean that the concept class is strongly learnable? (Boosting)
    – We have seen bounds of the form: true error < training error + (a term with ε, δ and the VC dimension). Can we use this to define a learning algorithm? (The Structural Risk Minimization principle and Support Vector Machines)