Testing by Implicit Learning Ilias Diakonikolas Columbia University - PowerPoint PPT Presentation

1 Testing by Implicit Learning Ilias Diakonikolas Columbia University March 2009

2 What this talk is about Recent results on testing some natural types of functions: – Decision trees 1 OR 0 1 0 1 0 1 AND AND AND – DNF formulas, more general Boolean formulas – Sparse polynomials over finite fields Exploiting learning techniques to do testing .

3 Based on joint works with: Homin Lee (Columbia) Kevin Matulef (MIT) Rocco Servedio (Columbia) Krzysztof Onak (MIT) Andrew Wan (Columbia) Ronitt Rubinfeld (MIT and TAU)

4 Take-home message learning there are close connections between these topics approximation testing Seems natural… – Goal of learning is to produce an approximation to the function – Goal of testing is to determine whether function “approximately” has some property

5 Overview of talk 0. Basics of learning, testing, approximation 1. A technique: “testing by implicit learning” a little learning theory a little learning theory + + new testing results for many new testing results for many a little approximation a little approximation classes of functions classes of functions + + testing ideas from [FKRSS04] testing ideas from [FKRSS04] [DLMORSW07] [DLMORSW07] 2. A specific class of functions: sparse polynomials testing learning approximation

6 I. Approximation Given a function goal is to obtain a “simpler” function such that • Measure distance between functions under uniform distribution.

7 Approximation – example Let be any -term DNF formula: There is an -approximating DNF with terms where each term contains variables [V88] • Any term with variables is satisfied with probability • Delete all (at most ) such terms from to get

8 Approximation – example Let be any -term DNF formula: There is an -approximating DNF with terms where each term contains variables [V88] • Any term with variables is satisfied with probability • Delete all (at most ) such terms from to get

9 II. Learning a concept class “PAC learning concept class under the uniform distribution” Setup: Learner is given a sample of labeled examples • Target function is unknown to learner • Each example in sample is independent, uniform over Goal: For every , with probability learner should output a hypothesis such that

10 Learning via “Occam’s Razor” A learning algorithm for is proper if it outputs hypotheses from . Generic proper learning algorithm for any (finite) class : • Draw labeled examples • Output any that is consistent with all examples. finding such an may be computationally hard… Why it works: • Suppose true error rate of is • Then Pr[ consistent with random examples] error So Pr[any “bad” is output] <

11 III. Property testing Goal: infer “global” property of function via few “local” inspections Tester makes black-box queries to arbitrary oracle for distance Tester must output • “yes” whp if • “no” whp if is -far from every Usual focus: information-theoretic # queries required

12 Testing via proper learning [GGR98]: properly learnable � � � � testable with same # queries. • Run algorithm to learn to high accuracy; hypothesis obtained is • Draw random examples, use them to estimate to high accuracy distance Why it works: • � estimated error of is small is far from � estimated error • of is large since is far from Great! But... Even for very simple classes of functions over variables (like literals), any learning algorithm must use examples… and in testing, we want query complexity independent of

13 Some known property testing results Class of functions over # of queries parity functions [BLR93] deg- polynomials [AKK+03] literals [PRS02] conjunctions [PRS02] -juntas [FKRSS04] -term monotone DNF [PRS02] Different algorithm tailored for each of these classes. Question: [PRS02] what about non-monotone -term DNF?

14 New property testing results Theorem: [DLMORSW07] The class of over is testable with poly(s/ ) queries. s-term DNF s-leaf decision trees size-s branching programs size-s Boolean formulas (AND/OR/NOT gates) size-s Boolean circuits (AND/OR/NOT gates) s-sparse polynomials over GF(2) s-sparse algebraic circuits over GF(2) s-sparse algebraic computation trees over GF(2) All results follow from “testing by implicit learning” approach.

15 Overview of talk 0. Some basics 1. A technique: “testing by implicit learning” a little learning theory a little learning theory + + new testing results for many new testing results for many a little approximation a little approximation classes of functions classes of functions + + testing ideas from [FKRSS04] testing ideas from [FKRSS04] [DLMORSW07] [DLMORSW07] Running example: testing whether is an -term DNF versus -far from every -term DNF

16 Straight-up testing by learning? Recall [GGR98]: properly learnable � • testable with same # queries • Occam’s Razor: can properly learn any from examples But for = {all -term DNF over }, this is examples… We want a -query algorithm.

17 Approximation to the rescue? We also have approximation: Take : makes so close to that we can pretend • Given any -term DNF , there is a -approximating DNF with terms where each term contains variables. So can try to learn = {all -term -DNF over } Now Occam requires examples…better, but still depends on

18 Getting rid of ? Each approximating DNF depends only on variables. Suppose we knew those variables. Then we’d have = {all -term -DNF over so Occam would need only examples, independent of ! But, can’t explicitly identify even one variable with examples...

19 The fix: implicit learning High-level idea: Learn the “structure” of without explicitly identifying the relevant variables Algorithm tries to find an approximator where is an unknown mapping.

20 Implicit learning How can we learn “structure” of without knowing relevant variables? Need to generate many correctly labeled random examples of : the -term -DNF approximator for each string is bits Then can do Occam (brute-force search for consistent DNF).

21 Implicit learning cont Vars of are the variables that have high influence in f : flipping the bit is likely to change value of f • setting of other variables almost always doesn’t matter bits Given random -bit labeled 1 0 0 1 1 0 1 1 1 0 1 1 0 0 0 1 1 0 1 0 1 1 0 0 1 1 0 1 1 10 1 0 1 1 1 0 1 example , want to construct -bit example 0 1 1 1 1 0 0 0 Do this using techniques of [FKRSS02] “Testing Juntas”

Use independence test of [FKRSS02] Let be a subset of variables. “Independence test” [FKRSS02]: • Fix a random assignment to variables not in 1 0 0 1 1 0 1 1 1 0 0 1 1 0 0 0 0 1 0 1 0 1 1 0 0 1 0 0 1 0 1 1 • Draw two independent settings of variables in , query on these 2 points 1 0 0 1 1 0 1 1 1 0 0 1 1 0 0 0 1 0 0 1 0 1 0 0 1 0 0 1 0 1 0 1 1 0 0 1 0 0 1 0 1 1 1 1 0 0 1 1 0 1 1 1 0 0 1 1 0 0 1 1 0 1 0 1 1 1 1 0 0 0 1 0 1 0 1 1 0 0 1 0 0 1 0 1 1 0 Intuition: – if has all low-influence variables, see same value whp – if has a high-influence variable, see different value sometimes

Constructing our examples Given random -bit labeled 1 0 0 1 1 0 1 1 1 0 1 1 0 0 0 1 1 0 1 0 1 1 0 0 1 1 0 1 1 10 1 0 1 1 1 0 1 example , want to construct -bit 0 1 1 1 1 0 0 0 example Follow [FKRSS02]: – Randomly partition variables into blocks; run independence test on each block ? ? ? ? ? ? ? ? ? – Can determine which blocks have high-influence variables – Each block should have at most one high-influence variable (birthday paradox)

Constructing our examples Given random -bit labeled 1 0 0 1 1 0 1 1 1 0 1 1 0 0 0 1 1 0 1 0 1 1 0 0 1 1 0 1 1 10 1 0 1 1 1 0 1 example , want to construct -bit 0 1 1 1 1 0 0 0 example We know which blocks have high-influence variables; need to determine how the high-influence variable in the block is set. Consider a fixed high-influence block String partitions into : bits set to 0 in bits set to 1 in Run independence test on each of to see which one has the high-influence variable. Repeat for all high-influence blocks to get all bits of

Testing by Implicit Learning Ilias Diakonikolas Columbia University - PowerPoint PPT Presentation

1 Testing by Implicit Learning Ilias Diakonikolas Columbia University March 2009 2 What this talk is about Recent results on testing some natural types of functions: Decision trees 1 OR 0 1 0 1 0 1 AND AND AND DNF

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

Implicit Surfaces Implicit Surfaces An implicit surface is simply an iso-contour CIS 781 of a

Implicit learning and working memory in children Nadiia Denhovska ndenhovska@uclan.ac.uk

Implicit Bias: Transcript Inclusive Teaching Series: Implicit Bias Welcome to the third module of

Implicit Extremes and Implicit MaxStable Laws Stilian Stoev ( sstoev@umich.edu ) University of

Multi-core Programming: Implicit Parallelism Tuukka Haapasalo April 16, 2009 Tuukka Haapasalo

Implicit Surfaces CPSC 599.86 / 601.86 Sonny Chan University of Calgary (some board work happened

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

Implicit Bias and Race Mikah K. Thompson, Esq. Director of Affirmative Action & Adjunct

Theory and Practice Implicit Leadership Theory (ILT) Romance of Leadership (RoL) Implicit

CS 4700: Foundations of Artificial Intelligence Prof. Bart Selman selman@cs.cornell.edu

Decision Tree Learning: Part 1 Yingyu Liang Computer Sciences 760 Fall 2017

Learning From Data Lecture 15 Reflecting on Our Path - Epilogue to Part I What We Did The

The simpler the better: Thinning out MIP's by Occam's razor Matteo Fischetti, University of

1 Hypothesis Space Inductive Learning Hypothesis Restrict learned functions a priori to a

For Monday No reading No homework Program 1 Questions? Homework Decision Tree

Dark Matter Radio (DM Radio) Kent Irwin for the DM Radio Collaboration DM Radio Pathfinder

Twelve Key Ideas In Machine Learning Pedro Domingos Dept. of Computer Science & Eng.

Testing by Implicit Learning Ilias Diakonikolas Columbia University - PowerPoint PPT Presentation

1 Testing by Implicit Learning Ilias Diakonikolas Columbia University March 2009 2 What this talk is about Recent results on testing some natural types of functions: Decision trees 1 OR 0 1 0 1 0 1 AND AND AND DNF

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

Implicit Surfaces Implicit Surfaces An implicit surface is simply an iso-contour CIS 781 of a

Implicit learning and working memory in children Nadiia Denhovska ndenhovska@uclan.ac.uk

Implicit Bias: Transcript Inclusive Teaching Series: Implicit Bias Welcome to the third module of

Implicit Extremes and Implicit MaxStable Laws Stilian Stoev ( sstoev@umich.edu ) University of

Multi-core Programming: Implicit Parallelism Tuukka Haapasalo April 16, 2009 Tuukka Haapasalo

Implicit Surfaces CPSC 599.86 / 601.86 Sonny Chan University of Calgary (some board work happened

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

Implicit Bias and Race Mikah K. Thompson, Esq. Director of Affirmative Action &amp; Adjunct

Theory and Practice Implicit Leadership Theory (ILT) Romance of Leadership (RoL) Implicit

CS 4700: Foundations of Artificial Intelligence Prof. Bart Selman selman@cs.cornell.edu

Decision Tree Learning: Part 1 Yingyu Liang Computer Sciences 760 Fall 2017

Learning From Data Lecture 15 Reflecting on Our Path - Epilogue to Part I What We Did The

The simpler the better: Thinning out MIP's by Occam's razor Matteo Fischetti, University of

1 Hypothesis Space Inductive Learning Hypothesis Restrict learned functions a priori to a

For Monday No reading No homework Program 1 Questions? Homework Decision Tree

Dark Matter Radio (DM Radio) Kent Irwin for the DM Radio Collaboration DM Radio Pathfinder

Twelve Key Ideas In Machine Learning Pedro Domingos Dept. of Computer Science &amp; Eng.

Implicit Bias and Race Mikah K. Thompson, Esq. Director of Affirmative Action & Adjunct

Twelve Key Ideas In Machine Learning Pedro Domingos Dept. of Computer Science & Eng.