Top-down induction of decision trees: rigorous guarantees and - PowerPoint PPT Presentation

Top-down induction of decision trees: rigorous guarantees and inherent limitations Guy Blanc, Jane Lange, Li-Yang Tan

This work: Learning decision trees from labeled data x 1 0 1 x f(x) 000010101 0 x 2 x 3 0 1 0 1 011011010 1 100100111 1 0 1 1 x 2 101001000 1 0 1 001010010 0 1 0

“In experimental and applied machine learning work, it is hard to exaggerate the influence of top-down heuristics for building a decision tree from labeled sample data” - [Kearns and Mansour 96]

Decision trees also intensively studied in TCS ● Query model of computation ● Quantum complexity ● Derandomization ● ... ● Learning theory ○ [Ehrenfeucht-Haussler 89, Goldreich-Levin 89, Kushilevitz-Mansour 92, … MR02, OS07, GKK08, HKY18, CM19, …]

Theory vs. practice of learning decision trees: A disconnect Theoretical Practical heuristics algorithms work work “top-down” “bottom-up” ID3, C4.5, CART [EH89, MR02] Our results (Part 1): Our results (Part 2): Rigorous guarantees and Theoretical algorithms inherent limitations with improved guarantees

Top-down induction of decision trees 1) Determine “good” variable x 4 to query as root 0 1 2) Recurse on both subtrees x 4 = 0 x 4 = 1 f f

Top-down induction of decision trees 1) Determine “good” variable x 4 to query as root 0 1 2) Recurse on both subtrees x 4 = 0 x 4 = 1 f f “Good” variable = one that is very “relevant,” “important,” “influential”

Our splitting criterion: Influence Basic and well-studied notion with applications throughout TCS

Our algorithm: TopDown 1) Query the most influential x 4 variable of f at the root 0 1 2) Recurse on both subtrees x 4 = 0 x 4 = 1 f f Our results: Provable guarantees and inherent limitations of TopDown

A guarantee for all functions Theorem: Let f be a size-s decision tree. TopDown builds a tree of size at most that ε -approximates f A matching lower bound Theorem: For any s and ε, there is a size-s decision tree f such that the size of TopDown(f, ε ) is

A guarantee for monotone functions Theorem: Let f be a monotone size-s decision tree. TopDown builds a tree of size at most that ε -approximates f. A near-matching lower bound Theorem: For any s and ε, there is a monotone size-s decision tree f such that the size of TopDown(f, ε ) is . A bound of poly(s) had been conjectured by [FP04].

Algorithmic consequences ● Properly learn decision trees in time ○ Runtime compares favorably with best algorithm with provable guarantee [EH89] ○ Downside: requires query access to the function ● For monotone functions, properly learn decision trees in time using only random examples ○ For monotone functions, influence = splitting criteria used in practical heuristics (ID3, C4.5, and CART) ○ Provable guarantees on these heuristics for a broad and natural class of data sets

Theory vs. practice of learning decision trees: A disconnect Theoretical Practical heuristics algorithms work work “top-down” “bottom-up” ID3, C4.5, CART [EH89, MR02] Our results (Part 1): Our results (Part 2): Rigorous guarantees and Theoretical algorithms inherent limitations with improved guarantees

Improving Ehrenfeucht-Haussler (1989) Theorem [EH89]: There is a quasi-polynomial time algorithm for properly learning decision trees. Theorem (Our work): There is a quasi-polynomial time algorithm for properly learning decision trees with polynomial memory and sample complexity.

Thank you! Practical heuristics Theoretical work “top-down” algorithms work “bottom-up” ID3, C4.5, CART [EH89, MR02] Our results (Part 1): Our results (Part 2): Rigorous guarantees and Theoretical algorithms inherent limitations with improved guarantees

Top-down induction of decision trees: rigorous guarantees and - PowerPoint PPT Presentation

Top-down induction of decision trees: rigorous guarantees and inherent limitations Guy Blanc, Jane Lange, Li-Yang Tan This work: Learning decision trees from labeled data x 1 0 1 x f(x) 000010101 0 x 2 x 3 0 1 0 1 011011010 1

Decision Trees Lecture 23 To left or to right 1 Decision Trees 2 Decision Trees A different

Decision Trees Lecture 22 To left or to right 1 Decision Trees 2 Decision Trees A different

Learning Decision Trees Representation is a decision tree. Bias is towards simple decision

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Induction Stepwise induction (for T PA , T cons ) Complete induction (for T PA , T cons )

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

Induction and recursion Chapter 5 Chapter Summary Mathematical Induction Strong Induction

Why Algorithmic and Rigorous Polynomial Approximations? Rigorous Polynomial Approximation =

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

Decision Tree R Greiner Cmput 466 / 551 Learning Decision Trees Def'n: Decision Trees

IAML: Decision Trees Chris Williams and Victor Lavrenko School of Informatics Semester 1 1 / 17

Trees Eric McCreath Overview In this lecture we will explore: general trees, binary trees,

Mathematical Induction Lecture 10-11 Menu Mathematical Induction Strong Induction

MA THEMA TICAL INDUCTION Induction and Deduction Mathematical Induction (its

Beyond Inductive Definitions Induction-Recursion, Induction-Induction, Coalgebras Anton

Chapter 3: Floorplanning Sadiq M. Sait & Habib Youssef King Fahd University of Petroleum

Tutorial April 27, 2020 [1]: import matplotlib.pyplot as plt import sys sys.stderr =

The Role of Machine Learning in Network Automation Alberto Leon-Garcia University of Toronto

Static Program Analysis Foundations of Abstract Interpretation Sebastian Hack, Christian Hammer,

Welcome to PBA Software development CPHBUSINESS Agenda Who are we brief introduction

CORPORATE TAXPREP www.htkacademy.com www.htkacademy.com Lecture Component Agenda Acronyms

Forward Looking Statement Certain information presented, including discussions of future plans and

UNDERSTANDING TORT LAW PRIVATE NUISANCE 03 FIVE CASES Fontainebleau Hotel Corp v Forty-Five

Top-down induction of decision trees: rigorous guarantees and - PowerPoint PPT Presentation

Top-down induction of decision trees: rigorous guarantees and inherent limitations Guy Blanc, Jane Lange, Li-Yang Tan This work: Learning decision trees from labeled data x 1 0 1 x f(x) 000010101 0 x 2 x 3 0 1 0 1 011011010 1

Decision Trees Lecture 23 To left or to right 1 Decision Trees 2 Decision Trees A different

Decision Trees Lecture 22 To left or to right 1 Decision Trees 2 Decision Trees A different

Learning Decision Trees Representation is a decision tree. Bias is towards simple decision

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Induction Stepwise induction (for T PA , T cons ) Complete induction (for T PA , T cons )

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

Induction and recursion Chapter 5 Chapter Summary Mathematical Induction Strong Induction

Why Algorithmic and Rigorous Polynomial Approximations? Rigorous Polynomial Approximation =

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

Decision Tree R Greiner Cmput 466 / 551 Learning Decision Trees Def'n: Decision Trees

IAML: Decision Trees Chris Williams and Victor Lavrenko School of Informatics Semester 1 1 / 17

Trees Eric McCreath Overview In this lecture we will explore: general trees, binary trees,

Mathematical Induction Lecture 10-11 Menu Mathematical Induction Strong Induction

MA THEMA TICAL INDUCTION Induction and Deduction Mathematical Induction (its

Beyond Inductive Definitions Induction-Recursion, Induction-Induction, Coalgebras Anton

Chapter 3: Floorplanning Sadiq M. Sait &amp; Habib Youssef King Fahd University of Petroleum

Tutorial April 27, 2020 [1]: import matplotlib.pyplot as plt import sys sys.stderr =

The Role of Machine Learning in Network Automation Alberto Leon-Garcia University of Toronto

Static Program Analysis Foundations of Abstract Interpretation Sebastian Hack, Christian Hammer,

Welcome to PBA Software development CPHBUSINESS Agenda Who are we brief introduction

CORPORATE TAXPREP www.htkacademy.com www.htkacademy.com Lecture Component Agenda Acronyms

Forward Looking Statement Certain information presented, including discussions of future plans and

UNDERSTANDING TORT LAW PRIVATE NUISANCE 03 FIVE CASES Fontainebleau Hotel Corp v Forty-Five

Chapter 3: Floorplanning Sadiq M. Sait & Habib Youssef King Fahd University of Petroleum