PAC Learning: Matt Gormley, Lecture 14, Oct. 17, 2018



SLIDE 1

PAC Learning

10-601 Introduction to Machine Learning
Machine Learning Department, School of Computer Science, Carnegie Mellon University

Matt Gormley
Lecture 14, Oct. 17, 2018

SLIDE 2

ML Big Picture


Learning Paradigms: What data is available and when? What form of prediction?

  • supervised learning
  • unsupervised learning
  • semi-supervised learning
  • reinforcement learning
  • active learning
  • imitation learning
  • domain adaptation
  • online learning
  • density estimation
  • recommender systems
  • feature learning
  • manifold learning
  • dimensionality reduction
  • ensemble learning
  • distant supervision
  • hyperparameter optimization

Problem Formulation: What is the structure of our output prediction?

  • boolean: Binary Classification
  • categorical: Multiclass Classification
  • ordinal: Ordinal Classification
  • real: Regression
  • ordering: Ranking
  • multiple discrete: Structured Prediction
  • multiple continuous: (e.g. dynamical systems)
  • both discrete & cont.: (e.g. mixed graphical models)

Theoretical Foundations: What principles guide learning?

  • probabilistic
  • information theoretic
  • evolutionary search
  • ML as optimization

Facets of Building ML Systems: How to build systems that are robust, efficient, adaptive, effective?

  1. Data prep
  2. Model selection
  3. Training (optimization / search)
  4. Hyperparameter tuning on validation data
  5. (Blind) Assessment on test data

Big Ideas in ML: Which are the ideas driving development of the field?

  • inductive bias
  • generalization / overfitting
  • bias-variance decomposition
  • generative vs. discriminative
  • deep nets, graphical models
  • PAC learning
  • distant rewards

Application Areas: Key challenges? NLP, Speech, Computer Vision, Robotics, Medicine, Search

SLIDE 3

LEARNING THEORY


SLIDE 4

Questions For Today

  • 1. Given a classifier with zero training error, what can we say about generalization error? (Sample Complexity, Realizable Case)
  • 2. Given a classifier with low training error, what can we say about generalization error? (Sample Complexity, Agnostic Case)
  • 3. Is there a theoretical justification for regularization to avoid overfitting? (Structural Risk Minimization)

SLIDE 5

PAC / SLT Model


PAC/SLT model for Supervised Learning:

  • Data Source: distribution D on X; an expert/oracle labels examples via the target concept c* : X → Y
  • Labeled examples: (x1, c*(x1)), …, (xm, c*(xm))
  • The learning algorithm outputs a hypothesis h : X → Y

[Figure: example decision-tree hypothesis with splits such as x1 > 5 and x6 > 2, labeling points +1 / -1]

  • Slide from Nina Balcan
SLIDE 6

Two Types of Error


  • Train Error (aka. empirical risk)
  • True Error (aka. expected risk)
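The slide's formulas did not survive extraction. As a sketch, the standard definitions, using the target c* and distribution D from the PAC/SLT model, are:

```latex
\underbrace{\hat{R}(h) \;=\; \frac{1}{m}\sum_{i=1}^{m} \mathbb{1}\!\left[ h(x_i) \neq c^*(x_i) \right]}_{\text{train error (empirical risk)}}
\qquad
\underbrace{R(h) \;=\; \Pr_{x \sim D}\!\left[ h(x) \neq c^*(x) \right]}_{\text{true error (expected risk)}}
```

Train error is measured on the m observed examples; true error is an expectation over the (unknown) distribution D, which is why it must be bounded rather than computed.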

SLIDE 7

PAC / SLT Model


SLIDE 8

Three Hypotheses of Interest
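The slide body did not survive extraction. As a sketch, the three hypotheses usually contrasted at this point are the target, the best-in-class hypothesis, and the learner's output (writing R for true error and R̂ for train error):

```latex
c^* : \text{the true target function (need not lie in } \mathcal{H}\text{)} \\
h^* \;=\; \operatorname*{argmin}_{h \in \mathcal{H}} R(h) : \text{best hypothesis in the class} \\
\hat{h} \;=\; \operatorname*{argmin}_{h \in \mathcal{H}} \hat{R}(h) : \text{hypothesis returned by an empirical risk minimizer}
```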


SLIDE 9

PAC LEARNING


SLIDE 10

Probably Approximately Correct (PAC) Learning

Whiteboard:

– PAC Criterion
– Meaning of “Probably Approximately Correct”
– PAC Learnable
– Consistent Learner
– Sample Complexity
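As a sketch of the whiteboard material: the PAC criterion requires that, with probability at least 1 − δ over the draw of the training sample, the learned hypothesis h is approximately correct:

```latex
\Pr\!\left[\, R(h) \le \epsilon \,\right] \;\ge\; 1 - \delta
```

“Probably” refers to the 1 − δ confidence; “approximately correct” refers to the true-error bound ε. The sample complexity is the number of training examples m sufficient for this guarantee to hold.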


SLIDE 11

Generalization and Overfitting

Whiteboard:

– Realizable vs. Agnostic Cases
– Finite vs. Infinite Hypothesis Spaces


SLIDE 12

PAC Learning


SLIDE 13

SAMPLE COMPLEXITY RESULTS


SLIDE 14

Sample Complexity Results


Four cases we care about: realizable vs. agnostic, each with a finite or infinite hypothesis space.

We’ll start with the finite case…
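The bound statements on this slide did not survive extraction. As a sketch, the standard finite-|H| results are: with probability at least 1 − δ,

```latex
\text{Realizable } (c^* \in \mathcal{H},\ \hat{R}(h) = 0):\quad
m \;\ge\; \frac{1}{\epsilon}\left[ \ln|\mathcal{H}| + \ln\frac{1}{\delta} \right]
\;\Longrightarrow\; R(h) \le \epsilon
\\[1em]
\text{Agnostic:}\quad
m \;\ge\; \frac{1}{2\epsilon^2}\left[ \ln|\mathcal{H}| + \ln\frac{2}{\delta} \right]
\;\Longrightarrow\; \forall h \in \mathcal{H},\ \bigl| R(h) - \hat{R}(h) \bigr| \le \epsilon
```

The infinite-|H| versions replace the ln|H| term with a quantity based on the VC dimension of H.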

SLIDE 15

Sample Complexity Results


Four cases we care about: realizable vs. agnostic, each with a finite or infinite hypothesis space.

SLIDE 16

Example: Conjunctions

In-Class Quiz: Suppose H = the class of conjunctions over x in {0,1}^M. If M = 10, ε = 0.1, δ = 0.01, how many examples suffice?


(Consider both the realizable and agnostic cases.)
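As a sketch of how the quiz can be worked, assuming the standard finite-|H| bounds and counting |H| = 3^M conjunctions (both assumptions of this example, not taken from the slide):

```python
import math

def m_realizable(H_size, epsilon, delta):
    """Examples sufficient for a consistent learner (zero train error)
    to reach true error <= epsilon with probability >= 1 - delta:
    m >= (1/eps) * (ln|H| + ln(1/delta))."""
    return math.ceil((math.log(H_size) + math.log(1 / delta)) / epsilon)

def m_agnostic(H_size, epsilon, delta):
    """Examples sufficient so that, w.p. >= 1 - delta, every h in H has
    |true error - train error| <= epsilon:
    m >= (1/(2 eps^2)) * (ln|H| + ln(2/delta))."""
    return math.ceil((math.log(H_size) + math.log(2 / delta)) / (2 * epsilon ** 2))

# Conjunctions over x in {0,1}^M: each of the M variables appears
# positively, negated, or not at all, so |H| = 3^M (a common count;
# some presentations add 1 for the always-false concept).
M, epsilon, delta = 10, 0.1, 0.01
H_size = 3 ** M  # 59049

print(m_realizable(H_size, epsilon, delta))  # 156
print(m_agnostic(H_size, epsilon, delta))    # 815
```

Note how cheaply the logarithm absorbs the exponentially large class: |H| = 3^10 = 59049, yet ln|H| ≈ 11, so on the order of a hundred examples suffice in the realizable case.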

SLIDE 17

Learning Theory Objectives

You should be able to…

  • Identify the properties of a learning setting and assumptions required to ensure low generalization error
  • Distinguish true error, train error, test error
  • Define PAC and explain what it means to be approximately correct and what occurs with high probability
  • Apply sample complexity bounds to real-world learning examples
  • Distinguish between a large sample and a finite sample analysis
  • Theoretically motivate regularization
