Empirical Confidence Models for Supervised Machine Learning
Margarita Castro1, Meinolf Sellmann2, Zhaoyuan Yang2, Nurali Virani2
1 University of Toronto, Mechanical and Industrial Engineering 2 General Electric, Global Research Center
May, 2020
• Critical application domains: self-driving cars, healthcare diagnosis, cyber security.
• We can't expect the models to be correct on every input they see at run time.
• Summary statistics (e.g., average test error) only describe a model's performance in aggregate, not on a specific input.
"We develop techniques that learn when models generated by certain learning techniques on a particular data set can be expected to perform well, and when not."
[Diagram: a run-time instance Y is mapped to a prediction Z′ together with a competence level D ∈ {Trusted, Cautioned, Not Trusted}.]
Outline:
• Overall framework
• Meta-features
• Meta training data
• Experimental setting
• Results
• Conclusions
PART 1
Overall framework
[Diagram: the training set (Y, Z) and a learning technique (e.g., Random Forest) produce the primary model, a regressor g. At run time, an input y is passed (1) to the regressor, which returns the prediction z′ = g(y), and (2) to the Meta Feature Builder, whose output feeds the Competence Assessor, which returns the competence level D.]
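The wiring above can be summarized in a few lines. This is a minimal sketch of how the pieces could fit together; the class and method names are illustrative, not taken from the authors' code:

```python
# Minimal ECM pipeline sketch; names are hypothetical, not from the paper's implementation.
class EmpiricalConfidenceModel:
    def __init__(self, regressor, meta_builder, assessor):
        self.regressor = regressor        # primary model g, e.g. a fitted Random Forest
        self.meta_builder = meta_builder  # maps a run-time input to meta-features
        self.assessor = assessor          # classifier: meta-features -> competence level D

    def predict_with_competence(self, y):
        z_pred = self.regressor.predict(y)   # prediction z' = g(y)
        n = self.meta_builder.transform(y)   # meta-features N_1..N_6
        d = self.assessor.predict(n)         # Trusted / Cautioned / Not Trusted
        return z_pred, d
```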
Meta Feature Builder
• Distance measure $e: G \times G \to \mathbb{R}_+$; different distance measures can be used depending on the input space.
• Neighborhood $O(y)$ of the run-time input, based on the distance measure $e(\cdot,\cdot)$.
• We consider the $l$ nearest neighbors, with $l = 5$.
• From the training set $(Y, Z)$, the run-time input $y$, and the prediction $z' = g(y)$, the Meta Feature Builder computes six meta-features (a sketch of their computation follows the list):
1. Average distance to the neighborhood: $N_1(y) := \frac{1}{l} \sum_{(y_i, z_i) \in O(y)} e(y, y_i)$
   • Measures how far the run-time input is from the training data set.
2. Average prediction distance: $N_2(y) := \frac{1}{l} \sum_{(y_i, z_i) \in O(y)} |g(y) - g(y_i)|$
3. Deviation from the regressor's prediction: $N_3(y) := g(y) - \sum_{(y_i, z_i) \in O(y)} \frac{z_i}{t(y)\, e(y, y_i)}$, where $t(y) := \sum_{(y_j, z_j) \in O(y)} 1/e(y, y_j)$ normalizes the inverse-distance weights.
   • Features 2 and 3 capture the relationship between predictions in the vicinity of the current input.
4. Average training error on $O(y)$: $N_4(y) := \sum_{(y_i, z_i) \in O(y)} \frac{|g(y_i) - z_i|}{t(y)\, e(y_i, y)}$
5. Variance of the training error on $O(y)$: $N_5(y) := \frac{1}{l-1} \sum_{(y_i, z_i) \in O(y)} \big(|g(y_i) - z_i| - N_4(y)\big)^2$
   • Features 4 and 5 measure the accuracy of the regressor in the immediate vicinity.
6. Target-value variability on $O(y)$: $N_6(y) := \frac{1}{l-1} \sum_{(y_i, z_i) \in O(y)} (z_i - \bar{z})^2$, where $\bar{z}$ is the mean target value in $O(y)$.
   • Variance of the true target values in $O(y)$.
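A possible implementation of the six meta-features, using scikit-learn's nearest-neighbor search with Euclidean distance as $e(\cdot,\cdot)$ and $l = 5$. The inverse-distance weighting for $N_3$ and $N_4$ follows the reconstruction above and should be treated as an assumption:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def meta_features(y, Y_train, Z_train, g, l=5):
    """Compute the six neighborhood meta-features N_1..N_6 for one run-time input y."""
    nn = NearestNeighbors(n_neighbors=l).fit(Y_train)       # Euclidean distance as e(., .)
    dist, idx = nn.kneighbors(y.reshape(1, -1))
    dist, idx = dist[0], idx[0]                              # neighborhood O(y): l nearest points

    Yn, Zn = Y_train[idx], Z_train[idx]
    g_y = g.predict(y.reshape(1, -1))[0]                     # prediction z' = g(y)
    g_Yn = g.predict(Yn)                                     # predictions on the neighbors

    w = 1.0 / np.maximum(dist, 1e-12)                        # inverse-distance weights (assumed form)
    w /= w.sum()                                             # normalization t(y)

    err = np.abs(g_Yn - Zn)                                  # training errors on O(y)
    N1 = dist.mean()                                         # average distance to O(y)
    N2 = np.abs(g_y - g_Yn).mean()                           # average prediction distance
    N3 = g_y - np.sum(w * Zn)                                # deviation from weighted neighbor targets
    N4 = np.sum(w * err)                                     # weighted average training error
    N5 = np.sum((err - N4) ** 2) / (l - 1)                   # variance of the training error
    N6 = np.sum((Zn - Zn.mean()) ** 2) / (l - 1)             # target-value variability
    return np.array([N1, N2, N3, N4, N5, N6])
```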
Training data for the Competence Assessor
[Diagram: the Splitter divides the training set into Base and Validation parts. The technique trains the regressor on the Base part; its predictions Z′ on the Validation part are compared with the true targets Z to derive the competence labels D, and the Meta Feature Builder supplies the meta-features. Together these form the training data for the Competence Assessor.]
Splitter
• Random splitting into h ∈ {3, 5, 10} buckets.
• One bucket serves as validation and the rest as base.
• Assesses the i.i.d. assumption of the technique.
• Creates interpolation and extrapolation scenarios: project onto the 1st and 2nd PC dimensions and sort the training data before splitting (see the sketch below).
[Figure: training set split into Base and Validation buckets, for both the random split and the projected-and-sorted data.]
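A sketch of the two splitting strategies, assuming scikit-learn's PCA for the projection; the function names and the choice of h are illustrative:

```python
import numpy as np
from sklearn.decomposition import PCA

def random_buckets(n, h, rng):
    """Randomly split n training indices into h buckets; each bucket serves once as validation."""
    return np.array_split(rng.permutation(n), h)

def projected_buckets(Y, h, component=0):
    """Sort points by their projection onto a principal component and split into h contiguous
    buckets; holding out an end bucket mimics extrapolation, a middle bucket interpolation."""
    proj = PCA().fit_transform(Y)[:, component]
    return np.array_split(np.argsort(proj), h)

# usage: one bucket is the validation set, the remaining buckets form the base set
# rng = np.random.default_rng(0)
# buckets = random_buckets(len(Y_train), h=5, rng=rng)
```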
Competence labels
• Based on the true error of the learned model.
• Sort the absolute residuals in ascending order and set the labels as:
  • smallest 80% → Trusted
  • 80–95% → Cautioned
  • last 5% → Not Trusted
Note: the labeling can be modified for specific applications (a sketch follows below).
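The labeling rule can be expressed with residual quantiles. A minimal sketch, with the function name hypothetical and the thresholds as stated above:

```python
import numpy as np

def competence_labels(z_true, z_pred, trusted_q=0.80, cautioned_q=0.95):
    """Label validation points by where their absolute residuals fall when sorted:
    smallest 80% -> Trusted, 80-95% -> Cautioned, last 5% -> Not Trusted."""
    residuals = np.abs(np.asarray(z_true) - np.asarray(z_pred))
    t1, t2 = np.quantile(residuals, [trusted_q, cautioned_q])
    return np.where(residuals <= t1, "Trusted",
                    np.where(residuals <= t2, "Cautioned", "Not Trusted"))
```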
Competence Assessor
• Off-the-shelf SVM and Random Forest classifiers.
• Our goal is to test the framework on several datasets.
Note: more sophisticated techniques can be used for specific applications (sketch below).
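Given the meta-feature matrix and labels produced by the steps above, the competence assessor is simply an off-the-shelf classifier; the hyperparameters below are illustrative, not the authors' settings:

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

# N: meta-feature matrix (one row of N_1..N_6 per validation point), D: competence labels
assessor = RandomForestClassifier(n_estimators=200, random_state=0)   # or assessor = SVC()
# assessor.fit(N, D)
# D_runtime = assessor.predict(meta_features_runtime)
```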
PART 2
Objective: test the framework on several datasets, regressors, and train/test splits.
• Six UCI benchmark data sets.
• Regressors: Linear, Random Forest, and …
• Tasks: standard, interpolation, and extrapolation.
  • Standard: standard cross-validation.
  • Interpolation and extrapolation: cluster the data and take complete clusters as the test set (sketched below); PC projections (1st and 3rd).
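One way to realize the cluster-based test split, assuming k-means from scikit-learn; the number of clusters and the number of held-out clusters are illustrative choices:

```python
import numpy as np
from sklearn.cluster import KMeans

def cluster_holdout_split(Y, n_clusters=10, held_out=2, seed=0):
    """Hold out complete clusters as the test set, so test points lie away from the
    training data (non-i.i.d. interpolation/extrapolation scenarios)."""
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(Y)
    rng = np.random.default_rng(seed)
    test_clusters = rng.choice(n_clusters, size=held_out, replace=False)
    test_mask = np.isin(labels, test_clusters)
    return np.where(~test_mask)[0], np.where(test_mask)[0]   # train indices, test indices
```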
Illustrative example
• 1-dimensional data following a linear regression with random noise.
• Interpolation task.
• Regressors: Linear regression and Random Forest.
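A minimal way to generate toy data of this kind; the slope, noise level, and held-out middle interval are assumptions, not taken from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
Y = rng.uniform(-5, 5, size=(200, 1))                 # 1-D inputs
Z = 2.0 * Y[:, 0] + 1.0 + rng.normal(0, 0.5, 200)     # linear relation plus random noise

# interpolation task: hold out a middle interval so test inputs fall between training points
train_mask = (Y[:, 0] < -1) | (Y[:, 0] > 1)
Y_train, Z_train = Y[train_mask], Z[train_mask]
Y_test,  Z_test  = Y[~train_mask], Z[~train_mask]
```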
[Figure: Linear Regression Model, Random Forest Model, and ECM predictions, with points labeled Trusted, Cautioned, or Not Trusted.]
Baseline: competence assessor trained on the original data (only standard splitting and no meta-features).
Conclusions
• We present an Empirical Confidence Model (ECM) that assesses the competence of a trained model's predictions at run time.
• We show the effectiveness of ECM for i.i.d. and non-i.i.d. train/test splits.
• Future work:
  • Study other reliability measures as meta-features.
  • Integrate our methodology into an active learning setting.