What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features - PowerPoint PPT Presentation


SLIDE 1

What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

Pasha Khosravi, Yitao Liang, YooJung Choi and Guy Van den Broeck

Computer Science Department, UCLA

August 15, 2019

SLIDE 2

Motivation

[Diagram: train a classifier (e.g., Logistic Regression) on complete data, then predict on test samples with missing features.]

SLIDE 3

Common Approaches

  • The common approach is to fill in the missing features, i.e., to do imputation.
  • Simple imputations make unrealistic assumptions (mean, median, etc.).
  • More sophisticated methods such as MICE don’t scale to bigger problems (and also make assumptions).
  • We want a more principled way of dealing with missing features while staying efficient. (A small sketch of these imputation baselines follows below.)
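A minimal, hedged sketch of the imputation baselines mentioned above, using scikit-learn's SimpleImputer (mean imputation) and the MICE-style IterativeImputer; scikit-learn and the toy data are assumptions made for illustration, not something the slides prescribe.

```python
# Illustrative only: mean imputation vs. a MICE-style iterative imputer.
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (enables IterativeImputer)
from sklearn.impute import SimpleImputer, IterativeImputer

# Toy test samples with missing features (NaN = missing).
X_test = np.array([[1.0, np.nan, 3.0],
                   [np.nan, 2.0, 6.0],
                   [7.0, 8.0, np.nan]])

# Mean imputation: replaces each NaN with the column mean (a strong assumption).
X_mean = SimpleImputer(strategy="mean").fit_transform(X_test)

# MICE-style imputation: iteratively regresses each feature on the others;
# more flexible, but slower and still model-based.
X_mice = IterativeImputer(max_iter=10, random_state=0).fit_transform(X_test)

print(X_mean)
print(X_mice)
```

In practice the imputers would be fit on the training data and only applied to incomplete test samples; either way, the downstream classifier only ever sees a single completed input.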

slide-4
SLIDE 4

Computer Science Department What to Expect of Classifiers? Reasoning about Logistic Regression with missing features August 15, 2019 5

Generative vs Discriminative Models

  • Discriminative models (e.g., Logistic Regression) learn the conditional distribution P(Y | X); generative models (e.g., Naïve Bayes) learn the joint distribution P(X, Y).
  • Trade-off: generative models handle missing features naturally, by marginalizing them out, while discriminative models typically give better classification accuracy. (A toy example follows below.)
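To make the contrast concrete, here is a hedged toy example (not from the slides): a two-feature Naïve Bayes can answer P(Y | observed features) by simply summing out whatever is missing, while a purely discriminative P(Y | X) has no built-in way to do this. All parameters below are made up.

```python
# Toy Naive Bayes over two binary features: missing features marginalize out for free.
import numpy as np

p_y = np.array([0.6, 0.4])               # P(Y = 0), P(Y = 1)
p_x_eq1_given_y = np.array([[0.9, 0.2],  # P(X_0 = 1 | Y = 0), P(X_0 = 1 | Y = 1)
                            [0.7, 0.3]]) # P(X_1 = 1 | Y = 0), P(X_1 = 1 | Y = 1)

def nb_posterior(x0=None, x1=None):
    """P(Y | observed features); any feature passed as None is summed out."""
    joint = p_y.copy()
    for i, v in enumerate([x0, x1]):
        if v is None:
            continue  # missing: sum_{x_i} P(x_i | Y) = 1, so the factor just drops out
        joint = joint * (p_x_eq1_given_y[i] if v == 1 else 1 - p_x_eq1_given_y[i])
    return joint / joint.sum()

print(nb_posterior(x0=1, x1=0))  # both features observed
print(nb_posterior(x0=1))        # x1 missing: still a well-defined posterior
```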

SLIDE 5

Expected Prediction

  • How can we leverage the strengths of both discriminative and generative models?
  • “Expected prediction” is a principled way to reason about the outcome of a classifier F with respect to a feature distribution P.
  • With M the missing features and y the observed features, the expected prediction is E_{m ∼ P(M | y)} [ F(y, m) ]. (Sketched below.)
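A hedged brute-force sketch of the definition above (this is the naive exponential computation, not the paper's algorithm): the classifier F is a logistic regression over binary features, and the feature distribution is assumed here to be a product of independent Bernoullis, so P(M | y) = P(M); the paper instead uses a Naïve Bayes learned by NaCL. All weights and probabilities below are made up for illustration.

```python
# Expected prediction E_{m ~ P(M | y)}[F(y, m)] by enumerating the missing features.
import itertools
import numpy as np

w = np.array([1.5, -2.0, 0.8])     # logistic regression weights (illustrative)
b = -0.3                           # bias
theta = np.array([0.7, 0.4, 0.5])  # assumed P(X_i = 1), features independent

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def expected_prediction(x, missing):
    """x: feature vector (entries at `missing` indices are ignored);
    missing: list of indices of unobserved features."""
    total = 0.0
    for vals in itertools.product([0, 1], repeat=len(missing)):
        xm = np.array(x, dtype=float)
        xm[missing] = vals
        # probability of this completion under the independent feature model
        p = np.prod([theta[i] if v == 1 else 1.0 - theta[i]
                     for i, v in zip(missing, vals)])
        total += p * sigmoid(w @ xm + b)
    return total

x = np.array([1, 0, 1])
print(sigmoid(w @ x + b))                      # fully observed prediction
print(expected_prediction(x, missing=[1]))     # one feature missing
print(expected_prediction(x, missing=[1, 2]))  # two features missing
```

Note that the loop is exponential in the number of missing features, which is exactly the hardness issue raised two slides below.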

SLIDE 6

Expected Prediction Intuition

  • Imputation techniques: replace the missing-ness uncertainty with one or multiple concrete inputs, and evaluate the model on those.
  • Expected prediction: considers all possible inputs and reasons about the expected behavior of the classifier.

SLIDE 7

Hardness of Taking Expectations

  • In general, computing the expected prediction is intractable for arbitrary pairs of discriminative and generative models.
  • Even when F is Logistic Regression and P is Naïve Bayes, the task is NP-hard.
  • So how can we compute the expected prediction?
SLIDE 8

Conformant Learning

Given a discriminative classifier and a dataset, learn a generative model that

  • 1. Conforms to the classifier.
  • 2. Maximizes the likelihood of the joint feature distribution P(X).

No missing features → same quality of classification.
Missing features → no problem, do inference (marginalize them out).

SLIDE 9

Naïve Conformant Learning (NaCL)

We focus on the case of conformant learning involving Logistic Regression and Naïve Bayes:

  • Given a Naïve Bayes model, there is a unique Logistic Regression model that conforms to it (see the sketch below).
  • Given a Logistic Regression model, there are many Naïve Bayes models that conform to it; NaCL learns the “best” one.
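The first bullet can be illustrated directly (a hedged sketch with made-up parameters, not code from the paper): for binary features, a Naïve Bayes model induces a unique logistic regression with the same conditional P(Y | X) by writing its log-odds in linear form. The reverse direction, from an LR to a "best" conforming Naïve Bayes, is what NaCL solves and is not shown here.

```python
# Naive Bayes -> the unique conforming logistic regression (binary features).
import numpy as np

p_y1 = 0.4                     # P(Y = 1), illustrative
theta = np.array([[0.2, 0.9],  # rows: features; cols: P(X_i = 1 | Y = 0), P(X_i = 1 | Y = 1)
                  [0.5, 0.3],
                  [0.1, 0.6]])

# Induced logistic regression weights and bias (log-odds of the NB posterior).
w = np.log(theta[:, 1] / theta[:, 0]) - np.log((1 - theta[:, 1]) / (1 - theta[:, 0]))
b = np.log(p_y1 / (1 - p_y1)) + np.sum(np.log((1 - theta[:, 1]) / (1 - theta[:, 0])))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Sanity check on a fully observed example: both models agree.
x = np.array([1, 0, 1])
nb_joint = np.array([
    (1 - p_y1) * np.prod(np.where(x == 1, theta[:, 0], 1 - theta[:, 0])),
    p_y1 * np.prod(np.where(x == 1, theta[:, 1], 1 - theta[:, 1])),
])
print(nb_joint[1] / nb_joint.sum())  # NB posterior P(Y = 1 | x)
print(sigmoid(w @ x + b))            # LR prediction, identical by construction
```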
SLIDE 10

Naïve Conformant Learning (NaCL)

  • We showed that the Naïve Conformant Learning optimization task can be written as a Geometric Program.
  • Geometric Programs are a special class of constrained optimization problems that can be solved exactly and efficiently, and modern GP solvers can handle large problems. (A toy GP is sketched below.)
  • For NaCL, the program has O(n·k) constraints, where n is the number of features and k is the number of classes.
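For readers unfamiliar with the problem class, here is a tiny geometric program solved with CVXPY's geometric-programming mode (essentially the standard CVXPY DGP toy example). CVXPY is an assumed tool used only for illustration; this is not the actual NaCL optimization, which is spelled out in the paper and repository.

```python
# A toy geometric program: monomial objective, monomial/posynomial constraints.
import cvxpy as cp

x = cp.Variable(pos=True)
y = cp.Variable(pos=True)
z = cp.Variable(pos=True)

objective = cp.Maximize(x * y * z)        # monomial objective
constraints = [
    4 * x * y * z + 2 * x * z <= 10,      # posynomial upper bound
    x <= 2 * y,
    y <= 2 * x,
    z >= 1,
]
problem = cp.Problem(objective, constraints)
problem.solve(gp=True)                    # solved exactly via a log-space (convex) transform
print(problem.value, x.value, y.value, z.value)
```

The appeal for NaCL is that such programs, while non-convex in their original variables, become convex after a change of variables, so they can be solved to global optimality at scale.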

SLIDE 11

Naïve Conformant Learning (NaCL)

[Diagram: Logistic Regression weights → NaCL → “best” conforming Naïve Bayes.]

NaCL on GitHub: github.com/UCLA-StarAI/NaCL

SLIDE 12

Experiments: Fidelity to the Original Classifier

  • We use cross entropy to compare the predicted probabilities of the original classifier with those of NaCL’s learned model. (A small sketch of the metric follows below.)
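A hedged sketch of that fidelity metric with made-up numbers (not the paper's experimental code): average cross entropy between the two models' predicted probabilities on the same inputs, where lower means the learned model stays closer to the original classifier.

```python
# Cross entropy between two models' predicted probabilities on the same inputs.
import numpy as np

p_original = np.array([0.90, 0.20, 0.70, 0.55])  # P(Y=1|x) from the logistic regression
p_learned = np.array([0.85, 0.25, 0.65, 0.55])   # P(Y=1|x) from the conforming Naive Bayes

def cross_entropy(p, q, eps=1e-12):
    q = np.clip(q, eps, 1.0 - eps)
    return -np.mean(p * np.log(q) + (1 - p) * np.log(1 - q))

print(cross_entropy(p_original, p_learned))
```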
SLIDE 13

Experiments: Classification Accuracy

SLIDE 14

Other Applications

We saw that expected prediction is very effective at handling missing features. What else can we do with it?

  • Explanations
  • Feature Selection
  • Fairness
SLIDE 15

Local Explanations using Missing-ness

Goal: explain an individual classification.

  • Supporting features: making them missing → the predicted probability goes down.
  • Opposing features: making them missing → the predicted probability goes up.

Sufficient explanations: remove as many supporting features as possible, until the expected classification is about to change; the remaining supporting features are the explanation. (A greedy sketch follows below.)
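A hedged greedy sketch of the procedure just described (the paper may search differently); it reuses the brute-force expected prediction from the earlier sketch, with an assumed independent feature distribution, made-up weights, and a 0.5 decision threshold.

```python
# Greedy sufficient explanation for a small logistic regression over binary
# features, using brute-force expected prediction (same setup as the earlier sketch).
import itertools
import numpy as np

w = np.array([1.0, 3.0])      # illustrative LR weights; both observed features support class 1
b = -1.5
theta = np.array([0.3, 0.3])  # assumed independent P(X_i = 1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def expected_prediction(x, missing):
    total = 0.0
    for vals in itertools.product([0, 1], repeat=len(missing)):
        xm = np.array(x, dtype=float)
        xm[missing] = vals
        p = np.prod([theta[i] if v else 1 - theta[i] for i, v in zip(missing, vals)])
        total += p * sigmoid(w @ xm + b)
    return total

def sufficient_explanation(x, support_features, threshold=0.5):
    """Drop supporting features while the expected classification stays positive;
    the features that cannot be dropped form the explanation."""
    kept, dropped = list(support_features), []
    for f in list(support_features):   # simple fixed order; smarter orderings are possible
        if expected_prediction(x, missing=dropped + [f]) > threshold:
            dropped.append(f)
            kept.remove(f)
    return kept

x = np.array([1, 1])                   # instance classified positive (p ~ 0.92)
print(sufficient_explanation(x, support_features=[0, 1]))  # -> [1]: the strong feature suffices
```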

SLIDE 16

Conclusion

  • Expected prediction is an effective tool for several applications, such as handling missing data and generating explanations.
  • We introduced NaCL, an efficient algorithm to convert a Logistic Regression model into a conforming Naïve Bayes model.
  • Future work includes looking at more expressive pairs of models, and potentially choosing models that keep the expected prediction tractable.

SLIDE 17

Thank You


What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features
GitHub: github.com/UCLA-StarAI/NaCL