A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks

  1. A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks
  Kimin Lee 1, Kibok Lee 2, Honglak Lee 3,2, Jinwoo Shin 1,4
  1 Korea Advanced Institute of Science and Technology (KAIST), 2 University of Michigan, 3 Google Brain, 4 AItrics
  NeurIPS 2018, Montréal

  2–5. Motivation: Detecting Abnormal Samples
  • A classifier can provide a meaningful answer only if a test sample is reasonably similar to the training samples
  • In practice, however, it sees many unknown/unseen test samples
  • E.g., a classifier trained on animal data still labels a non-animal image "dog" or "cat" with 99% confidence
  • This raises a critical concern when deploying classifiers in real-world systems
  • E.g., rarely-seen objects can cause self-driving car accidents: Sunflower → "Go straight" → Crash!
  • Our goal is to design a classifier that can say "I don't know"

  6–9. Motivation: Detecting Abnormal Samples (cont.)
  • Goal: detect test samples drawn sufficiently far away from the training distribution, either statistically (unseen samples) or adversarially (adversarial samples)
  [Diagram: a deep classifier assigns a confidence score to a test sample, which may come from the training distribution (e.g., animals), be an unseen sample, or be an adversarial sample]
  • How should a confidence score be defined?
  • One can use the posterior distribution from the classifier, i.e., Q(y|x)
  • However, it is well known that the posterior distribution can be easily overconfident even for such abnormal samples [Balaji '17]
  • To address this issue, we instead model the data distribution, i.e., Q(x|y)
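The overconfidence of the softmax posterior can be seen even in a toy linear classifier: moving a test point far from the training data only pushes the maximum posterior closer to 1. A minimal NumPy sketch (the weights and inputs here are made up purely for illustration):

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

# Toy two-class linear classifier: logits = W @ x
W = np.array([[1.0, 0.0],
              [-1.0, 0.0]])

in_dist = np.array([1.0, 0.0])    # a point near the training data
far_ood = np.array([100.0, 0.0])  # a point far from any training data

p_in = softmax(W @ in_dist).max()   # ~0.88
p_ood = softmax(W @ far_ood).max()  # ~1.0: MORE confident, farther from the data
```

The farther the input drifts from the data, the larger the logit gap grows, so max-softmax confidence saturates at 1 rather than dropping, which is exactly why a posterior-based score fails on abnormal samples.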

  10–12. Mahalanobis Distance-based Confidence Score
  • Main idea: post-processing a generative classifier
  • Given a pre-trained softmax classifier, we fit a simple generative classifier on its hidden feature spaces: a class-conditional Gaussian distribution
  • How to estimate the parameters? Empirical class means and a (tied) covariance matrix, computed from the training data
  • Why Gaussian? The posterior distribution of the generative classifier (with a tied covariance) is equivalent to the softmax classifier
  • Empirical observation: for a ResNet-34 trained on CIFAR-10, hidden features follow class-conditional unimodal distributions [t-SNE of penultimate features]
  • Our main contribution: a new confidence score, the Mahalanobis distance between a test sample and the closest class Gaussian:

    M(x) = max_c log P(f(x) | y = c) = max_c −(f(x) − μ̂_c)ᵀ Σ̂⁻¹ (f(x) − μ̂_c)
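The parameter estimation and the score M(x) above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' released code; the function names are mine, and it assumes the features f(x) have already been extracted from the network:

```python
import numpy as np

def fit_gaussians(features, labels, num_classes):
    """Estimate per-class empirical means and a tied covariance matrix
    from training features. `features`: (N, D); `labels`: (N,) int."""
    means = np.stack([features[labels == c].mean(axis=0)
                      for c in range(num_classes)])
    centered = features - means[labels]          # subtract each sample's class mean
    cov = centered.T @ centered / len(features)  # shared (tied) covariance
    return means, np.linalg.inv(cov)

def mahalanobis_score(x_feat, means, cov_inv):
    """Confidence score M(x): negative Mahalanobis distance from the
    feature f(x) to the closest class Gaussian (higher = more in-distribution)."""
    diffs = means - x_feat                                   # (C, D)
    dists = np.einsum("cd,de,ce->c", diffs, cov_inv, diffs)  # per-class quadratic form
    return -dists.min()
```

A test sample is then flagged as out-of-distribution when its score falls below a threshold chosen on validation data.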

  13. Experimental Results
  • Application to detecting out-of-distribution samples
    • State-of-the-art baseline: ODIN [Liang '18], the maximum value of a posterior distribution after post-processing
    • Setup: DenseNet-110 [Huang '17] trained on the CIFAR-100 dataset; out-of-distribution data: TinyImageNet
    • Our method outperforms ODIN [bar chart: TNR at TPR 95%, AUROC, and detection accuracy]
  • Application to detecting adversarial samples
    • State-of-the-art baseline: LID [Ma '18], a kNN-based confidence score using Local Intrinsic Dimensionality
    • Setup: ResNet-34 [He '16] trained on the CIFAR-10 dataset
    • Our method outperforms LID [bar chart: AUROC (%) against FGSM, BIM, DeepFool, and CW attacks]
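The metrics in the charts can be computed directly from in- and out-of-distribution confidence scores. A small sketch, assuming the convention that higher score means more in-distribution (the function names are illustrative):

```python
import numpy as np

def tnr_at_tpr95(in_scores, out_scores):
    """True negative rate on out-of-distribution data at the threshold
    where 95% of in-distribution samples are correctly accepted."""
    thresh = np.percentile(in_scores, 5)     # accept samples scoring >= thresh
    return float(np.mean(out_scores < thresh))

def auroc(in_scores, out_scores):
    """AUROC via the rank-sum (Mann-Whitney U) formulation.
    Assumes distinct scores; ties would need average ranks."""
    scores = np.concatenate([in_scores, out_scores])
    ranks = scores.argsort().argsort() + 1   # 1-based ranks
    n_in, n_out = len(in_scores), len(out_scores)
    u = ranks[:n_in].sum() - n_in * (n_in + 1) / 2
    return float(u / (n_in * n_out))
```

AUROC is threshold-free, while TNR at TPR 95% reports performance at one operating point, which is why both appear in the comparison.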

  14–16. Conclusion
  • Deep generative classifiers have been largely dismissed recently
    • Deep discriminative classifiers (e.g., the softmax classifier) typically outperform them in fully-supervised classification settings
  • We found that the (post-processed) deep generative classifier can outperform the softmax classifier across multiple tasks:
    • Detecting out-of-distribution samples
    • Detecting adversarial samples
  • Other contributions in our paper
    • More calibration techniques: input pre-processing, feature ensemble
    • More applications: class-incremental learning
    • More evaluations: robustness of our method
  • Poster session: Room 210 & 230 AB, #30. Thanks for your attention!
