Skip The Question You Don’t Know: An Embedding Space Approach

Kaiyuan Chen

Department of Computer Science University of California, Los Angeles Los Angeles, California 90095 Email: chenkaiyuan@ucla.edu

Jinghao Zhao

Department of Computer Science University of California, Los Angeles Los Angeles, California 90095 Email: jzhao@cs.ucla.edu

Abstract—Deep neural networks give us the power to generalize hidden patterns behind training data. However, due to limitations of available data collection methods, what a neural network learns should never be expected to cover all scenarios: predictions on samples that rarely appear in the training set will have very low accuracy. Thus, we design an end-to-end neural network that learns an inherent discriminative embedding on the training set to perform out-of-distribution (OOD) detection and classification at the same time: both OOD data points and points that resemble those with different labels can be visually observed in this embedding space. Based on this model, we also devise a training scheme that trains on only inliers. Experiments on various datasets and metrics validate that our method outperforms the state-of-the-art OOD detector.

Index Terms—Out-of-Distribution Detection, Machine Learning, Neural Network, Novelty Detection, Embedding

I. INTRODUCTION

We consider the following scenario: Alice is taking an exam. She encounters a multiple-choice question that she has never seen in her textbooks, so she has low expectations of answering it correctly. To avoid the extra cost of answering it wrong, she decides to skip it and keeps working on the questions she has practiced a lot. Machine learning models should do the same.

While machine learning algorithms have had great success making predictions on data points that appear frequently in the training set, they are troubled by test points that rarely occur in the training data. Zong et al. [1] and Xia et al. [2] have demonstrated that predictions on outliers have much lower accuracy than predictions on inliers. If we feed a wrong query to any state-of-the-art machine learning model, it will assign the data point to an undefined label, but the logical answer should simply be "I don't know". For example, in Figure 1, when we feed a hand-drawn cat to an MNIST classifier, the model should not "pretend" that it knows the answer. This feature becomes a requirement in serious settings such as medical treatment, as in Chen et al. [3]: when a patient has a new variation of a disease previously unknown to the training dataset, machine learning models should yield to human experts instead of "guessing" randomly. To empower machine learning models with the ability to "expect" and to "skip", previous works devise different Out-of-Distribution (OOD) detection schemes. For example,

Fig. 1. A traditional MNIST prediction model that pretends it knows the answer, but what it should return is simply "I don't know".

Masana et al. [4] identify examples that do not resemble the distribution seen in training. Popular anomaly detection methods, such as the probabilistic Gaussian Mixture Model (GMM) of Zong et al. [1] and Wu et al. [5], model the probability of occurrence, while deep autoencoders reconstruct the input under the assumption that anomalous data cannot be compressed. However, these approaches treat anomaly detection as a pre-processing step before feeding data into other classifiers. To integrate classification and anomaly detection, Hendrycks et al. [6] summarize two popular baselines: one treats the softmax score of the output layer directly as confidence, and the other makes use of an autoencoder, as in Xia et al. [2]. These methods have had great success at separating OOD examples. However, by Hendrycks et al. [6]'s empirical comparison, reconstruction-based autoencoders can attain much higher accuracy than softmax-based approaches; yet the reconstruction-based baseline does not jointly exploit reconstruction and classification. Zhang et al. [7] propose that an auxiliary unsupervised loss can help classification results. Inspired by this problem, we ask: could we build an end-to-end model that jointly performs out-of-distribution detection and classification?

As a result, we devise an architecture: besides the traditional reconstruction loss, we add another dimension, a clustering error on the embedding space. The architecture builds up a clustered embedding space and performs different tasks (classification and out-of-distribution detection) upon its embedding clusters. An intuitive sketch of our model can be found in Figure 2. We adopt a differentiable and autoencoder-friendly clustering loss from Deep Embedding Clustering (DEC) by Xie et al. [8]. As demonstrated by Guo et al. [9], such an architecture preserves the local structure of the data and clusters data points discriminatively; our experiments show its strong ability for classification and for generative reconstruction to detect out-of-distribution outliers. In addition, we can visually backtrack data points that our model classifies as "novel", which lets us determine whether a data point is out of the whole distribution or merely has stronger similarity to other clusters. This gives more interpretability than other autoencoder-based OOD detection methods. Our contributions in this work can be summarized as follows:

• We propose an end-to-end architecture with an associated loss function that jointly optimizes out-of-distribution detection and classification by learning a label-clustered embedding.

• We understand outliers by backtracking them visually in our embedding space, and devise a training process that dynamically removes outliers from the training set.

• We empirically compare our model with other OOD detection algorithms on various datasets mixed with OOD examples.

II. RELATED WORKS

In this section, we introduce recent progress on both softmax-based and reconstruction-based Out-of-Distribution detection. We also present fundamental ideas from Deep Embedding Clustering (DEC), which uses a deep autoencoder to build up an embedding space in an unsupervised way.

A. Out-of-Distribution Detection

Because deep neural networks lack interpretability and calibrated confidence estimates [9], Out-of-Distribution detection has drawn research attention in recent literature. Many approaches use the maximum value of the last activation layer, typically the softmax [6], as a confidence score to distinguish outliers: a low maximum softmax probability usually indicates that the data point is out of distribution. For example, Liang et al. [10] apply an anti-adversarial perturbation toward the class with maximum probability and increase the softmax temperature. DeVries et al. [11] slightly change the architecture by adding an auxiliary branch to the classifier and output the OOD score from that branch. These approaches are simple to deploy because they do not modify the original structure of the neural network; however, Hendrycks et al. [6] also show that reconstruction-based autoencoders can reach close to 100 percent accuracy on the datasets that softmax approaches are tested on.

A deep autoencoder is a feed-forward multi-layer neural network that nonlinearly maps high-dimensional data points into a much lower-dimensional space. We can thus write a deep autoencoder abstractly as a composition of two nonlinear functions $f_{EN}$ and $f_{DE}$. The reconstructed output for a data point $x$ is $x' = f_{DE}(f_{EN}(x))$, and the training procedure minimizes the difference between the reconstructed example and the example itself, namely

$$\min_{f_{DE},\, f_{EN}} \|f_{DE}(f_{EN}(x)) - x\|_2^2$$

Because data are mapped to a low-dimensional space, we assume anomalous data are not compressible and will therefore incur a large reconstruction error. Xia et al. [2] present strong empirical evidence for separating anomalous data points in an unsupervised way. Shah et al. [12] build upon the deep autoencoder and propose QSSAE, which combines a non-convex loss with a heavy-tailed distribution model to label the outliers.
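The reconstruction-error criterion can be sketched with a linear (PCA-style) autoencoder standing in for the deep autoencoder described above; the dimensions, the synthetic data, and the `reconstruction_error` helper are all illustrative, not the paper's actual model:

```python
import numpy as np

rng = np.random.default_rng(0)

# Inliers lie near a 2-D subspace of R^10; outliers are isotropic noise.
basis = rng.normal(size=(2, 10))
inliers = rng.normal(size=(500, 2)) @ basis
outliers = rng.normal(size=(20, 10)) * 5.0

# "Train" a linear autoencoder: f_EN projects onto the top-k principal
# directions of the inliers, f_DE maps back. Over linear maps this
# minimizes ||f_DE(f_EN(x)) - x||_2^2 (Eckart-Young theorem).
k = 2
_, _, vt = np.linalg.svd(inliers - inliers.mean(axis=0), full_matrices=False)
components = vt[:k]  # rows span the learned subspace

def reconstruction_error(x):
    """J_r = ||x - f_DE(f_EN(x))||_2^2 for each row of x."""
    centered = x - inliers.mean(axis=0)
    recon = centered @ components.T @ components
    return np.sum((centered - recon) ** 2, axis=1)

# Anomalous points are "incompressible": their error is much larger.
err_in = reconstruction_error(inliers)
err_out = reconstruction_error(outliers)
assert err_out.mean() > 10 * err_in.mean()
```

A deep autoencoder replaces the linear projection with nonlinear $f_{EN}$, $f_{DE}$, but the scoring rule, flagging points with large reconstruction error, is the same.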

Because OOD detection is usually treated as an auxiliary step rather than an end in itself, using an autoencoder alone cannot guarantee success on the classifier side: the low-dimensional space does not preserve label information, and directly using the embedding space for classification lowers accuracy. However, Zhang et al. [7] show that adding an auxiliary decoder as an unsupervised cost can sometimes increase classification performance. Based on this observation, Hendrycks et al. [6] propose an abnormality classifier, appended after the reconstruction decoder, that classifies in-distribution and out-of-distribution samples based on the reconstructed values. Unfortunately, it does not make use of the data points' label information.

B. Deep Embedding Clustering

Traditional clustering methods, e.g. the k-means algorithm, are comprehensively surveyed by Pimentel et al. [13]. A good embedding space gives good performance for the downstream computations of reconstruction and classification. Thus, Xie et al. [8] proposed a differentiable neural network optimization scheme called Deep Embedding Clustering (DEC). DEC learns an embedding space in an unsupervised way by optimizing toward a target distribution that improves cluster purity. After pre-training an autoencoder to reconstruct the original input, Xie et al. [8] discard the decoder and directly match the soft-assignment distribution $Q$ between embedded points and cluster centers to a target distribution $P$ under the Kullback–Leibler (KL) divergence:

$$L = KL(P \| Q) = \sum_i \sum_j p_{ij} \log \frac{p_{ij}}{q_{ij}}$$

Given the embedding space $Z = \{z_i\}_{i=1}^{n_1+n_2}$, they first initialize k-means clustering with centroids $\{\mu_j\}_{j=1}^{k}$, where $k$ is the number of clusters. They then compute a soft assignment of the embedded points to the clusters, using the Student's t-distribution kernel $Q = \{q_{ij}\}$ to measure the similarity between $z_i$ and centroid $\mu_j$, i.e.

$$q_{ij} = \frac{(1 + \|z_i - \mu_j\|^2/\alpha)^{-\frac{\alpha+1}{2}}}{\sum_{j'} (1 + \|z_i - \mu_{j'}\|^2/\alpha)^{-\frac{\alpha+1}{2}}}$$

As suggested by Maaten et al. [14], we set the degrees-of-freedom parameter $\alpha = 1$, since learning it is superfluous. Given the computed $q_{ij}$, the target distribution $P$ is defined as

$$p_{ij} = \frac{q_{ij}^2 / \sum_i q_{ij}}{\sum_{j'} q_{ij'}^2 / \sum_i q_{ij'}}$$
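The soft assignment and target distribution above can be sketched in a few lines of numpy; the function names are ours, and the toy data is illustrative:

```python
import numpy as np

def soft_assign(z, mu, alpha=1.0):
    """Student's t soft assignment q_ij between embeddings z (n, d)
    and centroids mu (k, d), with alpha degrees of freedom."""
    dist2 = np.sum((z[:, None, :] - mu[None, :, :]) ** 2, axis=2)
    q = (1.0 + dist2 / alpha) ** (-(alpha + 1.0) / 2.0)
    return q / q.sum(axis=1, keepdims=True)  # normalize over clusters

def target_distribution(q):
    """DEC target p_ij = (q_ij^2 / f_j) / sum_j' (q_ij'^2 / f_j'),
    where f_j = sum_i q_ij is the soft cluster frequency."""
    weight = q ** 2 / q.sum(axis=0)
    return weight / weight.sum(axis=1, keepdims=True)

rng = np.random.default_rng(1)
z = rng.normal(size=(6, 2))
mu = np.array([[0.0, 0.0], [3.0, 3.0]])
q = soft_assign(z, mu)
p = target_distribution(q)
# Both are row-stochastic distributions over clusters.
assert np.allclose(q.sum(axis=1), 1.0) and np.allclose(p.sum(axis=1), 1.0)
```

Squaring $q_{ij}$ and renormalizing by cluster frequency sharpens confident assignments while discounting large clusters, which is what improves cluster purity.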

Fig. 2. Our basic architecture, which learns an embedding space that can perform both anomaly detection and classification at the same time: the encoder maps input images into the embedding space, the decoder reconstructs them, and the output is "I don't know" when the reconstruction error exceeds ε, or the classifier's label (e.g. "2") otherwise.

Intuitively, the target distribution in equation II-B normalizes the frequency of each soft cluster to prevent distortion of the embedding space, and Xie et al. [8] justify the formulation through its partial derivatives with respect to each embedded point: points closer to the center of a cluster contribute more weight to the gradients.
III. METHODOLOGY

In this section, we describe the architecture and optimization procedure of our proposed approach.

A. Problem Formulation

Consider a dataset distribution $X_1$ with $n_1$ examples and associated labels $\{(x^1_1, y^1_1), \dots, (x^1_{n_1}, y^1_{n_1})\}$, and an anomaly distribution $X_2$ with $n_2 \ll n_1$ examples $\{(x^2_1, *), \dots, (x^2_{n_2}, *)\}$. We mark the labels of the anomaly distribution as $*$ since we do not want our model to assign any label to these data. Then for all $\epsilon, \delta \in [0, 1]$, a successful OOD algorithm $A$ with its classifier $C$ trained on $X_1 \cup X_2$ should, with probability at least $1 - \delta$, identify any $x$ such that $\mathbb{E}(\ell(C(x), y)) > \epsilon$, where $\mathbb{E}(\cdot)$ is the expectation and $\ell$ is the loss function. We define $\ell = 1$ whenever $y = *$.

B. Architecture

Following directly from the problem formulation, we aim to reduce both the number of times we attempt to classify anomalous data points from $X_2$ and the number of times the classifier makes mistakes. Because we use a deep autoencoder to reconstruct the original input in order to detect anomalous points, we need a reconstruction loss $J_r$. As discussed in Section II, directly optimizing the reconstruction loss together with the classification loss may leave dimension reduction and label separation unreconciled. Thus, we design an extra clustering penalty term $J_e$ on the embedding space. Along with the classification loss $J_c$, our loss function is

$$J = J_r + \lambda_1 J_e + \lambda_2 J_c$$

where $\lambda_1$ and $\lambda_2$ are constants. We define each of these loss functions as follows:

• Embedding loss: Since we already know the labels of all the data points, we define the center of each label cluster as

$$\mu'_j = \frac{1}{|\{i : y_i = j\}|} \sum_i \mathbb{1}(y_i = j)\, z_i$$

After initializing the embedding space by k-means, we use an interpolation between the previous soft cluster centers and the current true-label centers in our distance distribution, that is,

$$q_{ij} = \frac{(1 + \|z_i - (1-\alpha)\mu_j - \alpha\mu'_j\|^2)^{-1}}{\sum_{j'} (1 + \|z_i - (1-\alpha)\mu_{j'} - \alpha\mu'_{j'}\|^2)^{-1}}$$

where $\alpha$ is a regularizer constant that controls the balance between grouping data points with similar labels and grouping data points with similar features. We still use equation II-B for the target distribution. As a result, we define our embedding loss to be the KL divergence between the current distribution and the target distribution:

$$J_e = \sum_i \sum_j p_{ij} \log \frac{p_{ij}}{q_{ij}}$$
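The interpolated, label-aware soft assignment can be sketched as follows (a hypothetical numpy implementation; here `alpha` names the interpolation constant of the embedding loss, while the t-kernel's degrees of freedom stay fixed at 1 as in the formula above):

```python
import numpy as np

def label_centers(z, y, k):
    """mu'_j: mean embedding of the points whose true label is j."""
    return np.stack([z[y == j].mean(axis=0) for j in range(k)])

def supervised_soft_assign(z, mu, mu_label, alpha=0.5):
    """q_ij with centers interpolated between the k-means centroids mu
    and the true-label centers mu_label, as in the embedding loss."""
    centers = (1.0 - alpha) * mu + alpha * mu_label
    dist2 = np.sum((z[:, None, :] - centers[None, :, :]) ** 2, axis=2)
    q = 1.0 / (1.0 + dist2)  # t-kernel with one degree of freedom
    return q / q.sum(axis=1, keepdims=True)

rng = np.random.default_rng(2)
z = np.concatenate([rng.normal(0.0, 0.3, size=(10, 2)),
                    rng.normal(4.0, 0.3, size=(10, 2))])
y = np.array([0] * 10 + [1] * 10)
mu = np.array([[0.5, 0.5], [3.5, 3.5]])  # imperfect k-means centroids
q = supervised_soft_assign(z, mu, label_centers(z, y, 2))
# Points are softly assigned mostly to their own label's cluster.
assert (q.argmax(axis=1) == y).all()
```

Pulling the centers toward the true-label means is what injects supervision into an otherwise unsupervised DEC-style assignment.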

• Reconstruction loss: Using the fact that anomalous data cannot be compressed by a denoising autoencoder, per DeVries et al. [11], we compute the $\ell_2$ distance between the original input $x$ and the reconstruction $f_{DE}(f_{EN}(x))$, where $f_{DE}$ is the decoding function and $f_{EN}$ is the encoding function, i.e.

$$J_r = \|x - f_{DE}(f_{EN}(x))\|_2^2$$

Although Xie et al. [8]'s DEC discards the decoder after pre-training an autoencoder, Guo et al. [9] improve DEC with Improved Deep Embedding Clustering (IDEC) by retaining the reconstruction loss, because this loss helps preserve the local structure of the data-generating distribution instead of distorting the embedded space.

• Classification loss: We use a classification loss to penalize classification errors. Although the main purpose of this penalty term is to map data points to labels, by Hershey et al. [15] the classification loss can also be used to enlarge the distance

Fig. 3. Architecture: the encoder maps the input image into the embedding space, from which the decoder produces the reconstructed image and the classifier produces the classification.

between clusters and shrink the distance within clusters. In this case, we use categorical cross-entropy, which computes the cross entropy against the true labels after a softmax function.
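Categorical cross-entropy after a softmax can be sketched directly (an illustrative numpy version, not the paper's implementation):

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over the class axis."""
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def categorical_cross_entropy(logits, y):
    """J_c: mean negative log-probability of the true labels
    after a softmax function."""
    probs = softmax(logits)
    return -np.mean(np.log(probs[np.arange(len(y)), y]))

logits = np.array([[4.0, 0.1, 0.1],
                   [0.2, 3.0, 0.1]])
y = np.array([0, 1])
good = categorical_cross_entropy(logits, y)
bad = categorical_cross_entropy(logits, np.array([2, 2]))
assert good < bad  # confident correct predictions give lower loss
```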

C. Optimization on embedding loss

We optimize equation III-B by stochastic gradient descent (SGD) with learning rate 0.01. We also set $\lambda_1$ and $\lambda_2$ to be balanced throughout the optimization. When we train on a batch of size $n$, we follow a derivation similar to Xie et al. [8] for the gradients with respect to the embedded points and the cluster centers:

$$\frac{\partial J_e}{\partial z_i} = \frac{2}{n} \sum_{j=1}^{K} \left(1 + \|z_i - (1-\alpha)\mu_j - \alpha\mu'_j\|^2\right)^{-1} (p_{ij} - q_{ij}) \left(z_i - (1-\alpha)\mu_j - \alpha\mu'_j\right)$$

and

$$\frac{\partial J_e}{\partial \mu_j} = -\frac{2}{n} \sum_{i=1}^{n} \left(1 + \|z_i - (1-\alpha)\mu_j - \alpha\mu'_j\|^2\right)^{-1} (p_{ij} - q_{ij}) \left(z_i - (1-\alpha)\mu_j - \alpha\mu'_j\right)$$

and the softly assigned cluster centers then follow directly from gradient descent,

$$\mu_j = \mu_j - \frac{\eta}{n} \sum_{i=1}^{n} \frac{\partial J_e}{\partial \mu_j}$$

where $\eta$ is the learning rate.

D. Training

One important observation is that when we pretrain an autoencoder to get an initial embedding, we can use the same autoencoder to perform primitive OOD detection. Using an autoencoder to separate out-of-distribution points from the original data in an unsupervised way during training has been shown to work well by Xia et al. [2]. Concretely, we pretrain our autoencoder with part of the clean MNIST dataset.

As a result of this observation, after pretraining the reconstruction part of the network with some clean data, we can simply sift away the points with large reconstruction loss while jointly training the classifier and the reconstruction network: given an assumed threshold $\epsilon$, we filter out the top $\epsilon\%$ of points and train on the rest. Since we keep updating the cluster centers and the target distribution, data points close to the threshold can dynamically move from one side to the other; thus even if some in-distribution points were marked as OOD in a previous iteration, the model can still converge to incorporate them. To summarize, we have Algorithm 1.

Algorithm 1: OOD during training
Input: training dataset X with label vector y; update interval T; abnormal proportion ε
1: Initialize clusters by equation (III-B)
2: Initialize training set X′ = X, labels y′ = y
3: Pretrain an autoencoder with loss function (III-B) on part of the clean dataset
4: for each update interval do
5:   Reconstruction, classification, embedding ← model prediction
6:   Compute the reconstruction error by equation (III-B) and let the index set 𝟙 = {top ε% largest reconstruction error} (assuming the dataset contains outliers)
7:   X′ = X − X𝟙
8:   y′ = y − y𝟙
9:   z′ = z − z𝟙
10:  Compute the target distribution P by equations (III-B) and (II-B) from z′
11:  for i ∈ 1...T do
12:    Update the decoder f_DE, the encoder f_EN, and the classifier C with X′, y′, and the target distribution P

Through this training scheme, even if the whole dataset is not clean, all we need is a small clean subset of the data; the classifier is then unaffected by the contamination during training, which yields higher accuracy.
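The sifting step of Algorithm 1 (steps 6–9) can be sketched as a plain filtering function; the toy data and the `eps_percent` interface are illustrative:

```python
import numpy as np

def filter_outliers(X, y, recon_error, eps_percent):
    """One sifting step of the training loop: drop the top eps_percent
    of points by reconstruction error and keep the rest for training."""
    n_drop = int(len(X) * eps_percent / 100.0)
    if n_drop == 0:
        return X, y
    drop_idx = np.argsort(recon_error)[-n_drop:]  # largest errors
    keep = np.ones(len(X), dtype=bool)
    keep[drop_idx] = False
    return X[keep], y[keep]

rng = np.random.default_rng(3)
X = rng.normal(size=(100, 8))
y = rng.integers(0, 10, size=100)
err = rng.random(100)
err[:5] = 10.0  # five contaminated points with huge reconstruction error
X_clean, y_clean = filter_outliers(X, y, err, eps_percent=5)
assert len(X_clean) == 95
```

In the full scheme this filter is re-applied every update interval with freshly computed errors, which is what lets borderline points migrate back into the training set.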

IV. EXPERIMENT

A. Datasets

To evaluate the effectiveness of classification and out-of-distribution detection, we replicate the experimental setting introduced by Hendrycks et al. [6] and DeVries et al. [11], which separates the data into an in-distribution dataset and an out-of-distribution dataset.

1) In-Distribution Datasets:
a) MNIST: MNIST [16] is a dataset of handwritten digits with 60000 training images and 10000 testing images. Each image is a grey-scale 28×28 matrix belonging to one of the 10 digit classes.
b) CIFAR-10: CIFAR-10 [17] contains colored images of size 32×32. Each image belongs to one of ten classes such as dog, cat, or car. The training set has 50000 images and the testing set has 10000 images.


2) Out-of-Distribution Datasets: To compare with other state-of-the-art OOD detection methods, we use the same setting as the baseline proposed by Hendrycks et al. [6].
a) iSUN: The iSUN dataset [18] contains 8925 scene images from 10 scene categories. All images come from the SUN dataset and are downsampled to size 32×32; we use it as the OOD dataset for CIFAR-10.
b) Omniglot: The Omniglot dataset [19] contains handwritten characters rather than the digits of MNIST. We use 10000 28×28 greyscale characters as the OOD dataset for MNIST.
c) notMNIST: This dataset [20] contains 10000 28×28 images of the letters A to J in various typefaces. We use it as the OOD dataset for MNIST.
d) CIFAR10-bw: This dataset is rescaled from CIFAR-10. It contains greyscale images of size 28×28, used as the OOD dataset for MNIST.
e) Noise: This dataset contains 10000 32×32 RGB or 28×28 greyscale images (depending on the paired dataset). Each pixel is sampled i.i.d. from a uniform or Gaussian distribution, and labels are assigned randomly from the 10 classes.
f) MNIST-{x}: This dataset is generated from the original MNIST dataset by removing the points with label x. For example, MNIST-{0,1} means we use the data points with labels 2, 3, 4, 5, 6, 7, 8, 9 as in-samples and the points with labels 0 and 1 as outliers.
B. Evaluation Metrics

We measure the quality of out-of-distribution detection using the metrics defined by Hendrycks et al. [6]. Because choosing a novelty detection threshold on the novelty score implies a trade-off between false negatives and false positives, the following metrics show the performance from different angles.
a) AUROC: The Area Under the Receiver Operating Characteristic (AUROC) curve measures the area under the ROC curve. It equals the probability that the OOD detector will rank a randomly chosen positive sample higher than a negative one. A 100% AUROC score means the detector perfectly distinguishes positive and negative examples.
b) AUPR-In: The Area Under the Precision-Recall (AUPR) curve measures the area under the precision-recall curve of out-of-distribution detection. The precision-recall curve plots precision (TP/(TP+FP)) as a function of recall (TP/(TP+FN)). A 100% AUPR score means the classifier perfectly distinguishes positive and negative examples. AUPR-In is the AUPR score when we treat the in-distribution images as positive samples.
c) AUPR-Out: Analogously, AUPR-Out is the AUPR score when we treat the out-of-distribution images as positive samples.
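AUROC's probabilistic interpretation (the chance that a random positive outscores a random negative, with ties counting one half) can be computed exactly for small score sets; this sketch assumes the reconstruction error is used as the novelty score:

```python
import numpy as np

def auroc(scores_pos, scores_neg):
    """P(random positive scores higher than random negative),
    computed exactly by pairwise comparison; ties count 1/2."""
    diff = scores_pos[:, None] - scores_neg[None, :]
    return (diff > 0).mean() + 0.5 * (diff == 0).mean()

# OOD examples as positives, with reconstruction error as the score.
pos = np.array([5.0, 4.0, 3.0])  # outliers: large errors
neg = np.array([1.0, 0.5, 3.0])  # inliers: small errors
score = auroc(pos, neg)
assert abs(score - 17 / 18) < 1e-9  # 8 strict wins + 1 tie out of 9 pairs
```

For large test sets one would compute the same quantity from sorted ranks rather than the quadratic pairwise matrix, but the definition is identical.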

C. Experiment Results

In this section, we compare our model's novelty detection performance with state-of-the-art novelty detection methods.

TABLE I
THE EVALUATION OF IN- AND OUT-OF-DISTRIBUTION DETECTION FOR THE DATASETS IN THE BASELINE. EACH VALUE CELL IS IN "BASELINE (HENDRYCKS ET AL. [6]) / OUR METHOD" FORMAT; THE BOLD TEXT IN THE ORIGINAL INDICATES BETTER NOVELTY DETECTION PERFORMANCE.

In-Distribution / Out-of-Distribution | AUROC       | AUPR-In     | AUPR-Out
CIFAR-10 / SUN                        | 99.99/100   | 99.95/100   | 99.04/100
CIFAR-10 / Gaussian                   | 100/100     | 100/100     | 99.24/100
MNIST / Omniglot                      | 99.45/99.50 | 99.49/99.45 | 99.40/99.38
MNIST / notMNIST                      | 100/100     | 100/100     | 99.97/100
MNIST / CIFAR-10bw                    | 99.97/100   | 99.97/100   | 99.97/100
MNIST / Gaussian                      | 100/100     | 100/100     | 100/100
MNIST / Uniform                       | 100/100     | 100/100     | 100/100

TABLE II
THE EVALUATION OF IN- AND OUT-OF-DISTRIBUTION DETECTION FOR MNIST-{x}. EACH VALUE CELL IS IN "BASELINE / OUR METHOD" FORMAT; THE BOLD TEXT IN THE ORIGINAL INDICATES BETTER NOVELTY DETECTION PERFORMANCE.

In-Distribution / Out-of-Distribution | AUROC       | AUPR-In     | AUPR-Out
MNIST' / MNIST-{0,1}                  | 93.57/94.65 | 98.10/98.16 | 75.98/89.10
MNIST' / MNIST-{2,3}                  | 90.46/98.85 | 96.44/99.72 | 75.48/95.18
MNIST' / MNIST-{4,5}                  | 92.82/96.78 | 98.25/99.24 | 72.98/86.88
MNIST' / MNIST-{6,7}                  | 95.37/95.49 | 98.76/98.81 | 82.90/84.13
MNIST' / MNIST-{8,9}                  | 94.63/95.52 | 98.69/98.86 | 75.19/82.68

Our results show that our method achieves higher novelty detection rates and better interpretability at the same time. To the best of the authors' knowledge, current state-of-the-art softmax-based approaches have not surpassed the benchmark set by reconstruction-based approaches, so we compare directly with the reconstruction-based OOD detection autoencoder module proposed by Hendrycks et al. [6]. Using the datasets described in Section IV-A, our model passes the baseline on all benchmarks in which CIFAR-10 and MNIST are mixed with samples from other datasets, as shown in Table I. However, the baseline algorithm performs poorly on the MNIST-{x} datasets. This result makes sense, since its anomaly detection module only looks at the reconstruction and classification entropy and fails to consider the importance of discriminative labels in the encoded embedding space. The comparison is shown in Table II.

This observation is important because the difference between the MNIST dataset and the other out-sample datasets is large in terms of digit position, greyscale level, and intrinsic structure. In practical usage, however, the "out" samples are usually drawn from a distribution close to that of the training set. As a result, we claim that examples from other datasets are not as challenging as identifying that a new label exists. In this harder experiment, our model still demonstrates decent results.

Fig. 4. Comparison of the embedding spaces of (a) a model that only uses an autoencoder, (b) a model that uses an autoencoder and a classifier, and (c) our model (autoencoder + classifier + cluster loss). Our model shows better clustering and classification performance.

D. Embedding Visualization

One of the main features of our constructed embedding space is that we can backtrace a single data point to see where it lies in the embedding space. This allows us to analyze and diagnose: 1) whether the point is out-of-distribution with respect to the whole dataset, e.g. how an OOD example behaves and why our model classifies it as OOD; and 2) whether the point is misclassified because it is out-of-distribution with respect to the label cluster it belongs to, e.g. someone writes a one that looks like a two, as illustrated in Figure 5. This helps us understand why some samples are classified as OOD examples.

The cluster penalization term in our method makes the distributions of normal samples at the embedding layer more representative. When an outlier arrives, the difference in distribution at the embedding layer yields a higher reconstruction error, which helps distinguish the OOD samples. To illustrate the importance of the clustering loss, Figure 4 compares the embedding spaces of a model that only uses an autoencoder, a model that uses both an autoencoder and a classifier, and our model. We use the MNIST-{0,1} dataset as an illustration: we first train all models with the same set of hyperparameters, then evaluate OOD performance by treating all 0 and 1 examples as outliers. To better show generalization, we use the testing dataset to draw the embedding spaces.

As we can observe, the autoencoder's embedding space already has some cluster structure. But since the autoencoder's objective is simply to reduce dimensionality, it groups similar points together without taking labels into account; consequently, this embedding space has poor discriminative power. We then compare the embedding space of our model with that of the model combining an autoencoder and a classifier. The model without the clustering loss separates clusters 4 and 9 poorly, and the inter-cluster distance between 3 and 5 is also very small. In contrast, our model has a much cleaner embedding space. The OOD samples with labels 0 and 1 are well separated from the other clusters, and the distances between different label clusters are much larger than in the previous two models. As a result, our model gives outliers a much higher reconstruction loss.

Fig. 5. Three types of samples and their positions in the embedding space: (a) out-of-distribution samples, (b) correctly classified in-distribution samples, (c) misclassified in-distribution samples.

V. DISCUSSION

In this work, we take a different and novel approach to OOD detection from previous works. Instead of focusing heavily on interpreting the values of softmax activation functions, we maintain a well-organized embedding space and perform all remaining functions, such as classification and OOD detection, on top of it. As shown in the embedding-space visualizations of the previous section, detecting out-of-distribution examples and correctly classifying data points on this clean embedding space is not challenging for a machine learning model, while performing the same task on other spaces, such as using only an autoencoder or using both an autoencoder and a classifier, requires much more expressive power and risks overfitting. Because of the cleaner embedding space, we can perform more tasks than other state-of-the-art algorithms: classification, reconstruction of the original example from the low-dimensional space, and visualization and diagnosis of misclassified and OOD examples. We compare with other state-of-the-art models in Table III.

TABLE III
THE COMPARISON WITH OTHER STATE-OF-THE-ART MODELS (ODIN [10], DEC [8], Shaol et al. [21], DeVries et al. [11], Hendrycks et al. [6], AND OUR MODEL) ALONG FOUR PROPERTIES: OOD DETECTION, JOINT OPTIMIZATION FOR CLASSIFICATION, CLUSTERING, AND INTERPRETABLE EMBEDDING SPACE. A CHECK MEANS THE MODEL HAS THE CORRESPONDING PROPERTY, AND "O" MEANS THE MODEL PARTIALLY HAS IT.
One important future direction is the OOD scoring metric on this embedding space. In this paper, we simply select the anomaly threshold based on the reconstruction loss $J_r$, i.e. a point is an OOD example if $J_r > \epsilon$. However, the embedding space can offer much more insight. Because it is already well clustered, for a data point we can use another metric, $J(x) = J_r(x) + \lambda\, \ell(x, \mu_i)$, where $\ell$ is a distance metric, $\lambda$ is a hyperparameter, and $\mu_i$ is the cluster center the point belongs to. The latter distance term is common in anomaly detection algorithms; for example, the k-means algorithm measures a point's novelty by its distance to the cluster centers. The embedding space naturally inherits this property, and the metric has great potential to be incorporated into the current scheme. However, in order to streamline our proposed approach, we leave this evaluation criterion as future work.
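The proposed combined metric could be sketched as follows, under the assumption that each point is assigned to its nearest cluster center and that $\ell$ is the Euclidean distance; `lam` and the toy data are illustrative:

```python
import numpy as np

def combined_novelty_score(recon_error, z, mu, lam=0.5):
    """J(x) = J_r(x) + lambda * ||z - mu_i||, where mu_i is taken
    to be the nearest cluster center in the embedding space."""
    dists = np.linalg.norm(z[:, None, :] - mu[None, :, :], axis=2)
    nearest = dists.min(axis=1)  # distance to the assigned center
    return recon_error + lam * nearest

mu = np.array([[0.0, 0.0], [5.0, 5.0]])
z = np.array([[0.1, 0.0],    # inlier, well inside a cluster
              [2.5, 2.5]])   # between clusters: suspicious embedding
recon = np.array([0.2, 0.2]) # identical reconstruction errors
scores = combined_novelty_score(recon, z, mu)
assert scores[1] > scores[0]  # the between-cluster point scores higher
```

The example shows the intended benefit: two points with the same reconstruction error are ranked differently once distance to the cluster structure is taken into account.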

VI. CONCLUSION

In this paper, we propose an end-to-end neural network that learns an inherent discriminative embedding to jointly perform out-of-distribution detection and classification. Through experiments, our model outperforms the state-of-the-art OOD detector on all metrics. We also find that the original goals of OOD detection in most state-of-the-art work are not as challenging as removing points with certain labels as out-samples. Furthermore, our model has better interpretability than other methods through visualization of the embedding space. Based on our model, we also devise a training scheme that trains on only inliers, which helps the classifier train more accurately on datasets that are not finely labelled or are contaminated by noise.

VII. ACKNOWLEDGEMENT

We would like to thank Quanquan Gu for constructive discussions.

REFERENCES

[1] B. Zong, Q. Song, M. R. Min, W. Cheng, C. Lumezanu, D. Cho, and H. Chen, "Deep autoencoding gaussian mixture model for unsupervised anomaly detection," 2018.
[2] Y. Xia, X. Cao, F. Wen, G. Hua, and J. Sun, "Learning discriminative reconstructions for unsupervised outlier removal," in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1511–1519.
[3] K. Chen, J. Shen, and F. Scalzo, "Skull stripping using confidence segmentation convolution neural network," in International Symposium on Visual Computing. Springer, 2018, pp. 15–24.
[4] M. Masana, I. Ruiz, J. Serrat, J. van de Weijer, and A. M. Lopez, "Metric learning for novelty and anomaly detection," arXiv preprint arXiv:1808.05492, 2018.
[5] W. Wu, Y. Zheng, K. Chen, X. Wang, and N. Cao, "A visual analytics approach for equipment condition monitoring in smart factories of process industry," in Pacific Visualization Symposium (PacificVis), 2018 IEEE. IEEE, 2018, pp. 140–149.
[6] D. Hendrycks and K. Gimpel, "A baseline for detecting misclassified and out-of-distribution examples in neural networks," arXiv preprint arXiv:1610.02136, 2016.
[7] Y. Zhang, K. Lee, and H. Lee, "Augmenting supervised neural networks with unsupervised objectives for large-scale image classification," in International Conference on Machine Learning, 2016, pp. 612–621.
[8] J. Xie, R. Girshick, and A. Farhadi, "Unsupervised deep embedding for clustering analysis," in International Conference on Machine Learning, 2016, pp. 478–487.
[9] X. Guo, L. Gao, X. Liu, and J. Yin, "Improved deep embedded clustering with local structure preservation," in International Joint Conference on Artificial Intelligence (IJCAI-17), 2017, pp. 1753–1759.
[10] S. Liang, Y. Li, and R. Srikant, "Enhancing the reliability of out-of-distribution image detection in neural networks," arXiv preprint arXiv:1706.02690, 2017.
[11] T. DeVries and G. W. Taylor, "Learning confidence for out-of-distribution detection in neural networks," arXiv preprint arXiv:1802.04865, 2018.
[12] M. P. Shah, S. Merchant, and S. P. Awate, "Abnormality detection using deep neural networks with robust quasi-norm autoencoding and semi-supervised learning," in Biomedical Imaging (ISBI 2018), 2018 IEEE 15th International Symposium on. IEEE, 2018, pp. 568–572.
[13] M. A. Pimentel, D. A. Clifton, L. Clifton, and L. Tarassenko, "A review of novelty detection," Signal Processing, vol. 99, pp. 215–249, 2014.
[14] L. Maaten, "Learning a parametric embedding by preserving local structure," in Artificial Intelligence and Statistics, 2009, pp. 384–391.
[15] J. R. Hershey, Z. Chen, J. Le Roux, and S. Watanabe, "Deep clustering: Discriminative embeddings for segmentation and separation," in Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on. IEEE, 2016, pp. 31–35.
[16] L. Deng, "The MNIST database of handwritten digit images for machine learning research [best of the web]," IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 141–142, 2012.
[17] A. Krizhevsky, "Learning multiple layers of features from tiny images," 2009.
[18] J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba, "SUN database: Large-scale scene recognition from abbey to zoo," in Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2010, pp. 3485–3492.
[19] B. M. Lake, R. Salakhutdinov, and J. B. Tenenbaum, "Human-level concept learning through probabilistic program induction," Science, vol. 350, no. 6266, pp. 1332–1338, 2015.
[20] Y. Bulatov, "notMNIST dataset," Tech. Rep., 2011. [Online]. Available: http://yaroslavvb.blogspot.it/2011/09/notmnist-dataset.html
[21] X. Shaol, K. Ge, H. Su, L. Luo, B. Peng, and D. Li, "Deep discriminative clustering network," 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–7, 2018. [Online]. Available: https://ieeexplore.ieee.org/document/8489417/