On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference
Yonatan Belinkov*, Adam Poliak*, Benjamin Van Durme, Stuart Shieber, Alexander Rush
*SEM, Minneapolis, MN June 7, 2019
On Adversarial Removal of Hypothesis-only Bias in Natural Language - - PowerPoint PPT Presentation
On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference Yonatan Belinkov * , Adam Poliak*, Benjamin Van Durme, Stuart Shieber, Alexander Rush *SEM, Minneapolis, MN June 7, 2019 Co-Authors Yonatan Belinkov Adam Poliak
Yonatan Belinkov*, Adam Poliak*, Benjamin Van Durme, Stuart Shieber, Alexander Rush
*SEM, Minneapolis, MN June 7, 2019
Alexander Rush Stuart Shieber Adam Poliak Benjamin Van Durme
Co-Authors
Yonatan Belinkov
Background
Natural Language Inference
Premise: The brown cat ran Hypothesis: The animal moved
4
Natural Language Inference
Premise: The brown cat ran Hypothesis: The animal moved entailment neutral contradiction
5
Natural Language Inference
Premise: The brown cat ran Hypothesis: The animal moved
6
entailment neutral contradiction
Natural Language Inference
Premise: The brown cat ran Hypothesis: The animal moved
7
entailment neutral contradiction
Natural Language Inference
Premise: The brown cat ran Hypothesis: The animal moved
8
entailment neutral contradiction
*SEM 2018
Hypothesis Only NLI
10
Hypothesis Only NLI
Hypothesis: A woman is sleeping
11
Hypothesis Only NLI
Hypothesis: A woman is sleeping Premise:
12
Hypothesis Only NLI
Hypothesis: A woman is sleeping Premise:
13
entailment neutral contradiction
Hypothesis Only NLI
Hypothesis: A woman is sleeping Premise:
14
entailment neutral contradiction
SNLI Results
15
A woman is sleeping
16
Hypothesis: A woman is sleeping Premises:
17
Hypothesis: A woman is sleeping Premises: A woman sings a song while playing piano
18
Hypothesis: A woman is sleeping Premises: This woman is laughing at her baby shower
19
Hypothesis: A woman is sleeping Premises: A woman with glasses is playing jenga
20
21
Studies in eliciting norming data are prone to repeated responses across subjects
22
(see McRae et al. (2005) and discussion in §2 of Zhang et. al. (2017)’s Ordinal Common-sense Inference)
Problem:
Hypothesis-only biases mean that models may not learn the true relationship between premise and hypothesis
23
24
Strategies for dealing with dataset biases
○ $$$ ○ More bias
Strategies for dealing with dataset biases
○ $$$ ○ More bias
○ Hard to scale ○ May still have biases (see SWAG → BERT → HellaSWAG)
Strategies for dealing with dataset biases
○ $$$ ○ More bias
○ Hard to scale ○ May still have biases (see SWAG → BERT → HellaSWAG)
○ Not all bias is bad ○ Biased datasets may have other useful information
Design architectures that facilitate learning less biased representations
NLI Model Components
g – classifier f - encoder p h
Baseline NLI Model
p h
Method 1 –
p h
Method 1 –
p h
Method 1 –
Reverse gradients: Penalize hypothesis encoder if classifier does well
p h
Method 2 –
p h
Method 2 –
Perturb training examples
hypothesis encoder
p’ h
Degradation in domain
Degradation in domain
Hidden biases - Adversarial Classifier
Hidden biases - Adversarial Classifier
Hidden biases - Adversarial Classifier
Hidden biases - Adversarial Data
Hidden biases - Adversarial Data
Indicator Words
Poliak et al (*SEM 2018) Gururangan et al (*NAACL 2018)
Decrease in correlation with contradiction
Relative improvement when predicting contradiction
ACL 2019
Method 1 –
Method 2 –
Conclusions
biases in NLI
comprehension, visual question answering, etc.
SiVL 2019
Conclusions
biases in NLI
○ Not all bias may be removed ○ The goal matters: some bias may be helpful in certain scenarios