On Adversarial Removal of Hypothesis-only Bias in Natural Language - PowerPoint PPT Presentation

On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference Yonatan Belinkov * , Adam Poliak*, Benjamin Van Durme, Stuart Shieber, Alexander Rush *SEM, Minneapolis, MN June 7, 2019

Co-Authors Yonatan Belinkov Adam Poliak Benjamin Van Durme Alexander Rush Stuart Shieber

Background

Natural Language Inference Premise: The brown cat ran Hypothesis: The animal moved 4

Natural Language Inference Premise: The brown cat ran Hypothesis: The animal moved entailment neutral contradiction 5

*SEM 2018

Hypothesis Only NLI 10

Hypothesis Only NLI Hypothesis: A woman is sleeping 11

Hypothesis Only NLI Premise: Hypothesis: A woman is sleeping 12

Hypothesis Only NLI Premise: Hypothesis: A woman is sleeping entailment neutral contradiction 13

Hypothesis Only NLI Premise: Hypothesis: A woman is sleeping entailment neutral contradiction 14

SNLI Results 15

A woman is sleeping 16

Premises: Hypothesis: A woman is sleeping 17

Premises: A woman sings a song while playing piano Hypothesis: A woman is sleeping 18

Premises: This woman is laughing at her baby shower Hypothesis: A woman is sleeping 19

Premises: A woman with glasses is playing jenga Hypothesis: A woman is sleeping 20

Why is she sleeping? 21

Studies in eliciting norming data are prone to repeated responses across subjects (see McRae et al. (2005) and discussion in §2 of Zhang et. al. (2017)’s Ordinal Common-sense Inference) 22

Problem: Hypothesis-only biases mean that models may not learn the true relationship between premise and hypothesis 23

How to handle such biases? 24

Strategies for dealing with dataset biases ● Construct new datasets (Sharma et al. 2018) ○ $$$ ○ More bias

Strategies for dealing with dataset biases ● Construct new datasets (Sharma et al. 2018) ○ $$$ ○ More bias ● Filter “easy” examples (Gururangan et al. 2018) ○ Hard to scale ○ May still have biases (see SWAG → BERT → HellaSWAG)

Strategies for dealing with dataset biases ● Construct new datasets (Sharma et al. 2018) ○ $$$ ○ More bias ● Filter “easy” examples (Gururangan et al. 2018) ○ Hard to scale ○ May still have biases (see SWAG → BERT → HellaSWAG) ● Forgo datasets with known biases ○ Not all bias is bad ○ Biased datasets may have other useful information

Our solution: Design architectures that facilitate learning less biased representations

Adversarial Learning to the Rescue

NLI Model Components g – classifier f - encoder p h

Baseline NLI Model p h

Method 1 – Adv. Hypothesis-Only Classifier p h

Method 1 – Adv. Hypothesis-Only Classifier Reverse gradients: Penalize hypothesis encoder if classifier p h does well

Method 2 – Adv. Training Examples p h

Method 2 – Adv. Training Examples Perturb training examples ● Randomly swap premises ● Reverse gradients into hypothesis encoder p’ h

Results & Analysis

What happens to model performance?

Degradation in domain

Are biases removed?

Hidden biases - Adversarial Classifier

Hidden biases - Adversarial Data

What happens to specific biases?

Indicator Words Gururangan et al (*NAACL 2018) Poliak et al (*SEM 2018)

Decrease in correlation with contradiction Relative improvement when predicting contradiction

What is this good for?

Are less biased models more transferable?

ACL 2019

Method 1 – Adv. Hypothesis-Only Classifier

Method 2 – Adv. Training Examples

Conclusions ● Adversarial learning may help combat hypothesis-side biases in NLI ● Applicable to other tasks with one-sided biases: reading comprehension, visual question answering, etc.

SiVL 2019

Conclusions ● Adversarial learning may help combat hypothesis-side biases in NLI ● Applicable to other tasks with one-sided biases ● May reduce the amount of bias and improve transferability ● But, the methods should be handled with care ○ Not all bias may be removed ○ The goal matters: some bias may be helpful in certain scenarios ● Acknowledgements

On Adversarial Removal of Hypothesis-only Bias in Natural Language - PowerPoint PPT Presentation

On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference Yonatan Belinkov * , Adam Poliak, Benjamin Van Durme, Stuart Shieber, Alexander Rush SEM, Minneapolis, MN June 7, 2019 Co-Authors Yonatan Belinkov Adam Poliak

BIAS What Is Bias? Bias can be defined as favoring one side, position, or belief being

Variable selection bias Bias in Ensemble Bias in Ensemble Methods Methods Variable selection

Evaluation and Bias Removal of Evaluation and Bias Removal of Multi- -Look Effect on Look

BIAS BIAS LIGHT LIGHT & & MEDIUM MEDIUM TR TRUCK UCK TIRES TIRES Bias Bias Ligh

Metals and Ammonia Metals and Ammonia Removal from Wastewaters Removal from Wastewaters Removal

Expectancy bias and Bias and forensic evidence Bias and speech research forensic speech

Publication bias in QCA Publication bias in QCA Publication bias in QCA Meaning, diagnosis and

Cluster Validity Hypothesis Random Graph Hypothesis Random Label Hypothesis Relative Criteria

ENEE416 9/29/2011 Wet Etch Etch: removal of material from wafer (e.g. removal silicon

IDORA MILL/BEAVER CREEK MINE TAILING REMEDIATION Idora Mill Pre Removal Mill Site Pre

Equity & Excellence: Hidden Bias Implicit Bias Inherent Bias

Bias in, Bias out: Gender Equality and the Fourth Industrial Revolution Debra Howcroft and

Review Selection bias, overfitting Bias v. variance v. residual Bias-variance tradeoff

go to the source The Media Bias Chart The Media Bias Chart A new taxonomy for discussing the

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

Making Generative Classifiers Robust to Selection Bias Andrew Smith Charles Elkan November

Towards Robust Natural Language Understanding Group 3 Shengshuo L, Xuhui Z, Zeyu L, Xinyi W,

FAST UNCERTAINTY ESTIMATES AND BAYESIAN MODEL AVERAGING OF DNNS WESLEY MADDOX JOINT WORK WITH

Staying Well and Achieving Goals Piper S. Meyer-Kalos, Ph.D. Susan Gingerich, MSW Delbert

Graphs and Markov chains Graphs as matrices 0 1 2 3 4 If there is an edge (arrow) from node

Lube : Mitigating Bottlenecks in Hao Wang* Wide Area Data Analytics Baochun Li i Qua Wide Area

Moving tables across clusters Scaling a high traffic database Nice to meet you! Developer on

ASPE Training: Professional Development Days 1 Who is ASPE? We provide innovative, custom

The Express Elevator to Success A Program Starter Kit Who We Are Karen Brown Associate

On Adversarial Removal of Hypothesis-only Bias in Natural Language - PowerPoint PPT Presentation

On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference Yonatan Belinkov * , Adam Poliak*, Benjamin Van Durme, Stuart Shieber, Alexander Rush *SEM, Minneapolis, MN June 7, 2019 Co-Authors Yonatan Belinkov Adam Poliak

BIAS What Is Bias? Bias can be defined as favoring one side, position, or belief being

Variable selection bias Bias in Ensemble Bias in Ensemble Methods Methods Variable selection

Evaluation and Bias Removal of Evaluation and Bias Removal of Multi- -Look Effect on Look

BIAS BIAS LIGHT LIGHT &amp; &amp; MEDIUM MEDIUM TR TRUCK UCK TIRES TIRES Bias Bias Ligh

Metals and Ammonia Metals and Ammonia Removal from Wastewaters Removal from Wastewaters Removal

Expectancy bias and Bias and forensic evidence Bias and speech research forensic speech

Publication bias in QCA Publication bias in QCA Publication bias in QCA Meaning, diagnosis and

Cluster Validity Hypothesis Random Graph Hypothesis Random Label Hypothesis Relative Criteria

ENEE416 9/29/2011 Wet Etch Etch: removal of material from wafer (e.g. removal silicon

IDORA MILL/BEAVER CREEK MINE TAILING REMEDIATION Idora Mill Pre Removal Mill Site Pre

Equity &amp; Excellence: Hidden Bias Implicit Bias Inherent Bias

Bias in, Bias out: Gender Equality and the Fourth Industrial Revolution Debra Howcroft and

Review Selection bias, overfitting Bias v. variance v. residual Bias-variance tradeoff

go to the source The Media Bias Chart The Media Bias Chart A new taxonomy for discussing the

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

Making Generative Classifiers Robust to Selection Bias Andrew Smith Charles Elkan November

Towards Robust Natural Language Understanding Group 3 Shengshuo L, Xuhui Z, Zeyu L, Xinyi W,

FAST UNCERTAINTY ESTIMATES AND BAYESIAN MODEL AVERAGING OF DNNS WESLEY MADDOX JOINT WORK WITH

Staying Well and Achieving Goals Piper S. Meyer-Kalos, Ph.D. Susan Gingerich, MSW Delbert

Graphs and Markov chains Graphs as matrices 0 1 2 3 4 If there is an edge (arrow) from node

Lube : Mitigating Bottlenecks in Hao Wang* Wide Area Data Analytics Baochun Li i Qua Wide Area

Moving tables across clusters Scaling a high traffic database Nice to meet you! Developer on

ASPE Training: Professional Development Days 1 Who is ASPE? We provide innovative, custom

The Express Elevator to Success A Program Starter Kit Who We Are Karen Brown Associate

On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference Yonatan Belinkov * , Adam Poliak, Benjamin Van Durme, Stuart Shieber, Alexander Rush SEM, Minneapolis, MN June 7, 2019 Co-Authors Yonatan Belinkov Adam Poliak

BIAS BIAS LIGHT LIGHT & & MEDIUM MEDIUM TR TRUCK UCK TIRES TIRES Bias Bias Ligh

Equity & Excellence: Hidden Bias Implicit Bias Inherent Bias