
Breaking NLI Systems with Sentences that Require Simple Lexical Inferences

Max Glockner¹, Vered Shwartz² and Yoav Goldberg²

¹TU Darmstadt  ²Bar-Ilan University

July 18, 2018

SNLI [Bowman et al., 2015]

A large-scale dataset for NLI (Natural Language Inference; Recognizing Textual Entailment [Dagan et al., 2013]). Premises are image captions; hypotheses were written by crowdsourcing workers:

Premise

Street performer is doing his act for kids

Hypotheses

  • 1. A person performing for children on the street ⇒ ENTAILMENT
  • 2. A juggler entertaining a group of children on the street ⇒ NEUTRAL
  • 3. A magician performing for an audience in a nightclub ⇒ CONTRADICTION

Event co-reference assumption: premise and hypothesis are assumed to describe the same event, which is why a differing scene (nightclub vs. street) counts as a contradiction.

Max Glockner, Vered Shwartz and Yoav Goldberg · Breaking NLI Systems with Sentences that Require Simple Lexical Inferences
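The three-way labeling scheme can be sketched in a few lines; the dataclass and field names below are illustrative, not SNLI's release format.

```python
from collections import Counter
from dataclasses import dataclass

@dataclass
class NLIExample:
    """A single NLI item: a premise, a hypothesis, and a 3-way label."""
    premise: str
    hypothesis: str
    label: str  # "entailment" | "neutral" | "contradiction"

premise = "Street performer is doing his act for kids"
examples = [
    NLIExample(premise, "A person performing for children on the street", "entailment"),
    NLIExample(premise, "A juggler entertaining a group of children on the street", "neutral"),
    NLIExample(premise, "A magician performing for an audience in a nightclub", "contradiction"),
]

# One premise can pair with hypotheses of all three labels.
label_counts = Counter(ex.label for ex in examples)
```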

Neural NLI Models

End-to-end, either sentence-encoding or attention-based.

[Architecture diagram: Premise and Hypothesis → Premise Encoder / Hypothesis Encoder → Extract Features → Label Classifier; the attention-based variant adds an Attention component between the two encoders.]

Lexical knowledge: only from pre-trained word embeddings, as opposed to using resources like WordNet.

SOTA exceeds human performance...¹

¹ [Gururangan et al., 2018, Poliak et al., 2018]: by learning “easy clues”
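The sentence-encoding variant can be sketched as follows, assuming mean-pooled word embeddings as the encoder and the common [u; v; |u − v|; u ⊙ v] feature combination. The toy vocabulary, random (untrained) weights, and helper names are illustrative; this is not any of the evaluated models.

```python
import numpy as np

rng = np.random.default_rng(0)

VOCAB = {w: i for i, w in enumerate(
    "a the man woman is holding saxophone guitar performing street".split())}
EMB_DIM, N_CLASSES = 8, 3  # classes: entailment / neutral / contradiction
EMB = rng.normal(size=(len(VOCAB), EMB_DIM))  # stand-in for pre-trained embeddings

def encode(sentence: str) -> np.ndarray:
    """Sentence encoder: here simply mean-pooled word embeddings."""
    ids = [VOCAB[w] for w in sentence.lower().split() if w in VOCAB]
    return EMB[ids].mean(axis=0)

def extract_features(u: np.ndarray, v: np.ndarray) -> np.ndarray:
    """A common NLI feature combination: [u; v; |u - v|; u * v]."""
    return np.concatenate([u, v, np.abs(u - v), u * v])

W = rng.normal(size=(4 * EMB_DIM, N_CLASSES))  # untrained linear label classifier

def classify(premise: str, hypothesis: str) -> np.ndarray:
    """Return a probability distribution over the three labels."""
    logits = extract_features(encode(premise), encode(hypothesis)) @ W
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()  # softmax

probs = classify("the man is holding a saxophone", "the man is holding a guitar")
```

A trained model would learn EMB and W from SNLI; the point of the sketch is only the encode → extract features → classify pipeline from the diagram.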

Do neural NLI models implicitly learn lexical semantic relations?

New Test Set

We constructed a new test set to answer this question.

Premise: sentences from the SNLI training set.

Hypothesis: created by replacing a single term w in the premise with a related term w′, such that w′ is in the SNLI vocabulary and in the pre-trained embeddings. Labels were assigned by crowdsourcing (mostly contradictions!).

Contradiction: The man is holding a saxophone → The man is holding an electric guitar

Entailment: A little girl is very sad → A little girl is very unhappy

Neutral: A couple drinking wine → A couple drinking champagne
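The single-word substitution procedure can be sketched as follows. The RELATED lexicon below is a hand-made stand-in; the actual replacement pairs came from lexical resources and the labels from crowd workers.

```python
# Toy relation lexicon: word -> (replacement, label). Entries are illustrative.
RELATED = {
    "saxophone": ("guitar", "contradiction"),  # co-hyponyms (instruments)
    "sad": ("unhappy", "entailment"),          # synonyms
    "wine": ("champagne", "neutral"),          # hyponym replaces hypernym
}

def make_examples(premise: str):
    """Yield (premise, hypothesis, label) by swapping a single known term."""
    words = premise.split()
    for i, w in enumerate(words):
        if w in RELATED:
            replacement, label = RELATED[w]
            hypothesis = " ".join(words[:i] + [replacement] + words[i + 1:])
            yield premise, hypothesis, label

generated = list(make_examples("A little girl is very sad"))
```

A real pipeline would also repair surface details the swap breaks (e.g. "a" vs. "an" before the replacement), which is glossed over here.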

Evaluation Setting

3 representative models:

  • Residual-Stacked-Encoder [Nie and Bansal, 2017]
  • ESIM (Enhanced Sequential Inference Model) [Chen et al., 2017]
  • Decomposable Attention [Parikh et al., 2016]

Train on the SNLI training set, test on the original & new test sets.

In the paper: enhancing with additional existing datasets.

Results

Can neural NLI models recognize lexical inferences? Accuracy (%):

  • Decomposable Attention: 84.7 on the SNLI test set, 51.9 on the new test set
  • ESIM: 87.9 on the SNLI test set, 65.6 on the new test set
  • Residual-Stacked-Encoder: 86.0 on the SNLI test set, 62.2 on the new test set

Dramatic drop in performance across models.

Sanity Check

Performance of WordNet-informed models on the new test set (accuracy, %):

  • Best neural model: 65.6
  • KIM [Chen et al., 2018]: 83.5
  • WordNet baseline: 85.8

The test set is solvable using WordNet.
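A rough sketch of how such a WordNet-informed baseline can work: look up the lexical relation between the replaced pair and map it to a label. The relation table and the exact rule set below are illustrative assumptions (a toy dictionary stands in for WordNet), not the paper's implementation.

```python
# Toy relation table standing in for WordNet lookups (illustrative only).
RELATION = {
    ("sad", "unhappy"): "synonym",
    ("saxophone", "guitar"): "co-hyponym",  # both musical instruments
    ("wine", "champagne"): "hyponym",       # champagne is a kind of wine
    ("guitar", "instrument"): "hypernym",   # instrument generalizes guitar
}

def baseline_label(word: str, replacement: str) -> str:
    """Map the lexical relation of (word, replacement) to an NLI label."""
    rel = RELATION.get((word, replacement), "unknown")
    if rel in ("synonym", "hypernym"):    # replacement is implied by the word
        return "entailment"
    if rel in ("antonym", "co-hyponym"):  # mutually exclusive alternatives
        return "contradiction"
    return "neutral"                      # e.g. hyponym: more specific, unverifiable

label = baseline_label("saxophone", "guitar")
```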

What do neural NLI models learn with respect to lexical semantic relations?

Analysis 1: Word Similarity

Models err on contradicting word pairs with similar embeddings:

A man starts his day in India → A man starts his day in Malaysia

Especially for fixed word embeddings. Decomposable Attention accuracy by cosine similarity of the (word, replacement) pair:

  • 0.5–0.6: 46.2
  • 0.6–0.7: 42.3
  • 0.7–0.8: 37.5
  • 0.8–0.9: 29.7
  • 0.9–1.0: 20.2
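The similarity measure binned in this analysis is plain cosine similarity between the embedding vectors of the replaced pair. The 4-dimensional vectors below are made up for illustration; real analyses use pre-trained embeddings, which place related country names close together.

```python
import numpy as np

def cosine_similarity(u: np.ndarray, v: np.ndarray) -> float:
    """Cosine of the angle between two embedding vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Hypothetical embeddings for two country names (illustrative values).
india = np.array([0.9, 0.1, 0.4, 0.2])
malaysia = np.array([0.8, 0.2, 0.5, 0.1])
sim = cosine_similarity(india, malaysia)  # high similarity despite contradiction
```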

Analysis 2: Frequency in Training

Tuning embeddings may associate specific (word, replacement) pairs with a label, e.g. (man, woman) → contradiction. Accuracy increases with frequency in the training set. ESIM accuracy by frequency of the (word, replacement) pair in contradiction training examples:

  • 0: 40.2
  • 1–4: 70.6
  • 5–9: 91.4
  • 10–49: 92.1
  • 50–99: 97.5
  • 100+: 98.5
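Counting how often a (word, replacement) pair occurs in contradiction training examples can be sketched by aligning premise and hypothesis tokens and keeping pairs that differ in exactly one position. The toy examples below are illustrative; real counts come from SNLI.

```python
from collections import Counter

# Toy contradiction-labeled training pairs (illustrative only).
contradiction_pairs = [
    ("A man is walking", "A woman is walking"),
    ("A man is sleeping", "A woman is sleeping"),
    ("He plays a saxophone", "He plays a guitar"),
]

def substituted_pair(premise: str, hypothesis: str):
    """Return the single (word, replacement) pair if the two sentences
    differ in exactly one token position, else None."""
    p, h = premise.split(), hypothesis.split()
    if len(p) != len(h):
        return None
    diffs = [(a, b) for a, b in zip(p, h) if a != b]
    return diffs[0] if len(diffs) == 1 else None

pair_counts = Counter(
    pair for prem, hyp in contradiction_pairs
    if (pair := substituted_pair(prem, hyp)) is not None
)
```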

Recap

  • New NLI test set that evaluates systems’ ability to make inferences that require very simple lexical knowledge
  • SOTA systems perform poorly on the test set
  • Systems are limited in their generalization ability
  • May be used as a complementary test set to assess the lexical inference abilities of NLI systems

Thank you!

References

[Bowman et al., 2015] Bowman, S. R., Angeli, G., Potts, C., and Manning, C. D. (2015). A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 632–642. Association for Computational Linguistics.

[Chen et al., 2018] Chen, Q., Zhu, X., Ling, Z.-H., Inkpen, D., and Wei, S. (2018). Neural natural language inference models enhanced with external knowledge. In The 56th Annual Meeting of the Association for Computational Linguistics (ACL), Melbourne, Australia.

[Chen et al., 2017] Chen, Q., Zhu, X., Ling, Z.-H., Wei, S., Jiang, H., and Inkpen, D. (2017). Enhanced LSTM for natural language inference. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1657–1668, Vancouver, Canada. Association for Computational Linguistics.

[Dagan et al., 2013] Dagan, I., Roth, D., Sammons, M., and Zanzotto, F. M. (2013). Recognizing textual entailment: Models and applications. Synthesis Lectures on Human Language Technologies, 6(4):1–220.

[Gururangan et al., 2018] Gururangan, S., Swayamdipta, S., Levy, O., Schwartz, R., Bowman, S. R., and Smith, N. A. (2018). Annotation artifacts in natural language inference data. In The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), New Orleans, Louisiana.

[Nie and Bansal, 2017] Nie, Y. and Bansal, M. (2017). Shortcut-stacked sentence encoders for multi-domain inference. arXiv preprint arXiv:1708.02312.

[Parikh et al., 2016] Parikh, A., Täckström, O., Das, D., and Uszkoreit, J. (2016). A decomposable attention model for natural language inference. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2249–2255, Austin, Texas. Association for Computational Linguistics.

[Poliak et al., 2018] Poliak, A., Naradowsky, J., Haldar, A., Rudinger, R., and Van Durme, B. (2018). Hypothesis only baselines in natural language inference. In Joint Conference on Lexical and Computational Semantics (*SEM).