Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference (PowerPoint Presentation)


SLIDE 1

Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference

Authors

Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He

Organization

Zhejiang University, China

SLIDE 2

What is Natural Language Inference (NLI)?

Premise → Hypothesis: Entailment

SLIDE 3

What is Natural Language Inference (NLI)?

Premise → Hypothesis: Neutral

SLIDE 4

What is Natural Language Inference (NLI)?

Premise → Hypothesis: Contradiction

SLIDE 5

Applications

  • Question Answering
  • Machine Translation
  • Semantic Search
  • Text Summarization

SLIDE 6

Discourse Marker

  • A discourse marker is a word or phrase that plays a role in managing the flow and structure of discourse.

  • Examples: so, because, and, but, or…
SLIDE 7

Discourse Marker & NLI?

Discourse markers: But, Because, If, Although, And, So

NLI labels: Entailment, Neutral, Contradiction

SLIDE 8

Related Works

  • Datasets

SNLI (Bowman et al., 2015)
MultiNLI (Williams et al., 2017)

  • SOTA Neural Network Models

CAFE (Tay et al., 2017)
KIM (Chen et al., 2017)
DIIN (Gong et al., 2018)

SLIDE 9

Related Works

  • Transfer Learning for NLI

Skip-thoughts (Vendrov et al., 2016)
CoVe (McCann et al., 2017)

  • Discourse Marker Applications

DisSent (Nie et al., 2017)

SLIDE 10

Discourse Marker Prediction (DMP)

Sentence: “It’s rainy outside but we will not take the umbrella”

Decomposed: S1 = “It’s rainy outside”, marker = “but”, S2 = “we will not take the umbrella”

Task: a neural network M predicts the marker (so, because, but, if, …) from the pair (S1, S2).
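A minimal sketch of how a DMP training triple can be carved out of a raw sentence. The `extract_dmp_pair` helper and the marker list below are illustrative assumptions, not the paper's BookCorpus preprocessing code (the paper uses 8 markers; the exact set is not shown on this slide):

```python
import re

# Illustrative marker list (an assumption; not the paper's exact 8 markers).
MARKERS = ["but", "because", "so", "if", "although", "and", "or"]

def extract_dmp_pair(sentence):
    """Split a sentence at the first discourse marker found,
    returning (S1, marker, S2), or None if no marker occurs."""
    for marker in MARKERS:
        # Match the marker as a standalone word between two clauses.
        m = re.search(r"^(.+?)\s+%s\s+(.+)$" % marker, sentence,
                      flags=re.IGNORECASE)
        if m:
            return m.group(1).strip(), marker, m.group(2).strip()
    return None

pair = extract_dmp_pair("It's rainy outside but we will not take the umbrella")
# pair == ("It's rainy outside", "but", "we will not take the umbrella")
```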

SLIDE 11

Discourse Marker Prediction (DMP)

GloVe embeddings for Sentence1 and Sentence2 → BiLSTM encoder → sentence representations (last hidden state + max pooling over all hidden states) → marker prediction. The encoder is to be transferred to the NLI model.
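The slide's pooling scheme can be sketched with NumPy stand-ins for the BiLSTM hidden states (the shapes and the concatenation of the two summaries are assumptions for illustration):

```python
import numpy as np

# Pretend these are the BiLSTM's outputs: one d-dimensional
# hidden vector per time step of the sentence.
rng = np.random.default_rng(0)
T, d = 5, 4
hidden_states = rng.normal(size=(T, d))

last_state = hidden_states[-1]            # last hidden state
max_pooled = hidden_states.max(axis=0)    # max pooling over all steps
sentence_repr = np.concatenate([last_state, max_pooled])  # shape (2*d,)
```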

SLIDE 12

Discourse Marker Augmented Network (NLI Model)

Encoding layer: GloVe, character, POS, NER, and exact-match (EM) embeddings for both the premise and the hypothesis, encoded with a BiLSTM.

SLIDE 13

Discourse Marker Augmented Network (NLI Model)

In addition to the encoding layer features (GloVe, character, POS, NER, EM), the premise and the hypothesis are fed through the pre-trained DMP model (GloVe → BiLSTM) to obtain transferred sentence representations.

SLIDE 14

Discourse Marker Augmented Network (NLI Model)

Interaction: Similarity Matrix

Inputs to the similarity function:

  • the i-th word of the premise
  • the j-th word of the hypothesis
  • the sentence representation of the premise
  • the sentence representation of the hypothesis
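A toy dot-product version of the similarity matrix (the paper's actual similarity function also takes the transferred sentence representations as inputs, which this sketch omits):

```python
import numpy as np

rng = np.random.default_rng(1)
lp, lh, d = 3, 4, 5
premise = rng.normal(size=(lp, d))       # one row per premise word
hypothesis = rng.normal(size=(lh, d))    # one row per hypothesis word

# S[i, j] scores the i-th premise word against the j-th hypothesis word.
S = premise @ hypothesis.T               # shape (lp, lh)
```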

SLIDE 15

Discourse Marker Augmented Network (NLI Model)

Attention mechanism over the similarity matrix, followed by prediction.

Components:

  • the modeling vector of the premise
  • the modeling vector of the hypothesis
  • the sentence representation of the premise
  • the sentence representation of the hypothesis
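The attention step can be sketched as a row-wise softmax over the similarity matrix, producing a hypothesis summary for each premise word (a generic attention formulation, not the paper's exact equations):

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(2)
lp, lh, d = 3, 4, 5
hypothesis = rng.normal(size=(lh, d))    # hypothesis word vectors
S = rng.normal(size=(lp, lh))            # similarity matrix

attn = softmax(S, axis=1)                # each row sums to 1
attended = attn @ hypothesis             # per-premise-word hypothesis summary
```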

SLIDE 16

Training

Cross-entropy loss.

Correct label: neutral
Original (annotator) labels: neutral, neutral, entailment, entailment, neutral
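The slide's point is that the five annotators did not all agree, so treating "neutral" as the only right answer discards information. One way to sketch the idea (the scoring function here is illustrative, not the paper's exact objective):

```python
from collections import Counter

original_labels = ["neutral", "neutral", "entailment", "entailment", "neutral"]

def label_agreement(predicted, labels):
    """Fraction of annotators whose original label matches the prediction."""
    return Counter(labels)[predicted] / len(labels)

label_agreement("neutral", original_labels)        # 0.6
label_agreement("entailment", original_labels)     # 0.4
label_agreement("contradiction", original_labels)  # 0.0
```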

SLIDE 17

Training

Policy: predicts the label given the premise P and the hypothesis H; the previous action is used in the reinforcement learning objective.
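A hedged REINFORCE-style sketch of how such a policy could be updated, weighting a sampled label's log-probability by an annotator-agreement reward (the reward values and the policy distribution below are assumptions for illustration, not the paper's trained quantities):

```python
import numpy as np

rng = np.random.default_rng(3)
probs = np.array([0.5, 0.3, 0.2])    # policy over (neutral, entailment, contradiction)
rewards = np.array([0.6, 0.4, 0.0])  # e.g. annotator agreement per label

sampled = rng.choice(3, p=probs)     # sample an action (a label) from the policy
loss = -rewards[sampled] * np.log(probs[sampled])   # REINFORCE objective term
```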

SLIDE 18

Experiments (Datasets)

  • Stanford Natural Language Inference (SNLI) (Bowman et al., 2015)

570k human annotated sentence pairs

  • Multi-Genre Natural Language Inference (MultiNLI) (Williams et al., 2017)

433k human annotated sentence pairs

  • BookCorpus (Zhu et al., 2015)

6.5M pairs of sentences for 8 discourse markers

SLIDE 19

Experiments (Results)

  • Sentence encoding-based models
  • Other neural network models
  • Ensemble models

SLIDE 20

Experiments (Analysis)

SLIDE 21

Experiments (Analysis)

Premise: “3 young man in hoods standing in the middle of a quiet street facing the camera.”
Hypothesis: “Three people sit by a busy street bare-headed.”

SLIDE 22

Conclusion

  • We solve the task of natural language inference by transferring knowledge from another supervised task.

  • We propose a new objective function to make full use of the labels’ information.

  • In future work, we would like to explore other transfer learning sources.

SLIDE 23

Thank You!