NeuralREG: an end-to-end approach for Referring Expression - - PowerPoint PPT Presentation

neuralreg an end to end approach for referring expression
SMART_READER_LITE
LIVE PREVIEW

NeuralREG: an end-to-end approach for Referring Expression - - PowerPoint PPT Presentation

NeuralREG: an end-to-end approach for Referring Expression Generation Thiago Castro Ferreira1 Diego Moussallem2 kos Kdr1 Emiel Krahmer1 Sander Wubben1 TiCC - Tilburg University1 AKSW Research Group, University of Leipzig, Germany2


slide-1
SLIDE 1

NeuralREG: an end-to-end approach for Referring Expression Generation

Thiago Castro Ferreira1 Diego Moussallem2 Ákos Kádár1 Emiel Krahmer1 Sander Wubben1 TiCC - Tilburg University1 AKSW Research Group, University of Leipzig, Germany2 Supported by the National Council of Scientific and Technological Development from Brazil (CNPq).

slide-2
SLIDE 2

NATURAL LANGUAGE GENERATION NATURAL LANGUAGE GENERATION

Non-linguistic data natural language

Subject Relation Object Aarhus_Airport cityServed Aarhus,_Denmark Aarhus_Airport elevation 25.0 Aarhus_Airport runwayName 10R/28L

↓NLG

The Aarhus Airport is located in Aarhus, Denmark. It is situated 25.0 meters above sea level. The airport has a runway called 10R/28L.

slide-3
SLIDE 3

REFERRING EXPRESSION GENERATION (REG) REFERRING EXPRESSION GENERATION (REG)

Task responsible for generating references to discourse entities Subject Relation Object Aarhus_Airport cityServed Aarhus,_Denmark Aarhus_Airport elevation 25.0 Aarhus_Airport runwayName 10R/28L

1 2 1 3 1 4

↓REG

The Aarhus Airport is located in Aarhus, Denmark . It is situated 25.0 meters above sea level . The airport has a runway called 10R/28L .

1 2 1 3 1 4

slide-4
SLIDE 4

MOTIVATION MOTIVATION

Novel "end-to-end" NLG models

Generation of delexicalized templates from different meaning representations...

AMR template text

(Konstas et al., 2017) (Castro Ferreira et al., 2017)

→ →

Dialog Act template dialogue text

(Wen et al., 2015) (Dušek and Jurčíček, 2016)

→ →

RDF triples template text

WebNLG Challenge (Gardent et al., 2017)

→ →

...for accounting data sparsity and unseen entities (Konstas et al., 2017)

slide-5
SLIDE 5

DATA DATA

WebNLG corpus

25,298 text describing 9,674 triple sets Manually delexicalized

slide-6
SLIDE 6

TEMPLATE GENERATION TEMPLATE GENERATION

Subject Relation Object SUBJECT-1 cityServed OBJECT-1 SUBJECT-1 elevation OBJECT-2 SUBJECT-1 runwayName OBJECT-3

↓template

SUBJECT-1 is located in OBJECT-1 . SUBJECT-1 is situated OBJECT-2 meters above sea level . SUBJECT-1 has a runway called OBJECT-3 .

slide-7
SLIDE 7

WIKIFICATION WIKIFICATION

Tag Entity SUBJECT-1 Aarhus_Airport OBJECT-1 Aarhus,_Denmark OBJECT-2 25.0 OBJECT-3 10R/28L

↓Wiki

Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L .

Conversion in constant time

slide-8
SLIDE 8

GOAL GOAL

Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L .

↓REG

The Aarhus Airport is located in Aarhus, Denmark . It is situated 25.0 meters above sea level . The airport has a runway called 10R/28L .

Underestimated process so far.

slide-9
SLIDE 9

PROBLEM PROBLEM

Aarhus Airport is located in Aarhus, Denmark . Aarhus Airport is situated 25.0 meters above sea level . Aarhus Airport has a runway called 10R/28L . vs. The Aarhus Airport is located in Aarhus, Denmark . It is situated 25.0 meters above sea level . The airport has a runway called 10R/28L .

REG is crucial for the coherence of the text

slide-10
SLIDE 10

REG MODELS REG MODELS

Extensively studied in pipeline architectures of NLG

GREC Challenges (Belz et al., 2010)

Decisions taken by different subtasks (modular)

Choice of referential form Surface realization

Bottlenecks

Feature engineering Difficulties in developing and maintaining Propagation of errors in cascade along the modules

slide-11
SLIDE 11

NEURALREG NEURALREG

End-to-end REG approach taking context into account

No need for feature engineering Choice of referential and surface realization in one go!

slide-12
SLIDE 12

INPUT INPUT

Target

Target reference to be realized

Pre-context

Lowercased, tokenized and delexicalized piece of text before the target reference

Pos-context

Lowercased, tokenized and delexicalized piece of text afuer the target reference

slide-13
SLIDE 13

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS

Pre-context Target pos-context

The Aarhus Airport

slide-14
SLIDE 14

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS

Pre-context Target pos-context

Aarhus, Denmark

slide-15
SLIDE 15

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS

Pre-context Target pos-context

It

slide-16
SLIDE 16

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS

Pre-context Target pos-context

25.0

slide-17
SLIDE 17

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS

Pre-context Target Pre-context

The airport

slide-18
SLIDE 18

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS

Pre-context Target Pre-context

10R/28L

slide-19
SLIDE 19

NEURALREG NEURALREG

Encoder Attention-Decoder architecture Context encoders

Vector representations for pre- and pos-contexts

Decoder

Combining representations and decoding the referring expression

slide-20
SLIDE 20

NEURALREG NEURALREG

EOS Aarhus_Airport is located in Aarhus,_Denmark . Pre-context

Aarhus_Airport

TARGET is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L . EOS Pos-Context

slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23
slide-24
SLIDE 24
slide-25
SLIDE 25
slide-26
SLIDE 26
slide-27
SLIDE 27
slide-28
SLIDE 28
slide-29
SLIDE 29
slide-30
SLIDE 30
slide-31
SLIDE 31

DECODER DECODER

= ( , [ , , ]) si Φdec si−1 ci Vyi−1 Vtarget = beam(softmax( + b)) yi Wcsi

evaluation of 3 methods to compute ...

ci

slide-32
SLIDE 32

SEQ2SEQ SEQ2SEQ

Average and concat matrixes and

h(pre) h(pos)

= ĥ

(k) 1 N ∑N i h(k) i

= [ , ] ci ĥ

(pre) ĥ (pos)

slide-33
SLIDE 33

CATT CATT

Concatenative attention

= tanh( + ) e(k)

ij

v(k)T

a

W (k)

a si−1

U (k)

a h(k) j

= α(k)

ij exp( ) e(k)

ij

exp( ) ∑N

n=1

e(k)

in

= c(k)

i

∑N

j=1 α(k) ij h(k) j

= [ , ] ci c(pre)

i

c(pos)

i

slide-34
SLIDE 34

HIERATT HIERATT

Hierarchical Attention (Libovický and Helcl, 2017)

= tanh( + ) e(k)

i

v(k)T

b

W (k)

b si−1

U (k)

b c(k) i

= β(k)

i exp( ) e(k)

i

exp( ) ∑n e(n)

i

= ci ∑k β(k)

i U (k) b c(k) i

slide-35
SLIDE 35

NEURALREG NEURALREG

= ( , [ , , ]) si Φdec si−1 ci Vyi−1 Vtarget

NeuralREG+Seq2Seq

= [avg( ), avg( )] ci h(pre) h(pos)

NeuralREG+CAtt

= [attend( ), attend( )] ci h(pre) h(pos)

NeuralREG+HierAtt

= hierattend(attend( ), attend( )) ci h(pre) h(pos)

slide-36
SLIDE 36

EVALUATION EVALUATION

WebNLG corpus

25,298 text describing 9,674 triple sets Manually delexicalized

78,901 references to 1,483 entities

Train: 63,031 - Dev: 7,127 - Test: 8,743

slide-37
SLIDE 37

BASELINES BASELINES

Only Names Ferreira

slide-38
SLIDE 38

ONLY NAMES ONLY NAMES

(WikiID) : underline whitespace

Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L .

↓REG

Aarhus Airport is located in Aarhus, Denmark . Aarhus Airport is situated 25.0 meters above sea level . Aarhus Airport has a runway called 10R/28L .

slide-39
SLIDE 39

FERREIRA FERREIRA

Choice of referential form

(Castro Ferreira et al., 2016) Aarhus_Airport is located in Aarhus,_Denmark . Aarhus_Airport is situated 25.0 meters above sea level . Aarhus_Airport has a runway called 10R/28L .

↓form

NAME is located in NAME . PRONOUN is situated NAME meters above sea level . DESCRIPTION has a runway called NAME .

S1 O2 S1 O3 S1 O4

slide-40
SLIDE 40

FERREIRA FERREIRA

Surface Realization

NAME is located in NAME . PRONOUN is situated NAME meters above sea level . DESCRIPTION has a runway called NAME .

S1 O2 S1 O3 S1 O5

↓realize

Pick the most frequent referring expression, given entity, form, syntactic position and referential status.

Features extracted from the dependency tree of the wikified text

slide-41
SLIDE 41

AUTOMATIC EVALUATION AUTOMATIC EVALUATION

REG metrics

Accuracy, string edit distance and pronoun accuracy

Text metrics

Text accuracy and BLEU

slide-42
SLIDE 42

REG METRICS REG METRICS

Acc String Pronoun Only Names

  • Ferreira

NeuralREG+Seq2Seq 75% NeuralREG+CAtt 74% 2.25 75% NeuralREG+HierAtt 73%

A A A A A

slide-43
SLIDE 43

TEXT METRICS TEXT METRICS

Acc BLEU Only Names Ferreira NeuralREG+Seq2Seq NeuralREG+CAtt 30% 79.39 NeuralREG+HierAtt

A A

slide-44
SLIDE 44

HUMAN EVALUATION HUMAN EVALUATION

Material

144 trials ( 6 triple set sizes 4 instances 6 text versions)

= × ×

Method

Latin square design 24 trials/list ( 144 trials 6 lists) 60 participants (10 participants/list)

= ÷

Metrics

Fluency, Grammaticality and Clarity 7-Likert scale

slide-45
SLIDE 45

HUMAN EVALUATION HUMAN EVALUATION

Fluency Grammar Clarity Only Names Ferreira NeuralREG+Seq2Seq NeuralREG+CAtt NeuralREG+HierAtt Original 5.41 5.17 5.42

A A A

slide-46
SLIDE 46

CONCLUSION CONCLUSION

First end-to-end approach for REG in text discourse

Improvements over reference accuracy and text fluency Concatenative attention (CAtt) best decoding method

Delexicalized version of WebNLG corpus

Useful resource for NLG in general

Data and code available

https://github.com/ThiagoCF05/NeuralREG

slide-47
SLIDE 47

QUESTIONS? QUESTIONS?

Thank you! :-)

https://github.com/ThiagoCF05/NeuralREG

slide-48
SLIDE 48

SETTINGS SETTINGS

Layers LSTM Training Method Adam Matrices init Xavier Batch Size 40 Epochs 60 Embedding Size 300 Hidden Layer Size 512 Dropout 0.2/0.3