Learning Sentence Planning Rules with Bayesian Methods David M. - - PowerPoint PPT Presentation

learning sentence planning rules with bayesian methods
SMART_READER_LITE
LIVE PREVIEW

Learning Sentence Planning Rules with Bayesian Methods David M. - - PowerPoint PPT Presentation

Learning Sentence Planning Rules with Bayesian Methods David M. Howcroft Department of Language Science and Technology Saarland Informatics Campus, Saarland University, Germany 24 October 2018 Howcroft (UdS) Learning Sentence Planning 24 Oct


slide-1
SLIDE 1

Learning Sentence Planning Rules with Bayesian Methods

David M. Howcroft

Department of Language Science and Technology Saarland Informatics Campus, Saarland University, Germany

24 October 2018

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 1 / 23

slide-2
SLIDE 2

Natural Language Generation

How do we transform non-linguistic input... name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood ...into a natural language text? Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood

  • restaurant. However, Due Fratelli’s price is average, while Andalucia’s price

is more expensive.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 2 / 23

slide-3
SLIDE 3

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-4
SLIDE 4

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates!

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-5
SLIDE 5

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates! inform(NAME, CUISINE)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-6
SLIDE 6

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates! inform(NAME, CUISINE) → “NAME is a CUISINE restaurant”

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-7
SLIDE 7

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates! inform(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food”

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-8
SLIDE 8

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates! inform(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” But this doesn’t generalize; only good for limited applications!

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-9
SLIDE 9

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates! inform(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” But this doesn’t generalize; only good for limited applications! Solution: modularity for reusability

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-10
SLIDE 10

Templates and traditional approaches to NLG

name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Simplest case: use templates! inform(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” But this doesn’t generalize; only good for limited applications! Solution: modularity for reusability ◮ Factor out language-general elements

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 3 / 23

slide-11
SLIDE 11

‘The’ NLG Pipeline

Text Planning Meaning Representation (MR): DB records, slot-value pairs, etc Content Selection Document Structuring Lexicalization Aggregation Referring Expres- sion Generation Sentence Planning Linearization Morphosyntactic Agreement Punctuation & Capitalization Surface Realization Natural Language Text

More reusable components, but still requires a lot of human attention.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 4 / 23

slide-12
SLIDE 12

End-to-end machine learning approaches to NLG

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-13
SLIDE 13

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-14
SLIDE 14

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

inform(name=DueFratelli, cuisine=Italian) inform(name=Andalucia, cuisine=Spanish,Seafood)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-15
SLIDE 15

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

inform(name=DueFratelli, cuisine=Italian) inform(name=Andalucia, cuisine=Spanish,Seafood)

Represent texts as sequences of words...

Due Fratelli serves Italian food. Andalucia is a Spanish, seafood restaurant.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-16
SLIDE 16

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

inform(name=DueFratelli, cuisine=Italian) inform(name=Andalucia, cuisine=Spanish,Seafood)

Represent texts as sequences of words...

Due Fratelli serves Italian food. Andalucia is a Spanish, seafood restaurant.

Apply your favorite sequence-to-sequence model

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-17
SLIDE 17

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

inform(name=DueFratelli, cuisine=Italian) inform(name=Andalucia, cuisine=Spanish,Seafood)

Represent texts as sequences of words...

Due Fratelli serves Italian food. Andalucia is a Spanish, seafood restaurant.

Apply your favorite sequence-to-sequence model ◮ Bayesian networks for gen. w/active learning (Mairesse et al. 2010; BAGEL)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-18
SLIDE 18

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

inform(name=DueFratelli, cuisine=Italian) inform(name=Andalucia, cuisine=Spanish,Seafood)

Represent texts as sequences of words...

Due Fratelli serves Italian food. Andalucia is a Spanish, seafood restaurant.

Apply your favorite sequence-to-sequence model ◮ Bayesian networks for gen. w/active learning (Mairesse et al. 2010; BAGEL) ◮ Semantically-Conditioned LSTM (Wen et al. 2015) ◮ TGen (Dušek & Jurčíček 2016) ◮ Neural Checklist Model (Kiddon et al. 2016)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-19
SLIDE 19

End-to-end machine learning approaches to NLG

Represent meanings as collections of slot-value pairs...

inform(name=DueFratelli, cuisine=Italian) inform(name=Andalucia, cuisine=Spanish,Seafood)

Represent texts as sequences of words...

Due Fratelli serves Italian food. Andalucia is a Spanish, seafood restaurant.

Apply your favorite sequence-to-sequence model ◮ Bayesian networks for gen. w/active learning (Mairesse et al. 2010; BAGEL) ◮ Semantically-Conditioned LSTM (Wen et al. 2015) ◮ TGen (Dušek & Jurčíček 2016) ◮ Neural Checklist Model (Kiddon et al. 2016) Problem: shallow representations and relative opacity

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 5 / 23

slide-20
SLIDE 20

Can we have the best of both worlds?

From the rule-based approach: ◮ existing resources for NLG; ◮ richer semantic & syntactic structure; ◮ inspectability; and ◮ modularity where it’s helpful. From the ML approach: ◮ reduced development effort Let’s find out! Narrowing our focus: ◮ assume text planning already done ◮ learn sentence planning rules ◮ use existing systems for surface realization

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 6 / 23

slide-21
SLIDE 21

Text Plans with Discourse Structure

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 7 / 23

slide-22
SLIDE 22

Text Plans with Discourse Structure

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 7 / 23

slide-23
SLIDE 23

Text Plans with Discourse Structure

Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood

  • restaurant. However, Due Fratelli’s price is average, while Andalucia’s price

is more expensive.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 7 / 23

slide-24
SLIDE 24

Morphosyntactic Reps for Surface Realization

Text Plan

contrast price SoniaRose 51 price Bienvenue 35

‘Logical Form’ be price the at Sonia Rose dollar 51 while be price the at Bienvenue dollar 35

A r g 1 Arg1 Arg2 A r g 2 A r g 1 A r g 2 A r g Det Mod Arg1 Arg1 Arg1 M

  • d

Arg0 Det Mod Arg1 Arg1 Arg1

◮ OpenCCG for surface realization ◮ morphosyntactic rep: logical forms

◮ think ‘lemmatized dependency trees’

◮ based on CCGbank (Hockenmaier 2006) = ⇒ WSJ coverage

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 8 / 23

slide-25
SLIDE 25

How to represent tree-to-tree mappings?

Synchronous Tree Substitution Grammars

contrast price SoniaRose 51 price Bienvenue 35

but be price the at Sonia Rose dollar 51 be price the at Bienvenue dollar 35

A r g 1 Arg1 Arg2 A r g 2 A r g 1 A r g 2 First Arg0 Det Mod Arg1 Arg1 Mod Next Arg0 Det Mod Arg1 Arg1 Mod

contrast x y but x y

Arg1 Arg2 First Next

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 9 / 23

slide-26
SLIDE 26

How to represent tree-to-tree mappings?

Synchronous Tree Substitution Grammars

contrast price SoniaRose 51 price Bienvenue 35

but be price the at Sonia Rose dollar 51 be price the at Bienvenue dollar 35

A r g 1 Arg1 Arg2 A r g 2 A r g 1 A r g 2 First Arg0 Det Mod Arg1 Arg1 Mod Next Arg0 Det Mod Arg1 Arg1 Mod

price Bienvenue 35 be price the at Bienvenue dollar 35

Arg1 Arg2 Arg0 Det Mod Arg1 Arg1 Mod

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 10 / 23

slide-27
SLIDE 27

How to represent tree-to-tree mappings?

Synchronous Tree Substitution Grammars

contrast price SoniaRose 51 price Bienvenue 35

but be price the at Sonia Rose dollar 51 be price the at Bienvenue dollar 35

A r g 1 Arg1 Arg2 A r g 2 A r g 1 A r g 2 First Arg0 Det Mod Arg1 Arg1 Mod Next Arg0 Det Mod Arg1 Arg1 Mod

price x y be price the at x dollar y

Arg1 Arg2 Arg0 Det Mod Arg1 Arg1 Mod

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 11 / 23

slide-28
SLIDE 28

Synchronous Derivation with TSGs

contrast x y but x y price x y be price the at x dollar y

Arg1 Arg2 First Next Arg1 Arg2 Arg0 Det Mod Arg1 Arg1 Mod

SoniaRose 51 price Bienvenue 35 Sonia Rose 51 be price the at Bienvenue dollar 35

Arg1 Arg2 Arg0 Det Mod Arg1 Arg1 Mod

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 12 / 23

slide-29
SLIDE 29

How to learn tree-to-tree mappings?

Hierarchical Dirichlet Processes ◮ prior biases towards small, reusable trees ◮ observations provide evidence for larger trees

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 13 / 23

slide-30
SLIDE 30

How to learn tree-to-tree mappings?

Hierarchical Dirichlet Processes ◮ prior biases towards small, reusable trees ◮ observations provide evidence for larger trees

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 13 / 23

slide-31
SLIDE 31

How to learn tree-to-tree mappings?

Hierarchical Dirichlet Processes ◮ prior biases towards small, reusable trees ◮ observations provide evidence for larger trees

Chinese Restaurant Process

P(e) = freq(e) #obs+α + α #obs +α Pprior(e), (1) where α is the concentration parameter, freq(e) is the number of times we have observed the elementary tree e, #obs is the total number of

  • bservations, and Pprior is our prior.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 13 / 23

slide-32
SLIDE 32

Statistical Model for sTSGs for SP

AT P n NT P l TT P l TP pair (l,l) TLF NLF ALF n l l LF Al LT P LLF Alignments Figure: Dependencies in our statistical model, omitting parameters for clarity. Each node represents a Dirichlet process over base distributions with α = 1. n here indexes node labels, while l similarly represents tree locations.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 14 / 23

slide-33
SLIDE 33

Some example texts

1 Chanpen Thai has the best overall quality among the selected restaurants.

Its price is 24 dollars and it has good service. This Thai restaurant has good food quality, with decent decor.

2 Since Komodo’s price is 29 dollars and it has good decor, it has the best

  • verall quality among the selected restaurants.

3 Azuri Cafe, which is a Vegetarian restaurant has very good food quality.

Its price is 14 dollars. It has the best overall quality among the selected restaurants.

4 Komodo has very good service. It has food food quality, with very good

food quality, it has very good food quality and its price is 29 dollars.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 15 / 23

slide-34
SLIDE 34

Training data?

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 16 / 23

slide-35
SLIDE 35

Training data?

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 16 / 23

slide-36
SLIDE 36

Training data?

Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood

  • restaurant. However, Due Fratelli’s price is average, while Andalucia’s price

is more expensive.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 16 / 23

slide-37
SLIDE 37

Training data?

Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood

  • restaurant. However, Due Fratelli’s price is average, while Andalucia’s price

is more expensive. ◮ The SPaRKy Restaurant Corpus (Walker et al. 2007)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 16 / 23

slide-38
SLIDE 38

Training data?

Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood

  • restaurant. However, Due Fratelli’s price is average, while Andalucia’s price

is more expensive. ◮ The SPaRKy Restaurant Corpus (Walker et al. 2007)

◮ 1800 texts from an NLG system

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 16 / 23

slide-39
SLIDE 39

Training data?

Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood

  • restaurant. However, Due Fratelli’s price is average, while Andalucia’s price

is more expensive. ◮ The SPaRKy Restaurant Corpus (Walker et al. 2007)

◮ 1800 texts from an NLG system ◮ discourse semantics, but limited variation

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 16 / 23

slide-40
SLIDE 40

Crowdsourced Corpora

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 17 / 23

slide-41
SLIDE 41

Crowdsourced Corpora

BAGEL Corpus (Mairesse et al. 2010)

◮ 404 utterances for 202 dialogue acts ◮ e.g. inform(name=DueFratelli;price=$$;cuisine=Italian)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 17 / 23

slide-42
SLIDE 42

Crowdsourced Corpora

BAGEL Corpus (Mairesse et al. 2010)

◮ 404 utterances for 202 dialogue acts ◮ e.g. inform(name=DueFratelli;price=$$;cuisine=Italian)

SFX-restaurants (Wen et al. 2015)

◮ 5k utterances+DAs

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 17 / 23

slide-43
SLIDE 43

Crowdsourced Corpora

BAGEL Corpus (Mairesse et al. 2010)

◮ 404 utterances for 202 dialogue acts ◮ e.g. inform(name=DueFratelli;price=$$;cuisine=Italian)

SFX-restaurants (Wen et al. 2015)

◮ 5k utterances+DAs

E2E Challenge Dataset (Novikova et al. 2016, 2017)

◮ 50k utterances+DAs ◮ increased variation (image-based elicitation)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 17 / 23

slide-44
SLIDE 44

Building the corpus

Objective: the best of both worlds

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-45
SLIDE 45

Building the corpus

Objective: the best of both worlds

1 discourse-level semantic representation

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-46
SLIDE 46

Building the corpus

Objective: the best of both worlds

1 discourse-level semantic representation 2 with a good amount of variation

◮ esp. with respect to information density

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-47
SLIDE 47

Building the corpus

Objective: the best of both worlds

1 discourse-level semantic representation 2 with a good amount of variation

◮ esp. with respect to information density

Method: collect paraphrases

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-48
SLIDE 48

Building the corpus

Objective: the best of both worlds

1 discourse-level semantic representation 2 with a good amount of variation

◮ esp. with respect to information density

Method: collect paraphrases ◮ already have discourse-level semantics

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-49
SLIDE 49

Building the corpus

Objective: the best of both worlds

1 discourse-level semantic representation 2 with a good amount of variation

◮ esp. with respect to information density

Method: collect paraphrases ◮ already have discourse-level semantics ◮ more variation than in the original SPaRKy corpus

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-50
SLIDE 50

Building the corpus

Objective: the best of both worlds

1 discourse-level semantic representation 2 with a good amount of variation

◮ esp. with respect to information density

Method: collect paraphrases ◮ already have discourse-level semantics ◮ more variation than in the original SPaRKy corpus

2 conditions: default vs. elderly audience

We are adding variety to an existing dialogue system and we need your help! In this task, you will be given a text about one or more restaurants written by our existing system. Your job is to express the same facts, describing the restaurant(s) as you would describe them to your...

default: ...friends or family. elderly: ...85-year-old grandmother.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 18 / 23

slide-51
SLIDE 51

Corpus Statistics

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 19 / 23

slide-52
SLIDE 52

Corpus Statistics

◮ about 5k texts, with discourse-level semantic annotations

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 19 / 23

slide-53
SLIDE 53

Corpus Statistics

◮ about 5k texts, with discourse-level semantic annotations ◮ significantly lower information density in the elderly condition

100 200 5.0 7.5 10.0 12.5

  • Avg. Surprisal (in bits; 30 bins)

Frequency .id

default elderly

Subjects use lower−surprisal sentences addressing grandma

Average surprisal across texts

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 19 / 23

slide-54
SLIDE 54

Examples (1)

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 20 / 23

slide-55
SLIDE 55

Examples (1)

One Italian restaurant is called Caffe Buon Gusto. However, John’s Pizzeria is an Italian pizza restaurant. Choose Caffe Buon Gusto if you desire a traditional Italian restaurant. Otherwise, try out John’s Pizzeria.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 20 / 23

slide-56
SLIDE 56

Examples (1)

One Italian restaurant is called Caffe Buon Gusto. However, John’s Pizzeria is an Italian pizza restaurant. Choose Caffe Buon Gusto if you desire a traditional Italian restaurant. Otherwise, try out John’s Pizzeria.

  • cf. Caffe Buon Gusto is an Italian restaurant. John’s Pizzeria, on the other hand,

is an Italian, Pizza restaurant.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 20 / 23

slide-57
SLIDE 57

Examples (2)

Chez Joesphine is the best choice because of food quality, service and decor. Hands down, Chez Josephine has the best quality food out of all of these restaurants. Employees are always happy to help you and the atmosphere is fantastic.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 21 / 23

slide-58
SLIDE 58

Examples (2)

Chez Joesphine is the best choice because of food quality, service and decor. Hands down, Chez Josephine has the best quality food out of all of these restaurants. Employees are always happy to help you and the atmosphere is fantastic.

  • cf. Chez Josephine has the best overall quality among the selected restaurants. It

has very good service, with very good decor. It has very good food quality.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 21 / 23

slide-59
SLIDE 59

Summary

We have motivated a new approach to ML for NLG: We built a corpus which includes:

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-60
SLIDE 60

Summary

We have motivated a new approach to ML for NLG: We built a corpus which includes:

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-61
SLIDE 61

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules We built a corpus which includes:

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-62
SLIDE 62

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars We built a corpus which includes:

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-63
SLIDE 63

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars ◮ using hierarchical Dirichlet Processes. We built a corpus which includes:

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-64
SLIDE 64

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars ◮ using hierarchical Dirichlet Processes. We built a corpus which includes:

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-65
SLIDE 65

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars ◮ using hierarchical Dirichlet Processes. We built a corpus which includes: ◮ variation with respect to information density, and

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-66
SLIDE 66

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars ◮ using hierarchical Dirichlet Processes. We built a corpus which includes: ◮ variation with respect to information density, and ◮ a hierarchical semantic annotation.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-67
SLIDE 67

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars ◮ using hierarchical Dirichlet Processes. We built a corpus which includes: ◮ variation with respect to information density, and ◮ a hierarchical semantic annotation.

This work was supported by the DFG through SFB 1102 ’Information Density and Linguistic Encoding’.

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-68
SLIDE 68

Summary

We have motivated a new approach to ML for NLG: ◮ with a focus on learning sentence planning rules ◮ represented as synchronous tree-substitution grammars ◮ using hierarchical Dirichlet Processes. We built a corpus which includes: ◮ variation with respect to information density, and ◮ a hierarchical semantic annotation.

This work was supported by the DFG through SFB 1102 ’Information Density and Linguistic Encoding’.

Thank you!

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 22 / 23

slide-69
SLIDE 69

Did we achieve greater lexical variety?

corpus # texts Vocabulary BAGEL 404 74 SFX-restaurant 5192 353 Novikova et al. 1243 238 Original SRC 1760 99 Extended SRC 5356 577

Table: Vocabulary diversity and corpus size

Howcroft (UdS) Learning Sentence Planning 24 Oct 2018 23 / 23