Computational Models of Discourse: Generating Referring Expressions - - PowerPoint PPT Presentation

computational models of discourse generating referring
SMART_READER_LITE
LIVE PREVIEW

Computational Models of Discourse: Generating Referring Expressions - - PowerPoint PPT Presentation

Computational Models of Discourse: Generating Referring Expressions Caroline Sporleder Universit at des Saarlandes Sommersemester 2009 03.06.2009 Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse Generating


slide-1
SLIDE 1

Computational Models of Discourse: Generating Referring Expressions

Caroline Sporleder

Universit¨ at des Saarlandes

Sommersemester 2009 03.06.2009

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-2
SLIDE 2

Generating Referring Expressions

Example He claims record The 22-year-old computer science undergraduate from Bath is claiming a world record for the longest distance ridden on a unicycle in 24 hours. A unicycling student covered exactly 282 miles at Aberystwyth University’s athletics track. Sam Wakeling was aiming to beat the existing record of 235.3 miles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-3
SLIDE 3

Generating Referring Expressions

Example He claims record The 22-year-old computer science undergraduate from Bath is claiming a world record for the longest distance ridden on a unicycle in 24 hours. A unicycling student covered exactly 282 miles at Aberystwyth University’s athletics track. Sam Wakeling was aiming to beat the existing record of 235.3 miles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-4
SLIDE 4

Generating Referring Expressions

Example Unicycling student claims record A student is claiming a world record for the longest distance ridden

  • n a unicycle in 24 hours.

Sam Wakeling covered exactly 282 miles at Aberystwyth University’s athletics track. The 22-year-old computer science undergraduate from Bath was aiming to beat the existing record of 235.3 miles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-5
SLIDE 5

Referring Expressions in Linguistic Theory

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-6
SLIDE 6

Terminology

Referring Expression A linguistic expression (typically an NP) that a speaker uses to identify a (discourse) entity to the hearer. Referent The entity to which the speaker refers by a referring expression. Reference The process of identifying an entity by using a referring expression.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-7
SLIDE 7

Reference and Discourse

A given entity can be referred to in various ways:

1 Noam Chomsky has given a talk today. 2 One of the people working at MIT has given a talk today. 3 A person who is working at MIT has given a talk today. 4 A person has given a talk today. Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-8
SLIDE 8

Reference and Discourse

Reference and linguistic form The linguistic form of a referring expression reflects the current state of the discourse (and the speaker’s beliefs about the hearer’s discourse model)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-9
SLIDE 9

Reference and Discourse

Reference and linguistic form The linguistic form of a referring expression reflects the current state of the discourse (and the speaker’s beliefs about the hearer’s discourse model) Typically: new discourse referents are introduced by indefinite NPs (“a cat”)

  • ld discourse referents are referred to by definite NPs and

pronouns (“the cat”/”it”)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-10
SLIDE 10

Prince (1981, 1992): Linguistic Form & Familiarity Scale

Assumed Familiarity: From the point of view of the speaker/writer: Which assumptions about the hearer/reader influence the choice of the referring expression? From the point of view of the hearer/reader: Which conclusion is he/she going to draw from the choice of the referring expression?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-11
SLIDE 11

Dimensions of Familiarity (Prince 1981, 1992)

status of the referent discourse-new discourse-old (assumed by the speaker) hearer-new hearer-old

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-12
SLIDE 12

Dimensions of Familiarity (Prince 1981, 1992)

status of the referent discourse-new discourse-old (assumed by the speaker) hearer-new brand-new hearer-old brand-new: introduction of a new discourse referent representing an unknown entity (a student)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-13
SLIDE 13

Dimensions of Familiarity (Prince 1981, 1992)

status of the referent discourse-new discourse-old (assumed by the speaker) hearer-new brand-new — hearer-old brand-new: introduction of a new discourse referent representing an unknown entity (a student)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-14
SLIDE 14

Dimensions of Familiarity (Prince 1981, 1992)

status of the referent discourse-new discourse-old (assumed by the speaker) hearer-new brand-new — hearer-old unused brand-new: introduction of a new discourse referent representing an unknown entity (a student) unused: introduction of a new discourse referent representing a known entity (Queen Elisabeth)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-15
SLIDE 15

Dimensions of Familiarity (Prince 1981, 1992)

status of the referent discourse-new discourse-old (assumed by the speaker) hearer-new brand-new — hearer-old unused evoked brand-new: introduction of a new discourse referent representing an unknown entity (a student) unused: introduction of a new discourse referent representing a known entity (Queen Elisabeth) evoked: an entity is related to one which has been referred to before (in the discourse) The 22-year old computer science undergraduate from Bath

  • r is present in the situation

(you)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-16
SLIDE 16

Dimensions of Familiarity (Prince 1981, 1992)

status of the referent discourse-new discourse-old (assumed by the speaker) hearer-new brand-new — hearer-old unused evoked brand-new: introduction of a new discourse referent representing an unknown entity (a student) unused: introduction of a new discourse referent representing a known entity (Queen Elisabeth) evoked: an entity is related to one which has been referred to before (in the discourse) The 22-year old computer science undergraduate from Bath

  • r is present in the situation

(you) inferrable: introduction of a new discourse referent whose relation to a known entity is inferrable (like hearer-old but neither discourse-new nor discourse-old) (Peter walked towards the house. The door was open.)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-17
SLIDE 17

Prince’s Familiarity Hierarchy

Brand−new (unanchored) (Textually) Evoked Containing Inferrable (Noncontaining) Inferrable Brand−new Anchored Situationally Evoked

A

BN IC E Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-18
SLIDE 18

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Situationally Evoked E

Yesterday I got on a bus.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-19
SLIDE 19

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Situationally Evoked E

Somebody who works with Peter says he knows your sister.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-20
SLIDE 20

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Situationally Evoked E

Noam Chomsky went to Penn.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-21
SLIDE 21

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Situationally Evoked E

Somebody who works with Peter says he knows your sister.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-22
SLIDE 22

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Evoked E Situationally

Excuse me do you have the time?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-23
SLIDE 23

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Situationally Evoked E

Yesterday I got on a bus. The driver was drunk.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-24
SLIDE 24

Prince’s Familiarity Hierarchy

Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

Brand−new (unanchored) Brand−new Anchored

A

BN (Noncontaining) Inferrable Containing Inferrable IC (Textually) Evoked Situationally Evoked E

The pages of the book which I just bought, were falling out.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-25
SLIDE 25

Prince’s Familiarity Hierarchy

Brand−new (unanchored) (Textually) Evoked Containing Inferrable (Noncontaining) Inferrable Brand−new Anchored Situationally Evoked

A

BN IC E Evoked Unused Brand−new New Inferrable Assumed Familiarity U BN I E

S

E E S

  • > U > I > I C > BNA > BN

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-26
SLIDE 26

Prince’s Familiarityscale

E E S

  • > U > I > I C > BNA > BN

1 Noam Chomsky has given a talk today. (U) 2 One of the people working at MIT has given a talk today.

(I C)

3 A person who is working at MIT has given a talk today.

(BNA)

4 A person has given a talk today. (BN) Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-27
SLIDE 27

Generating Referring Expressions: Rule-based Approaches

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-28
SLIDE 28

Pick your Referring Expression

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-29
SLIDE 29

Generating Referring Expressions

Important for . . . generation (concept-to-text, text-to-text) Need to distinguish identification of target entity to hearer (discriminate between target entity and other entities in discourse)

  • ther communicative goals

I met an old friend yesterday. vs. The scoundrel is the one who betrayed us.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-30
SLIDE 30

Concept-to-text Generation

ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Greek Chios 600 BC e3 helmet silver Roman Rome 200 AD e4 helmet bronze Etruscan Pisa 400 BC

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-31
SLIDE 31

Concept-to-text Generation

ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Greek Chios 600 BC e3 helmet silver Roman Rome 200 AD e4 helmet bronze Etruscan Pisa 400 BC The silver coin is Greek. It was found on Chios and Ø dates from around 600 BC. The other coin is Roman. It is gold and Ø dates from around 50 BC. It was found in Pompeii. . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-32
SLIDE 32

Concept-to-text Generation

ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Greek Chios 600 BC e3 helmet silver Roman Rome 200 AD e4 helmet bronze Etruscan Pisa 400 BC The silver coin is Greek. It was found on Chios and Ø dates from around 600 BC. The other coin is Roman. It is gold and Ø dates from around 50 BC. It was found in Pompeii. . . Here are two coins: the silver one is Greek . . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-33
SLIDE 33

Rule-base Generation of Referring Expressions (REs)

(cf. Dale (1989), Reiter & Dale (1992), Dale & Reiter (1995))

EPICURE system generate cookery recipes deep generation three levels of semantic representation:

flexible knowledge base (KB) representing physical objects in a specific state deep semantic structure (DS) of a RE (recoverable by hearer) surface semantic structure (SS) of an RE (to be realised by the generation grammar)

to construct an RE:

1

construct deep semantic structure for a KB entity i.e. what do you want to say about the entity?

2

from the DS construct a surface semantic structure i.e. how dow you want to say it (pronominalisation etc.)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-34
SLIDE 34

Rule-base Generation of Referring Expressions (REs)

(cf. Dale (1989), Reiter & Dale (1992), Dale & Reiter (1995))

EPICURE system generate cookery recipes deep generation three levels of semantic representation:

flexible knowledge base (KB) representing physical objects in a specific state ⇒ in this domain, discourse entities change! deep semantic structure (DS) of a RE (recoverable by hearer) surface semantic structure (SS) of an RE (to be realised by the generation grammar)

to construct an RE:

1

construct deep semantic structure for a KB entity i.e. what do you want to say about the entity?

2

from the DS construct a surface semantic structure i.e. how dow you want to say it (pronominalisation etc.)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-35
SLIDE 35

Rule-base Generation of Referring Expressions (REs)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-36
SLIDE 36

Rule-base Generation of Referring Expressions (REs)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-37
SLIDE 37

Rule-base Generation of Referring Expressions (REs)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-38
SLIDE 38

Rule-base Generation of Referring Expressions (REs)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-39
SLIDE 39

Generation of Pronouns and Elided NPs

Pronominalisation: basically Centering-based approach (Cbs can be pronominalised) Elided (empty) NPs Cbs can be elided if they fill an optional grammatical role (e.g. indirect object) Fry the onionsi. Add the garlic Øi.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-40
SLIDE 40

Generation of Definite NPs

take into account Gricean Maxims:

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-41
SLIDE 41

Generation of Definite NPs

take into account Gricean Maxims: Quality Say the truth. Quantity Be as informative as possible. Relevance Be relevant. Manner Be brief. Avoid ambiguity. Be orderly.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-42
SLIDE 42

Generation of Definite NPs

take into account Gricean Maxims: Quality Say the truth. Quantity Be as informative as possible. Relevance Be relevant. Manner Be brief. Avoid ambiguity. Be orderly.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-43
SLIDE 43

Generation of Definite NPs

take into account Gricean Maxims: Quality Say the truth. Quantity Be as informative as possible. Relevance Be relevant. Manner Be brief. Avoid ambiguity. Be orderly. ⇒ Choose the shortest RE that discriminates the target entity from all other entities in the discourse model (the distractors).

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-44
SLIDE 44

Generation of Definite NPs

Strategy entities in the KB are described by attribute-value pairs (AVPs) ⇒ realise the smallest set of values that singles out the entity

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-45
SLIDE 45

Generation of Definite NPs

Strategy entities in the KB are described by attribute-value pairs (AVPs) ⇒ realise the smallest set of values that singles out the entity ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-46
SLIDE 46

Generation of Definite NPs

Strategy entities in the KB are described by attribute-value pairs (AVPs) ⇒ realise the smallest set of values that singles out the entity ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-47
SLIDE 47

Generation of Definite NPs

Strategy entities in the KB are described by attribute-value pairs (AVPs) ⇒ realise the smallest set of values that singles out the entity ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC Good: the gold coin (the golden one, the object made of gold) the coin found in Pompeii the coin dating from 50 BC

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-48
SLIDE 48

Generation of Definite NPs

Strategy entities in the KB are described by attribute-value pairs (AVPs) ⇒ realise the smallest set of values that singles out the entity ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC Good: the gold coin (the golden one, the object made of gold) the coin found in Pompeii the coin dating from 50 BC Not so good: the gold coin found in Pompeii (too informative) the Roman coin (not unique)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-49
SLIDE 49

Generation of Definite NPs

Search strategy: compute the discriminatory power of each AVP and choose the one with the highest until the entity is singled out Distractors: U = x1, x2, ..., xn Discriminatory power of an AVP given U: F(< a, v >, U) = n−k

n−1

1 ≤ k ≤ n where k is that number of distractors for which < a, v > is also true.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-50
SLIDE 50

Generation of Definite NPs

Example ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-51
SLIDE 51

Generation of Definite NPs

Example ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC F(< material, gold >, U) = 5−1

5−1 = 1

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-52
SLIDE 52

Generation of Definite NPs

Example ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC F(< material, gold >, U) = 5−1

5−1 = 1

F(< origin, Roman >, U) = 5−2

5−1 = 3 4 = .75

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-53
SLIDE 53

Generation of Definite NPs

Example ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC F(< material, gold >, U) = 5−1

5−1 = 1

F(< origin, Roman >, U) = 5−2

5−1 = 3 4 = .75

How to choose between equally scoring AVP sets?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-54
SLIDE 54

Generation of Definite NPs

Example ID what material

  • rigin

where found date e1 coin gold Roman Pompeii 50 BC e2 coin silver Roman Rome 30 AD e3 coin silver Greek Chios 600 BC e4 helmet silver Roman Rome 200 AD e5 helmet bronze Etruscan Pisa 400 BC F(< material, gold >, U) = 5−1

5−1 = 1

F(< origin, Roman >, U) = 5−2

5−1 = 3 4 = .75

How to choose between equally scoring AVP sets? ⇒ assume attributes are ordered a priory, e.g.: material > origin > where found etc.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-55
SLIDE 55

Open Issues

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-56
SLIDE 56

Open Issues

⇒ the small dog

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-57
SLIDE 57

Open Issues

⇒ the small dog ⇒ the Chihuahua

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-58
SLIDE 58

Open Issues

⇒ the small dog ⇒ the Chihuahua ⇒ the 15 cm tall dog

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-59
SLIDE 59

Open Issues

Humans do not always choose the shortest RE . . . ⇒ the white bird

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-60
SLIDE 60

Bibliography

Robert Dale. Cooking up referring expressions. In Proceedings of ACL, 1989. Robert Dale and Ehud Reiter. Computational interpretations of the gricean maxims in the generation of referring expressions. Cognitive Science, 19(2):233–263, 1995. Ellen Prince. Toward a taxonomy of given-new information. In P. Cole, editor, Radical Pragmatics, pages 223–56. Academic Press, New York, 1981. Ellen Prince. The ZPG letter: subjects, definiteness, and information-status. In S. Thompson and W. Mann, editors, Discourse descriptions: diverse analyses of a fund raising text, pages 295–325. John Benjamins, Philadelphia/Amsterdam, 1992. Ehud Reiter and Robert Dale. A fast algorithm for the generation of referring expressions. In Proceedings of Coling, 1992. Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse