NLG: Specific Components Texts NLG Systems Architecture modules - - PowerPoint PPT Presentation

nlg specific components
SMART_READER_LITE
LIVE PREVIEW

NLG: Specific Components Texts NLG Systems Architecture modules - - PowerPoint PPT Presentation

NLG: Specific Components Scott Farrar CLMA, University of Washington far- rar@u.washington.edu NLG: Specific Components Texts NLG Systems Architecture modules Scott Farrar Textplanner Microplanner CLMA, University of Washington


slide-1
SLIDE 1

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

NLG: Specific Components

Scott Farrar CLMA, University of Washington farrar@u.washington.edu March 8, 2010

1/42

slide-2
SLIDE 2

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Today’s lecture

1

Texts

2

NLG Systems

3

Architecture modules Textplanner Microplanner Surface realizer

SimpleNLG realizer

4

Hw7

2/42

slide-3
SLIDE 3

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-4
SLIDE 4

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-5
SLIDE 5

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-6
SLIDE 6

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-7
SLIDE 7

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-8
SLIDE 8

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-9
SLIDE 9

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (genealogy)

George Melvin Phillips (parents William D. Phillips and Matilda A. Jackson) was born 1864 in Athol, Somerset Co., MD. He died 14 April, 1933 in Allen, Wicomico Co., MD. Lillian White (parents George Melvin Phillips and Emma Washington Huffington) was born 27 October, 1908 in Allen, Wicomico Co., MD. She died 23 March, 1983 in Allen, Wicomico Co., MD. What are the units of information expressed this text? What would need to be in a database in order to generate it?

3/42

slide-10
SLIDE 10

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-11
SLIDE 11

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-12
SLIDE 12

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-13
SLIDE 13

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste

  • f local flavor. I am well educated with a great

career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-14
SLIDE 14

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste

  • f local flavor. I am well educated with a great

career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-15
SLIDE 15

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-16
SLIDE 16

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-17
SLIDE 17

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-18
SLIDE 18

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please.

4/42

slide-19
SLIDE 19

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please. There’s more than just facts to be reported. In NLG communicative intention is crucial.

4/42

slide-20
SLIDE 20

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please. There’s more than just facts to be reported. In NLG communicative intention is crucial.

4/42

slide-21
SLIDE 21

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Example text (personal ads)

I am a single girl new to area who would like to meet someone to hang out with and get a taste of local flavor. I am well educated with a great career...full-figured. I’m looking for mutual stimulating conversation with dating potential.... Don’t reply without picture, and be single–no married or attached guys please. There’s more than just facts to be reported. In NLG communicative intention is crucial.

4/42

slide-22
SLIDE 22

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

From knowledge to language

An important first step in NLG concerns planning the information needed to produce natural sounding, coherent text.

5/42

slide-23
SLIDE 23

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

From knowledge to language

An important first step in NLG concerns planning the information needed to produce natural sounding, coherent text. From a non-linguistic knowledge base, the system needs to identify the information of interest and combine it in a way consistent with the way humans package their beliefs, desires, intentions as language.

5/42

slide-24
SLIDE 24

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Today’s lecture

1

Texts

2

NLG Systems

3

Architecture modules Textplanner Microplanner Surface realizer

SimpleNLG realizer

4

Hw7

6/42

slide-25
SLIDE 25

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

NLG Three-step systems

Last time we said that a more fine-tuned approach to the notion of choice was needed for better control over the NLG

  • process. We compared a two- and a three-step approach.

Three-step architectures like WeatherReporter or the KNIGHT System are more flexible, and modular. There’s more control over the output.

7/42

slide-26
SLIDE 26

Main components in a three-step stystem

Module Content task Structure task Text Planner content determination document structuring Microplanner lexicalization; referring expression generation aggregation Surface Realizer linguistic realization structure realization

slide-27
SLIDE 27

NLG three-step architecture

slide-28
SLIDE 28

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Today’s lecture

1

Texts

2

NLG Systems

3

Architecture modules Textplanner Microplanner Surface realizer

SimpleNLG realizer

4

Hw7

10/42

slide-29
SLIDE 29

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Text planner: Purpose

The text planner decides what chunks of information to include (content determination), and how to structure them (text structuring).

11/42

slide-30
SLIDE 30

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Text planner: Purpose

The text planner decides what chunks of information to include (content determination), and how to structure them (text structuring).

Definition

The basic unit of information produced by text planner is the message: a configuration of significant predications from the knowledge source. Messages correspond to major textual units in the output (e.g., a full sentence or group of sentences).

11/42

slide-31
SLIDE 31

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Text planner: Purpose

The text planner decides what chunks of information to include (content determination), and how to structure them (text structuring).

Definition

The basic unit of information produced by text planner is the message: a configuration of significant predications from the knowledge source. Messages correspond to major textual units in the output (e.g., a full sentence or group of sentences). The text planner deals with information, not yet packaged in a linguistically suitable format, and with no reference to a NL grammar or lexicon.

11/42

slide-32
SLIDE 32

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Message types (genealogy vs. personal ads)

MarriageMessage BirthMessage OccupationMessage PhysicalTraitMessage RelationshipStatusMessage LikesMessage, DislikesMessage OccupationMessage

12/42

slide-33
SLIDE 33

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Text planner: Input

knowledge source: instances, classes, relations (possibly expressed in FOL, in a relational database, etc.) communicative goals: these are domain specific, though there are generalizations for all domains. E.g.,:

(comparePerson Sam Fred) (describeAll KB) (queryAge Fred)

13/42

slide-34
SLIDE 34

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Text planner: Output

The text planner outputs a text plan: a non-linguistic data structure that contains messages structured according to rhetorical relations. It does not specify any grammatical or lexical information. message: significant predications from the domain. E.g. ∃e BirthEvent(e) ∧ actor(e, GEORGE) ∧ location(e, SOMERSET CO MD), etc. rhetorical structure: relations between the chunks. E.g, elaboration, contrast, purpose, etc.

14/42

slide-35
SLIDE 35

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Target text elements

George Melvin Phillips was from Somerset Co., MD. George Melvin Phillips was born in 1864. But George Melvin Phillips died in 1933. He was married to Martha Hastings. George and Martha had four children.

15/42

slide-36
SLIDE 36

Output of text planner: text plan

slide-37
SLIDE 37

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Rhetorical Structure Theory (RST)

RST is a theory of the structure of texts that emphasizes textual function and the relationships between textual units. Consider several relation types among textual units T1 and T2:

17/42

slide-38
SLIDE 38

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Rhetorical Structure Theory (RST)

RST is a theory of the structure of texts that emphasizes textual function and the relationships between textual units. Consider several relation types among textual units T1 and T2: evidence - T1 is proven by T2. The defendant killed Smith. He had Smith’s blood on his hands.

17/42

slide-39
SLIDE 39

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Rhetorical Structure Theory (RST)

RST is a theory of the structure of texts that emphasizes textual function and the relationships between textual units. Consider several relation types among textual units T1 and T2: evidence - T1 is proven by T2. The defendant killed Smith. He had Smith’s blood on his hands. elaboration - the content of T1 is elaborated in T2. Mary had a little lamb, and she had it with mint sauce.

17/42

slide-40
SLIDE 40

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Rhetorical Structure Theory (RST)

RST is a theory of the structure of texts that emphasizes textual function and the relationships between textual units. Consider several relation types among textual units T1 and T2: evidence - T1 is proven by T2. The defendant killed Smith. He had Smith’s blood on his hands. elaboration - the content of T1 is elaborated in T2. Mary had a little lamb, and she had it with mint sauce. sequence - T1 precedes T2 in the narrative. John picked up his iPhone, then fired his boss with a text message.

17/42

slide-41
SLIDE 41

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Rhetorical Structure Theory (RST)

RST is a theory of the structure of texts that emphasizes textual function and the relationships between textual units. Consider several relation types among textual units T1 and T2: evidence - T1 is proven by T2. The defendant killed Smith. He had Smith’s blood on his hands. elaboration - the content of T1 is elaborated in T2. Mary had a little lamb, and she had it with mint sauce. sequence - T1 precedes T2 in the narrative. John picked up his iPhone, then fired his boss with a text message. contrast - shows that two elements T1 and T2 are contrasting with each other. The bride’s dress was red, while the bridesmaids’ were white.

17/42

slide-42
SLIDE 42

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Final structured text

George Melvin Phillips was from Somerset Co.,

  • MD. He was born in 1864, but died in 1933. He

was married to Martha Hastings. Later, George and Martha had four children ...

18/42

slide-43
SLIDE 43

Significant predications

Significant predications refer to the important knowledge to be linguistically packaged in the generated text. These must be determined in a domain specific manner, i.e., according to communicative goals.

Genealogy, encyclopedia entry, obituary, etc.

Roosevelt died of a cerebral hemorrhage on April 12, 1945

slide-44
SLIDE 44

Significant predications

Significant predications refer to the important knowledge to be linguistically packaged in the generated text. These must be determined in a domain specific manner, i.e., according to communicative goals.

Genealogy, encyclopedia entry, obituary, etc.

Roosevelt died of a cerebral hemorrhage on April 12, 1945

Buddy Holley’s coroner’s report

slide-45
SLIDE 45

Significant predications

Significant predications refer to the important knowledge to be linguistically packaged in the generated text. These must be determined in a domain specific manner, i.e., according to communicative goals.

Genealogy, encyclopedia entry, obituary, etc.

Roosevelt died of a cerebral hemorrhage on April 12, 1945

Buddy Holley’s coroner’s report

There was bleeding from both ears, and the face showed multiple lacerations. The consistency of the chest was soft due to extensive crushing injury to the bony structure. The left forearm was fractured 1/3 the way up from the wrist and the right elbow was fractured.

slide-46
SLIDE 46

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

A BirthMessage from hw7

This structured object represents the information to generate: FDR was born on January 30th, 1882 in Hyde Park, NY.

<BirthMessage> <person> <firstname>Franklin</firstname> <middlename>Delano</middlename> <lastname>Roosevelt</lastname> <gender>male</gender> </person> <date> <month>January</month> <day>30</day> <year>1882</year> </date> <location> <city>Hyde Park</city> <state>New York</state> </location> </BirthMessage>

20/42

slide-47
SLIDE 47

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

Early generation systems (pre-1990): consisted of only two modules, were brittle and did not perform well in terms of generating the overall textual structure.

21/42

slide-48
SLIDE 48

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

Early generation systems (pre-1990): consisted of only two modules, were brittle and did not perform well in terms of generating the overall textual structure.

21/42

slide-49
SLIDE 49

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

Early generation systems (pre-1990): consisted of only two modules, were brittle and did not perform well in terms of generating the overall textual structure.

1 Strategic generation: to determine the significant

predications and organize it into a text plan; research focused on AI planning techniques (e.g., STRIPS planner).

21/42

slide-50
SLIDE 50

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

Early generation systems (pre-1990): consisted of only two modules, were brittle and did not perform well in terms of generating the overall textual structure.

1 Strategic generation: to determine the significant

predications and organize it into a text plan; research focused on AI planning techniques (e.g., STRIPS planner).

2 Tactical generation: grammatical selection; lexical

choice (i.e., sentence planning)

21/42

slide-51
SLIDE 51

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

In early 1990s, researchers recognized that some functions of the strategic and tactical components overlapped:

22/42

slide-52
SLIDE 52

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

In early 1990s, researchers recognized that some functions of the strategic and tactical components overlapped:

1 The bridge between common-sense and linguistic

knowledge is often taken to be the lexicon. There was much debate as to where exactly lexical selection belonged.

22/42

slide-53
SLIDE 53

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

In early 1990s, researchers recognized that some functions of the strategic and tactical components overlapped:

1 The bridge between common-sense and linguistic

knowledge is often taken to be the lexicon. There was much debate as to where exactly lexical selection belonged.

2 In attempting complex text generation, the mapping of

propositions/messages directly onto sentences resulted in choppy, robotic sounding texts. Combining information is especially difficult when the input data is not specially designed for NLG.

22/42

slide-54
SLIDE 54

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Challenges to two-level generation

In early 1990s, researchers recognized that some functions of the strategic and tactical components overlapped:

1 The bridge between common-sense and linguistic

knowledge is often taken to be the lexicon. There was much debate as to where exactly lexical selection belonged.

2 In attempting complex text generation, the mapping of

propositions/messages directly onto sentences resulted in choppy, robotic sounding texts. Combining information is especially difficult when the input data is not specially designed for NLG.

3 Consider the problem of how to refer to domain entities.

In the best systems, control had to be switched back and forth between strategic and tactical components.

22/42

slide-55
SLIDE 55

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Referring expressions

AIG received another bailout ... The insurance giant underwent little scrutiny ... A spokesman for the company ... Congressmen berated the institution as ... Two-step system: Control has to be switched back and forth between strategic and tactical components.

23/42

slide-56
SLIDE 56

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Referring expressions

AIG received another bailout ... The insurance giant underwent little scrutiny ... A spokesman for the company ... Congressmen berated the institution as ... Two-step system: Control has to be switched back and forth between strategic and tactical components.

23/42

slide-57
SLIDE 57

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Referring expressions

AIG received another bailout ... The insurance giant underwent little scrutiny ... A spokesman for the company ... Congressmen berated the institution as ... Two-step system: Control has to be switched back and forth between strategic and tactical components.

23/42

slide-58
SLIDE 58

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Referring expressions

AIG received another bailout ... The insurance giant underwent little scrutiny ... A spokesman for the company ... Congressmen berated the institution as ... Two-step system: Control has to be switched back and forth between strategic and tactical components.

23/42

slide-59
SLIDE 59

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Referring expressions

AIG received another bailout ... The insurance giant underwent little scrutiny ... A spokesman for the company ... Congressmen berated the institution as ... Two-step system: Control has to be switched back and forth between strategic and tactical components.

23/42

slide-60
SLIDE 60

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanner: Purpose

These issues suggest the need for a third, intermediate component, often called a microplanner. This has been a major focus of research in the NLG community for the past decade or so.

24/42

slide-61
SLIDE 61

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanner: Purpose

These issues suggest the need for a third, intermediate component, often called a microplanner. This has been a major focus of research in the NLG community for the past decade or so. The microplanning component receives input from the text planner and determines the deep linguistic structure and content.

24/42

slide-62
SLIDE 62

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanner: Purpose

These issues suggest the need for a third, intermediate component, often called a microplanner. This has been a major focus of research in the NLG community for the past decade or so. The microplanning component receives input from the text planner and determines the deep linguistic structure and content. The main point: the microplanner is an intermediate stage that has access to both non-linguistic information (numerical database, etc.) and linguistic knowledge (grammar and lexicon).

24/42

slide-63
SLIDE 63

Microplanner

slide-64
SLIDE 64

Microplanner input

slide-65
SLIDE 65

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanner output

27/42

slide-66
SLIDE 66

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanner output

a phrase specification: a data structure that, along with the grammar, gives a full recipe for a particular phrase (e.g., clause or noun phrase), but is not the phrase itself.

27/42

slide-67
SLIDE 67

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanner output

a phrase specification: a data structure that, along with the grammar, gives a full recipe for a particular phrase (e.g., clause or noun phrase), but is not the phrase itself. Or for generation of entire texts, a text specification, an abstract structure representing the text without committing to certain surface forms

27/42

slide-68
SLIDE 68

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

28/42

slide-69
SLIDE 69

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

28/42

slide-70
SLIDE 70

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

1 content: lexemes (linguistic counterparts of

“concepts”)

28/42

slide-71
SLIDE 71

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

1 content: lexemes (linguistic counterparts of

“concepts”)

2 structure: 28/42

slide-72
SLIDE 72

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

1 content: lexemes (linguistic counterparts of

“concepts”)

2 structure:

phrasal categories,

28/42

slide-73
SLIDE 73

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

1 content: lexemes (linguistic counterparts of

“concepts”)

2 structure:

phrasal categories, features (whatever is grammaticalized in the language),

28/42

slide-74
SLIDE 74

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Phrase specifications

Consider a phrase specification: we need two things in order have a full recipe for the resulting NL:

1 content: lexemes (linguistic counterparts of

“concepts”)

2 structure:

phrasal categories, features (whatever is grammaticalized in the language), semantic role information for mapping semantic to syntactic structure

28/42

slide-75
SLIDE 75

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexemes

Definition

A lexeme is an uninflected abstraction of a content word, e.g., IDEA, SLEEP, GREEN (cf. ideas, slept, greener). Dictionary entries are based on the idea of a lexeme.

29/42

slide-76
SLIDE 76

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexemes

Definition

A lexeme is an uninflected abstraction of a content word, e.g., IDEA, SLEEP, GREEN (cf. ideas, slept, greener). Dictionary entries are based on the idea of a lexeme. The exact specification of a lexeme takes various forms for given theoretical traditions. But in general, lexemes are abstractions over a set of word forms.

29/42

slide-77
SLIDE 77

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexicon

Definition

The lexicon is the collection of lexemes for a given language. It provides a bridge from non-linguistic (common-sense) knowledge to linguistic knowledge (the grammar).

30/42

slide-78
SLIDE 78

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexicon

Definition

The lexicon is the collection of lexemes for a given language. It provides a bridge from non-linguistic (common-sense) knowledge to linguistic knowledge (the grammar). The lexicon enumerates the psychologically and culturally salient concepts in a language.

30/42

slide-79
SLIDE 79

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexical semantics

A key aim of lexical semantics is to investigate the mapping between language and commonsense knowledge.

31/42

slide-80
SLIDE 80

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexical semantics

A key aim of lexical semantics is to investigate the mapping between language and commonsense knowledge.

Break or squash or crush?

John broke/squashed/crushed the ball.

31/42

slide-81
SLIDE 81

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexical semantics

A key aim of lexical semantics is to investigate the mapping between language and commonsense knowledge.

Break or squash or crush?

John broke/squashed/crushed the ball. lexeme semantic features BREAK

  • bject: +artifact, +rigid

result: -functional

31/42

slide-82
SLIDE 82

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexical semantics

A key aim of lexical semantics is to investigate the mapping between language and commonsense knowledge.

Break or squash or crush?

John broke/squashed/crushed the ball. lexeme semantic features BREAK

  • bject: +artifact, +rigid

result: -functional

31/42

slide-83
SLIDE 83

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexical semantics

A key aim of lexical semantics is to investigate the mapping between language and commonsense knowledge.

Break or squash or crush?

John broke/squashed/crushed the ball. lexeme semantic features BREAK

  • bject: +artifact, +rigid

result: -functional SQUASH

  • bject: -rigid

result: +flat, +smaller

31/42

slide-84
SLIDE 84

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Lexical semantics

A key aim of lexical semantics is to investigate the mapping between language and commonsense knowledge.

Break or squash or crush?

John broke/squashed/crushed the ball. lexeme semantic features BREAK

  • bject: +artifact, +rigid

result: -functional SQUASH

  • bject: -rigid

result: +flat, +smaller CRUSH

  • bject: +rigid

result: +smaller

31/42

slide-85
SLIDE 85

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Grammatical constituents

Structuring

Once the lexicalization is complete, the information in the message needs to be transformed into a grammatical form: the syntactic category (e.g., NP) the syntactic role (e.g., head of phrase, complement of phrase) the features relevant for the grammar (e.g., definiteness, tense, etc.)

32/42

slide-86
SLIDE 86

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Microplanning sub-tasks

In general, we can come up four subtasks for the microplanner: grammaticalization: committing to specific grammatical structures (NPs, VPs, features) lexicalization: committing to specific lexical items (lexemes for message content, cue words for textual relations) aggregation: repackaging message content into a form that is more language-like, and less data-like referring expression generation: generating specific forms for KB entities

33/42

slide-87
SLIDE 87

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Surface realizer

Purpose

To generate natural language strings from a fully specified input (deterministic); the inverse of certain kinds of parsing processes. determines the surface form of the text; adds inflectional endings of words;

  • rders constituents;
  • misc. markup (e.g., lists, paragraphs, punctuation)

34/42

slide-88
SLIDE 88

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Surface realizer

Inputs

phrase specifications Or for an entire text, a text specification

Outputs

linearized sentences, texts

35/42

slide-89
SLIDE 89

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Details of the realizer

The surface realizer, in general, can be separated from the rest of the NLG system. It hides the idiosyncrasies of grammar from the rest of the system.

36/42

slide-90
SLIDE 90

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Details of the realizer

The surface realizer, in general, can be separated from the rest of the NLG system. It hides the idiosyncrasies of grammar from the rest of the system. In theory, the output language (e.g., Spanish) could be changed by swapping out this component. It’s the most language specific of the three components. (But this, really depends on how language-neutral the other NLG components are.)

36/42

slide-91
SLIDE 91

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Details of the realizer

The surface realizer, in general, can be separated from the rest of the NLG system. It hides the idiosyncrasies of grammar from the rest of the system. In theory, the output language (e.g., Spanish) could be changed by swapping out this component. It’s the most language specific of the three components. (But this, really depends on how language-neutral the other NLG components are.) Cutting edge research does not focus on the realizer. Surface realization is largely a solved problem and there are a couple

  • f robust open source systems.

36/42

slide-92
SLIDE 92

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

37/42

slide-93
SLIDE 93

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

Features of the system

37/42

slide-94
SLIDE 94

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

Features of the system

programmatic lexicon access

37/42

slide-95
SLIDE 95

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

Features of the system

programmatic lexicon access morphological component (e.g., adds -s to dog, -ren to child)

37/42

slide-96
SLIDE 96

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

Features of the system

programmatic lexicon access morphological component (e.g., adds -s to dog, -ren to child) an inventory of morphological features

37/42

slide-97
SLIDE 97

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

Features of the system

programmatic lexicon access morphological component (e.g., adds -s to dog, -ren to child) an inventory of morphological features various output formats: HTML, txt, etc.

37/42

slide-98
SLIDE 98

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Purpose

To take an underspecified input object (a text specification) and create a linearized string of words as output.

Features of the system

programmatic lexicon access morphological component (e.g., adds -s to dog, -ren to child) an inventory of morphological features various output formats: HTML, txt, etc. structured objects representing text hierarchy

37/42

slide-99
SLIDE 99

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements

“Realisation in SimpleNLG revolves around a tree structure. Each node in the tree is represented by a NLGElement, which in turn may have child nodes.”

Direct subclasses of NLGElement

These are the primary elements:

38/42

slide-100
SLIDE 100

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements

“Realisation in SimpleNLG revolves around a tree structure. Each node in the tree is represented by a NLGElement, which in turn may have child nodes.”

Direct subclasses of NLGElement

These are the primary elements: DocumentElement: used to define elements that form part of the textual structure

38/42

slide-101
SLIDE 101

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements

“Realisation in SimpleNLG revolves around a tree structure. Each node in the tree is represented by a NLGElement, which in turn may have child nodes.”

Direct subclasses of NLGElement

These are the primary elements: DocumentElement: used to define elements that form part of the textual structure PhraseElement: defines a phrase and covers the expected phrase types: noun phrases, verb phrases, etc.

38/42

slide-102
SLIDE 102

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements

“Realisation in SimpleNLG revolves around a tree structure. Each node in the tree is represented by a NLGElement, which in turn may have child nodes.”

Direct subclasses of NLGElement

These are the primary elements: DocumentElement: used to define elements that form part of the textual structure PhraseElement: defines a phrase and covers the expected phrase types: noun phrases, verb phrases, etc. WordElement: the class for a lexical entry (ie, a word), stored in a Lexicon

38/42

slide-103
SLIDE 103

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements continued

Direct subclasses of NLGElement

Other elements:

39/42

slide-104
SLIDE 104

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements continued

Direct subclasses of NLGElement

Other elements: CoordinatedPhraseElement:defines coordination between two or more phrases and involves the use of key words such as and or but.

39/42

slide-105
SLIDE 105

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements continued

Direct subclasses of NLGElement

Other elements: CoordinatedPhraseElement:defines coordination between two or more phrases and involves the use of key words such as and or but. InflectedWordElement: used to represent an word that requires inflection by the morphology

39/42

slide-106
SLIDE 106

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements continued

Direct subclasses of NLGElement

Other elements: CoordinatedPhraseElement:defines coordination between two or more phrases and involves the use of key words such as and or but. InflectedWordElement: used to represent an word that requires inflection by the morphology ListElement: used to define elements that can be grouped together and treated in a similar manner

39/42

slide-107
SLIDE 107

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Elements continued

Direct subclasses of NLGElement

Other elements: CoordinatedPhraseElement:defines coordination between two or more phrases and involves the use of key words such as and or but. InflectedWordElement: used to represent an word that requires inflection by the morphology ListElement: used to define elements that can be grouped together and treated in a similar manner StringElement: an element for representing canned text

39/42

slide-108
SLIDE 108

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

SimpleNLG

Canned text

Canned text refers to the use of set words or phrases in place of programmatic generation.

Examples:

Instead of generating a VP using phrasal elements and lexical item, simply use a string: “becoming partly cloudy”. StringElement canned = new StringElement("becoming partly cloudy");

40/42

slide-109
SLIDE 109

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Today’s lecture

1

Texts

2

NLG Systems

3

Architecture modules Textplanner Microplanner Surface realizer

SimpleNLG realizer

4

Hw7

41/42

slide-110
SLIDE 110

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Strategies for Hw7

Create various message types

For each message type in the XML input, create a class and then instantiate that class with the input. Message types will contain references to the semantic content.

42/42

slide-111
SLIDE 111

NLG: Specific Components Scott Farrar CLMA, University

  • f Washington far-

rar@u.washington.edu Texts NLG Systems Architecture modules

Textplanner Microplanner Surface realizer SimpleNLG realizer

Hw7

Strategies for Hw7

Create various message types

For each message type in the XML input, create a class and then instantiate that class with the input. Message types will contain references to the semantic content.

Create various semantic classes

Person: a class representing a person (name, gender, etc.) Date: a class representing a data (day, month, year) Location: a class representing a location (city, state)

42/42