Computational Models of Discourse: Preliminaries, Overview Caroline - - PowerPoint PPT Presentation

computational models of discourse preliminaries overview
SMART_READER_LITE
LIVE PREVIEW

Computational Models of Discourse: Preliminaries, Overview Caroline - - PowerPoint PPT Presentation

Computational Models of Discourse: Preliminaries, Overview Caroline Sporleder Universit at des Saarlandes Sommersemester 2008/09 22.04.2008 Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse Preliminaries


slide-1
SLIDE 1

Computational Models of Discourse: Preliminaries, Overview

Caroline Sporleder

Universit¨ at des Saarlandes

Sommersemester 2008/09 22.04.2008

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-2
SLIDE 2

Preliminaries

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-3
SLIDE 3

Preliminaries

Course offered as: Proseminar (B.Sc.): 4. Semester Hauptseminar (B.Sc.): 6. Semester Seminar (M.Sc.) Scheine/Credits: Proseminar: class presentation (20-30 minutes), term paper (10-15 Seiten) ⇒ 5 credit points Hauptseminar/Seminar (MSc): class presentation (30-40 minutes), term paper (about 20 pages) ⇒ 7 credit points (4 for presentation, 3 for term paper)

  • ptional oral exam (10-30 minutes)

“participation in seminar” (reading, discussions, exercises etc.) Course content complementary to Helmut Horacek’s Seminar “Diskursph¨ anomene” (i.e. virtually no overlap)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-4
SLIDE 4

Class Presentations

Preparation

  • ptionally: preparatory meeting to discuss open questions and

presentation draft

  • ptionally: peer group discussion

Evaluation Criteria Content main aspects of the paper(s) covered good explanations of content (examples, graphics) discussion of advantages/disadvantages of presented method use of additional (printed) references (optionally) Form slides contain references slides are well-structured the topic was well-presented (interaction with audience etc.) the presentation is not too short the presentation is not (much) too long

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-5
SLIDE 5

Term Papers

Writing term papers answer a concrete research question (or a set of related questions) show evidence of independent thinking and reasoning independent bibliographic work meet academic standards (proper citations/references, well-structured, well-written etc.) Don’t plagiarise! can include some practical work but doesn’t have to around 100 hours of work (2-3 weeks full-time) 15-20 pages long 10-15 references Please discuss topic with me beforehand!

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-6
SLIDE 6

Course Overview

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-7
SLIDE 7

Why Discourse Processing?

Natural language rarely comes in isolated sentences. . . newspaper articles novels dialogues speeches by politicians . . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-8
SLIDE 8

Why Discourse Processing?

Natural language rarely comes in isolated sentences. . . newspaper articles novels dialogues speeches by politicians . . . NLP applications need to be able to deal with discourse. . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-9
SLIDE 9

Why Discourse Processing?

Natural language rarely comes in isolated sentences. . . newspaper articles novels dialogues speeches by politicians . . . NLP applications need to be able to deal with discourse. . . dialogue systems question answering text summarisation information extraction natural language generation natural language understanding . . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-10
SLIDE 10

Discourse Context Matters

Example: Co-reference Resolution Campaigning has closed in Argentina ahead of Sunday’s election to elect a successor to President Nestor Kirchner. The front-runner in the opinion polls is the current first lady, Senator Cristina Fernandez de Kirchner. She praised the economic record of her husband’s government during a rally in Buenos Aires.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-11
SLIDE 11

Discourse Context Matters

Example: Co-reference Resolution She praised the economic record of her husband’s government during a rally in Buenos Aires.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-12
SLIDE 12

Discourse Context Matters

Example: Question Answering (“Why was Caesar killed?”) Caesar was proclaimed dictator for life, and he heavily centralised the bureaucracy of the Republic. These events provoked a hitherto friend of Caesar, Marcus Junius Brutus, and a group of other senators, to assassinate the dictator

  • n the Ides of March (March 15) in 44 BC.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-13
SLIDE 13

Discourse Context Matters

Example: Coherence/Text Generation Greek officials hope the new site will boost the country’s long cam- paign for the return of the Elgin Marbles. Crowds of bystanders watched the first of the monuments lifted by cranes at the 2,500-year-old Parthenon. Greece has begun moving the ancient sculptures from the Acropolis in Athens to a new home - a museum at the foot of the hilltop citadel.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-14
SLIDE 14

Discourse Context Matters

Example: Coherence/Text Generation Greece has begun moving the ancient sculptures from the Acropolis in Athens to a new home - a museum at the foot of the hilltop citadel. Crowds of bystanders watched the first of the monuments lifted by cranes at the 2,500-year-old Parthenon. Greek officials hope the new site will boost the country’s long cam- paign for the return of the Elgin Marbles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-15
SLIDE 15

Draft Schedule

29.04.2009 Introduction Discourse (and Machine Learning) 06.05.2009 Cohesion and Local Coherence

  • Lexical Cohesion, Lexical Chains
  • Focus, Centering

13.05.2009 Text Segmentation

  • TextTiling
  • Other Segmentation Methods

20.05.2009 Applications

  • Automatic Essay Scoring
  • Information Ordering for Text Generation

27.05.2009 Generating Referring Expressions

  • rule-based
  • machine learning

03.06.2009 Co-reference Resolution

  • rule-based
  • supervised machine learning
  • unsupervised machine learning

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-16
SLIDE 16

Draft Schedule, cont’d

10.06.2009 Discourse Parsing

  • Discourse Parsing with RST
  • Machine Learning

17.06.2009 Temporal Ordering 24.06.2009 Text Summarisation

  • lexical chains
  • RST-based
  • multi-document
  • argumentative zoning

01.07.2009 Sentiment Analysis 08.07.2009 Dialogue Processing

  • classification of dialogue acts
  • dialogue planning

15.07.2009 Speech, Psycholinguistic Models 22.07.2009 Recap, Conclusion

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-17
SLIDE 17

Coherence

Central Questions: how can (local) text coherence be modelled? when is a text coherent?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-18
SLIDE 18

Example: Coherence

John went to his favorite music store to buy a piano. It was a store John had frequented for many years. He was excited that he could finally buy a piano. It was closing just as John arrived. John went to his favorite music store to buy a piano. He had frequented the store for many years. He was excited that he could finally buy a piano. He arrived just as the store was closing.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-19
SLIDE 19

Coherence

Central Questions: how can (local) text coherence be modelled? when is a text coherent?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-20
SLIDE 20

Coherence

Central Questions: how can (local) text coherence be modelled? when is a text coherent? Applications: many . . . text generation (concept-to-text, text-to-text, summarisation etc.) evaluation of text generating systems (summarisation, machine translation etc.) evaluation of human-written text (automatic essay scoring, readability assessment etc.)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-21
SLIDE 21

Lexical Chains

chains of semantically related words ⇒ . . . measure lexical cohesion

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-22
SLIDE 22

Lexical Chains

chains of semantically related words ⇒ . . . measure lexical cohesion Example At least eight people have been killed in flooding in the Dominican Republic following torrential rains dumped by Tropical Storm Noel. The deaths were reported in the Dominican capital Santo Domingo, and along the south coast. The centre of the storm had passed by midday on Monday, and was set to head north towards the Bahamas.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-23
SLIDE 23

Lexical Chains

chains of semantically related words ⇒ . . . measure lexical cohesion Example At least eight people have been killed in flooding in the Dominican Republic following torrential rains dumped by Tropical Storm Noel. The deaths were reported in the Dominican capital Santo Domingo, and along the south coast. The centre of the storm had passed by midday on Monday, and was set to head north towards the Bahamas.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-24
SLIDE 24

Lexical Chains

chains of semantically related words ⇒ . . . measure lexical cohesion Example At least eight people have been killed in flooding in the Dominican Republic following torrential rains dumped by Tropical Storm Noel. The deaths were reported in the Dominican capital Santo Domingo, and along the south coast. The centre of the storm had passed by midday on Monday, and was set to head north towards the Bahamas.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-25
SLIDE 25

Lexical Chains

chains of semantically related words ⇒ . . . measure lexical cohesion Example At least eight people have been killed in flooding in the Dominican Republic following torrential rains dumped by Tropical Storm Noel. The deaths were reported in the Dominican capital Santo Domingo, and along the south coast. The centre of the storm had passed by midday on Monday, and was set to head north towards the Bahamas.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-26
SLIDE 26

Centering Theory

. . . models focus shifts Example Susan would like to go on a holiday. But she needs to find somebody to do her work while she’s away. She can’t think of anybody to do that. She considered Mike but he’s a bit unreliable. Yesterday he forgot to turn up for an important meeting with a

  • client. The client was very annoyed and said she would never do

business with Susan’s company again.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-27
SLIDE 27

Centering Theory

. . . models focus shifts Example Susan would like to go on a holiday. But she needs to find somebody to do her work while she’s away. She can’t think of anybody to do that. She considered Mike but he’s a bit unreliable. Yesterday he forgot to turn up for an important meeting with a

  • client. The client was very annoyed and said she would never do

business with Susan’s company again.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-28
SLIDE 28

Centering Theory

. . . models focus shifts Example Susan would like to go on a holiday. But she needs to find somebody to do her work while she’s away. She can’t think of anybody to do that. She considered Mike but he’s a bit unreliable. Yesterday he forgot to turn up for an important meeting with a

  • client. The client was very annoyed and said she would never do

business with Susan’s company again.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-29
SLIDE 29

Centering Theory

. . . models focus shifts Example Susan would like to go on a holiday. But she needs to find somebody to do her work while she’s away. She can’t think of anybody to do that. She considered Mike but he’s a bit unreliable. Yesterday he forgot to turn up for an important meeting with a

  • client. The client was very annoyed and said she would never do

business with Susan’s company again.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-30
SLIDE 30

Text Segmentation

. . . indentification of (sub-)topic shifts

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-31
SLIDE 31

Text Segmentation

Example Penicillin is a group of beta-lactam antibiotics used in the treatment

  • f bacterial infections caused by susceptible, usually Gram-positive,
  • rganisms. The discovery of penicillin is usually attributed to Scot-

tish scientist Sir Alexander Fleming in 1928. Fleming noticed a halo

  • f inhibition of bacterial growth around a contaminant blue-green

mold Staphylococcus plate culture. Fleming concluded that the mold was releasing a substance that was inhibiting bacterial growth and lysing the bacteria. Common adverse drug reactions associated with use of the penicillins include: diarrhea, nausea, rash, urticaria, and/or superinfection (including candidiasis).

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-32
SLIDE 32

Text Segmentation

Example Penicillin is a group of beta-lactam antibiotics used in the treatment

  • f bacterial infections caused by susceptible, usually Gram-positive,
  • rganisms. The discovery of penicillin is usually attributed to Scot-

tish scientist Sir Alexander Fleming in 1928. Fleming noticed a halo

  • f inhibition of bacterial growth around a contaminant blue-green

mold Staphylococcus plate culture. Fleming concluded that the mold was releasing a substance that was inhibiting bacterial growth and lysing the bacteria. Common adverse drug reactions associated with use of the penicillins include: diarrhea, nausea, rash, urticaria, and/or superinfection (including candidiasis).

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-33
SLIDE 33

Text Segmentation

. . . indentification of (sub-)topic shifts Applications text retrieval preprocessing for other discourse processing tasks (discourse parsing, information extraction, coherence assessment etc.)

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-34
SLIDE 34

Application: Essay Scoring

Is an essay written well or not? Example We as college students, have ahead of us possibly one of the most difficult tasks left to master. It is important to realize that knowledge alone may not lead to the occupation that we desire. The task of searching for a job may be made somewhat simplified if we take some time to consider the qualities we have demonstrated in the interview, once we leave the office.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-35
SLIDE 35

Application: Information Ordering

What is the best ordering in which to present information (e.g., facts in a database)? Which ordering makes the text most coherent?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-36
SLIDE 36

Application: Information Ordering

What is the best ordering in which to present information (e.g., facts in a database)? Which ordering makes the text most coherent? Example

1 This coin is made of silver. 2 Croton is a Greek colony in South Italy. 3 This coin comes from the archaic period. 4 This coin comes from Croton. 5 Towards the end of the archaic period, coins were used for

transactions.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-37
SLIDE 37

Application: Information Ordering

What is the best ordering in which to present information (e.g., facts in a database)? Which ordering makes the text most coherent? Example

1 Towards the end of the archaic period, coins were used for

transactions.

2 This coin comes from the archaic period. 3 This coin is made of silver. 4 This coin comes from Croton. 5 Croton is a Greek colony in South Italy. Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-38
SLIDE 38

Generating Referring Expressions

The linguistic form of referring expressions (NPs) depends on the discourse context Example: Noam Chomsky, the linguist, the author of “Syntactic Structures”, the man, he . . . Application: text generation

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-39
SLIDE 39

Generating Referring Expressions

Example He claims record The 22-year-old computer science undergraduate from Bath is claiming a world record for the longest distance ridden on a unicycle in 24 hours. A unicycling student covered exactly 282 miles at Aberystwyth University’s athletics track. Sam Wakeling was aiming to beat the existing record of 235.3 miles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-40
SLIDE 40

Generating Referring Expressions

Example He claims record The 22-year-old computer science undergraduate from Bath is claiming a world record for the longest distance ridden on a unicycle in 24 hours. A unicycling student covered exactly 282 miles at Aberystwyth University’s athletics track. Sam Wakeling was aiming to beat the existing record of 235.3 miles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-41
SLIDE 41

Generating Referring Expressions

Example Unicycling student claims record A student is claiming a world record for the longest distance ridden

  • n a unicycle in 24 hours.

Sam Wakeling covered exactly 282 miles at Aberystwyth University’s athletics track. The 22-year-old computer science undergraduate from Bath was aiming to beat the existing record of 235.3 miles.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-42
SLIDE 42

Co-reference Resolution

Identify the referent of pronouns, definite NPs etc.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-43
SLIDE 43

Co-reference Resolution

Identify the referent of pronouns, definite NPs etc. Example Sophia Loren says she will always be grateful to Bono. The actress revealed that the U2 singer helped her calm down when she became scared by a thunderstorm while travelling on a plane.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-44
SLIDE 44

Co-reference Resolution

Identify the referent of pronouns, definite NPs etc. Example Sophia Loren says she will always be grateful to Bono. The actress revealed that the U2 singer helped her calm down when she became scared by a thunderstorm while travelling on a plane.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-45
SLIDE 45

Co-reference Resolution

Identify the referent of pronouns, definite NPs etc. Example Sophia Loren says she will always be grateful to Bono. The actress revealed that the U2 singer helped her calm down when she became scared by a thunderstorm while travelling on a plane.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-46
SLIDE 46

Co-reference Resolution

Identify the referent of pronouns, definite NPs etc. Example Sophia Loren says she will always be grateful to Bono. The actress revealed that the U2 singer helped her calm down when she became scared by a thunderstorm while travelling on a plane. Applications: information extraction text summarisation pre-processing for many other NLP tasks

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-47
SLIDE 47

Discourse Parsing

Identify the discourse structure, i.e. how utterances are linked

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-48
SLIDE 48

Discourse Parsing

Identify the discourse structure, i.e. how utterances are linked John hid Peter’s car keys. He was drunk.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-49
SLIDE 49

Discourse Parsing

Identify the discourse structure, i.e. how utterances are linked John hid Peter’s car keys. He was drunk. ⇒ The fact that John was drunk explains why he hid Peter’s car keys.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-50
SLIDE 50

Discourse Parsing

Identify the discourse structure, i.e. how utterances are linked John hid Peter’s car keys. He was drunk. ⇒ The fact that John was drunk explains why he hid Peter’s car keys. Mary likes chocolate, Maggie likes crisps.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-51
SLIDE 51

Discourse Parsing

Identify the discourse structure, i.e. how utterances are linked John hid Peter’s car keys. He was drunk. ⇒ The fact that John was drunk explains why he hid Peter’s car keys. Mary likes chocolate, Maggie likes crisps. ⇒ The fact that Maggie likes crisps contrasts with Mary’s liking of chocolate.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-52
SLIDE 52

Discourse Parsing

Identify the discourse structure, i.e. how utterances are linked John hid Peter’s car keys. He was drunk. ⇒ The fact that John was drunk explains why he hid Peter’s car keys. Mary likes chocolate, Maggie likes crisps. ⇒ The fact that Maggie likes crisps contrasts with Mary’s liking of chocolate. Applications: anything that involves natural language understanding question answering information extraction text summarisation etc.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-53
SLIDE 53

Temporal Ordering

Identify the temporal ordering of events referred to in discourse

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-54
SLIDE 54

Temporal Ordering

Identify the temporal ordering of events referred to in discourse Example

1 John arrived at an oasis. He saw the camels around the

waterhole.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-55
SLIDE 55

Temporal Ordering

Identify the temporal ordering of events referred to in discourse Example

1 John arrived at an oasis. He saw the camels around the

waterhole.

2 John arrived at an oasis. He left the camels around the

waterhole.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-56
SLIDE 56

Temporal Ordering

Identify the temporal ordering of events referred to in discourse Example

1 John arrived at an oasis. He saw the camels around the

waterhole.

2 John arrived at an oasis. He left the camels around the

waterhole. Applications: information extraction question answering machine translation etc.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-57
SLIDE 57

Text Summarisation

Produce a shorter version of an input text (e.g., synthesise various news stories on a topic) Example Penicillin is a group of beta-lactam antibiotics used in the treatment

  • f bacterial infections caused by susceptible, usually Gram-positive,
  • rganisms. The discovery of penicillin is usually attributed to Scot-

tish scientist Sir Alexander Fleming in 1928. Fleming noticed a halo

  • f inhibition of bacterial growth around a contaminant blue-green

mold Staphylococcus plate culture. Fleming concluded that the mold was releasing a substance that was inhibiting bacterial growth and lysing the bacteria. Common adverse drug reactions associated with use of the penicillins include: diarrhea, nausea, rash, urticaria, and/or superinfection (including candidiasis).

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-58
SLIDE 58

Text Summarisation

Produce a shorter version of an input text (e.g., synthesise various news stories on a topic) Example Penicillin is a group of beta-lactam antibiotics. It was discovered by Sir Alexander Fleming in 1928. Common adverse drug reactions associated its use include: diarrhea, nausea, rash, urticaria, and/or superinfection (including candidiasis).

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-59
SLIDE 59

Argumentative Zoning

Find structure in scientific papers (background, own contribution, relation to other work, etc.) Applications: summarisation question answering citation analysis

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-60
SLIDE 60

Argumentative Zoning

Example Researchers in knowledge representation agreed that one of the hard problems of understanding narrative is the representation of temporal information . . . Recently, Smith has suggested the following solution to this problem . . . But this solution cannot be used to interpret the following example . . . We propose a solution which circumvents this problem. . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-61
SLIDE 61

Argumentative Zoning

Example background Researchers in knowledge representation agreed that one of the hard problems of understanding narrative is the representation of temporal information . . .

  • ther work

Recently, Smith has suggested the following solution to this problem . . . But this solution cannot be used to interpret the following example . . .

  • wn contribution

We propose a solution which circumvents this problem. . .

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-62
SLIDE 62

Sentiment Analysis

Is a product review positive or negative? What is a given person’s opinion on a particular topic? Which parts of a text are objective, which reflect subjective

  • pinions?

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-63
SLIDE 63

Sentiment Analysis

Is a product review positive or negative? What is a given person’s opinion on a particular topic? Which parts of a text are objective, which reflect subjective

  • pinions?

Bob Carpenters Amazon Review of Manning & Sch¨ utze (1999) This is the best book I’ve ever read on computational linguistics. It should be ideal for both linguists who want to learn about statistical language processing and those building language applications who want to learn about linguistics. This book isn’t even published and it’s now my most highly used reference book, joining gems such as Cormen, Leiserson and Rivest’s algorithm book, Quirk et al.’s English Grammar, and Andrew Gelman’s Bayesian statistics book (three excellent companions to this book, by the way).

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-64
SLIDE 64

Sentiment Analysis

Is a product review positive or negative? What is a given person’s opinion on a particular topic? Which parts of a text are objective, which reflect subjective

  • pinions?

What does Horst K¨

  • hler think about free trade?

German President Horst K¨

  • hler opened the Hanover Fair on

Sunday, criticizing a perceived lurch towards protectionism in some

  • countries. The former head of the International Monetary Fund

demanded a ”clear-cut timetable” to conclude the current stalled round of talks to encourage free trade around the world.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-65
SLIDE 65

Sentiment Analysis

Is a product review positive or negative? What is a given person’s opinion on a particular topic? Which parts of a text are objective, which reflect subjective

  • pinions?

Tagesschau online 19.04.2009 (modified) Foreign Minister Frank-Walter Steinmeier underpinned his claim to topple Angela Merkel by presenting his party’s manifesto, which calls for increased taxes on the rich, while cutting them for those less well-off. He also emphasized his party’s aim to defend the rights of employees and their jobs and added that struggling German carmaker Opel, which employs some 26,000 people in Germany alone, had to be saved at all costs.

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse

slide-66
SLIDE 66

Dialogue Act Classification

Dialogues consist of different dialogue acts (opening, info-request, commit etc.). For dialogue processing, these need to be identified. Example A: I need to travel in May. [assert] B: And what day in May did you want to travel? [info-request, acknowledgement] A: I need to be there for a meeting that’s from the 12th to the

  • 15th. [assert, answer]

B: And what time would you like to leave Pittsburgh? [info-request, acknowledgement] A: Uh hmm I don’t think there’s many options for non-stop. . . [check, hold] B: Right. [accept, acknowledgement] There is three non-stops today. [assert]

Caroline Sporleder csporled@coli.uni-sb.de Computational Models of Discourse