Leveraging discourse information effectively for authorship - - PowerPoint PPT Presentation

leveraging discourse information effectively for
SMART_READER_LITE
LIVE PREVIEW

Leveraging discourse information effectively for authorship - - PowerPoint PPT Presentation

Leveraging discourse information effectively for authorship attribution Elisa Ferracane, Su Wang, Raymond J. Mooney University of Texas at Austin Task Authorship Attribution: identify the author of a text, given a set of author-labeled


slide-1
SLIDE 1

Leveraging discourse information effectively for authorship attribution

Elisa Ferracane, Su Wang, Raymond J. Mooney

University of Texas at Austin

slide-2
SLIDE 2

2

Task

  • Authorship Attribution: identify the author of a text, given a

set of author-labeled training texts.

slide-3
SLIDE 3

3

Authorship Attribution

  • Neural networks (e.g., character-level CNNs) have proven

very powerful…

  • capture stylometric cues at the surface level

“My very photogenic mother died in a freak accident (picnic, lightning) when I was three...” “But what principally attracted attention of Nicholas, was the old gentleman’s eye… Grafted upon the quaintness and oddity of his appearance, was something…” Lolita, Nabokov Nichola Nickleby, Dickens

slide-4
SLIDE 4

4

Authorship Attribution

  • Authors also have particular rhetorical styles…
  • But how do you incorporate discourse into a neural net?
slide-5
SLIDE 5

5

Our Contributions

1) How can you featurize discourse information? 2) How can you integrate discourse information into the network? 3) Can discourse help in SOTA model (bigram character CNN)?

slide-6
SLIDE 6

Q1: How can you featurize discourse information?

6

  • Use an entity grid model (Barzilay & Lapata, 2008) with either:
  • grammatical relations, or
  • RST discourse relations
slide-7
SLIDE 7

Q1: How can you featurize discourse information?

7

(1) My father was a clergyman of the north of England, who was deservedly respected by all who knew him; and, in his younger days, lived pretty comfortably on the joint income of a small incumbency and a snug little property

  • f his own.


(2) My mother, who married him against the wishes of her friends, was a squire’s daughter, and a woman of spirit. (3) In vain it was represented to her, that if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-8
SLIDE 8

Q1: How can you featurize discourse information?

8

(1) My father was a clergyman of the north of England, who was deservedly respected by all who knew him; and, in his younger days, lived pretty comfortably on the joint income of a small incumbency and a snug little property

  • f his own.


(2) My mother, who married him against the wishes of her friends, was a squire’s daughter, and a woman of spirit. (3) In vain it was represented to her, that if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-9
SLIDE 9

Q1: How can you featurize discourse information?

9

(1) My father was a clergyman of the north of England, who was deservedly respected by all who knew him; and, in his younger days, lived pretty comfortably on the joint income of a small incumbency and a snug little property

  • f his own.


(2) My mother, who married him against the wishes of her friends, was a squire’s daughter, and a woman of spirit. (3) In vain it was represented to her, that if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-10
SLIDE 10

10

Q1: How can you featurize discourse information?

(1) (2) (3)

f a t h e r m

  • t

h e r

Barzilay and Lapata (2008)

row: sentence column: salient entity

slide-11
SLIDE 11

Q1: How can you featurize discourse information?

11

(1) [My father]SUBJECT was a clergyman of the north of England, who was deservedly respected by all who knew him; and, in his younger days, lived pretty comfortably on the joint income of a small incumbency and a snug little property of his

  • wn.


(2) [My mother]SUBJECT, who married [him]OBJECT against the wishes of her friends, was a squire’s daughter, and a woman of spirit. (3) In vain it was represented to her, that if [she]SUBJECT became the [poor parson]OTHER’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-12
SLIDE 12

Q1: How can you featurize discourse information?

12

(1) [My father]SUBJECT was a clergyman of the north of England, who was deservedly respected by all who knew him; and, in his younger days, lived pretty comfortably on the joint income of a small incumbency and a snug little property of his

  • wn.


(2) [My mother]SUBJECT, who married [him]OBJECT against the wishes of her friends, was a squire’s daughter, and a woman of spirit. (3) In vain it was represented to her, that if [she]SUBJECT became the [poor parson]OTHER’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-13
SLIDE 13

13

Q1: How can you featurize discourse information?

(1) S

  • (2)

O S (3) X S

f a t h e r m

  • t

h e r

Grammatical relations

Barzilay and Lapata (2008)

slide-14
SLIDE 14

Q1: How can you featurize discourse information?

14

  • Discourse relations:
  • Rhetorical Structure Theory (RST)
  • Divide a document into elementary discourse units (EDUs),

usually clauses

  • Organize EDUs into a tree structure:
  • edges are discourse relation types
  • node in a relation can be either the nucleus (more

“important”) or satellite

slide-15
SLIDE 15

Q1: How can you featurize discourse information?

15

if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-16
SLIDE 16

Q1: How can you featurize discourse information?

16

if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-17
SLIDE 17

Q1: How can you featurize discourse information?

17

if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life.

slide-18
SLIDE 18

Q1: How can you featurize discourse information?

18

if she became the poor parson’s wife, she must relinquish her carriage and her lady’s-maid, and all the luxuries and elegancies of affluence; which to her were little less than the necessaries of life. condition-n condition-s

slide-19
SLIDE 19

Q1: How can you featurize discourse information?

19

if she became the poor parson’s wife, which to her were little less than the necessaries of life. condition-n condition-s she must relinquish her carriage and her lady’s- maid, and all the luxuries and elegancies

  • f affluence;

interpretation-s interpretation-n

slide-20
SLIDE 20

Q1: How can you featurize discourse information?

20

slide-21
SLIDE 21

21

Q1: How can you featurize discourse information?

(1) background.N, TopicShift, elaboration.S, background.S

  • (2)

elaboration.S elaboration.N, circumstance.N, TopicShift (3) condition.N attribution.S, condition.N, interpretation.S

f a t h e r m

  • t

h e r

RST discourse relations

Feng and Hirst (2014)

slide-22
SLIDE 22

22

Q2: How can you integrate discourse information into the network?

  • Use probability vector
  • Use embeddings!
slide-23
SLIDE 23

Q2: How can you integrate discourse information into the network?

CNN without discourse

Ruder et al., 2016; Shrestha et al., 2017, Sari et al., 2017

slide-24
SLIDE 24

Q2: How can you integrate discourse information into the network?

CNN with discourse probability vector

slide-25
SLIDE 25

Q2: How can you integrate discourse information into the network?

CNN with discourse embeddings

slide-26
SLIDE 26

Q2: How can you integrate discourse information into the network?

  • Use embeddings
  • Local vs. Global
  • Local: how are entities changing across contiguous

sentences?

  • Global: how is each entity changing across a document?
slide-27
SLIDE 27

27

f a t h e r m

  • t

h e r

Sequence:

Q2: How can you integrate discourse information into the network?

Local: by contiguous sentences

(1) S

  • (2)

O S (3) X S

1 2 3 4

so, -s, ox, ss

slide-28
SLIDE 28

28

f a t h e r m

  • t

h e r

Sequence:

Q2: How can you integrate discourse information into the network?

Global: by entity

(1) S

  • (2)

O S (3) X S

1 3 2 4

so,ox, -s, ss

slide-29
SLIDE 29

29

Datasets

Dataset # authors mean words/ auth mean words/ text IMDB62 62 349,004 349 Novel-50 50 709,880 2,000

slide-30
SLIDE 30

Results

30

1) How to featurize? grammatical relations vs. RST discourse relations

F1 90 92.5 95 97.5 100

IMDB Novel-50

grammatical relations RST discourse relations

slide-31
SLIDE 31

Results

31

1) How to featurize? grammatical relations vs. RST discourse relations

F1 90 92.5 95 97.5 100

IMDB Novel-50

grammatical relations RST discourse relations

slide-32
SLIDE 32

Results

32

2) How to integrate? probability vector vs. discourse embedding

F1 90 92.5 95 97.5 100

IMDB Novel-50

probability vector discourse embedding

slide-33
SLIDE 33

Results

33

2) How to integrate? probability vector vs. discourse embedding

F1 90 92.5 95 97.5 100

IMDB Novel-50

probability vector discourse embedding

slide-34
SLIDE 34

Results

34

2) How to integrate? local vs. global

F1 91 93.25 95.5 97.75 100

IMDB Novel-50

local global

slide-35
SLIDE 35

Results

35

2) How to integrate? local vs. global

F1 91 93.25 95.5 97.75 100

IMDB Novel-50

local global

slide-36
SLIDE 36

Results

36

3) Does discourse help? It depends…

F1 90 92.5 95 97.5 100

IMDB

No discourse baseline probability vector discourse embedding (gr. rels.) discourse embedding (RST)

slide-37
SLIDE 37

Results

37

3) Does discourse help? Yes!

F1 95 96.25 97.5 98.75 100

Novel-50

No discourse baseline probability vector discourse embedding (gr. rels.) discourse embedding (RST)

slide-38
SLIDE 38

Error Analysis

  • The least-represented author (Ambrose Bierce) obtains the

biggest improvement from discourse: —Discourse feature is more robust with smaller, fewer samples compared to character bigrams

  • Two authors who gained large improvements from discourse

wrote a variety of genres (e.g., both supernatural horror and love stories) —Character bigrams can’t generalize well to the different vocabularies, but discourse captures the similar rhetorical style

38

slide-39
SLIDE 39

Conclusion

  • Discourse improves authorship attribution over a

strong baseline of character-level CNN

  • Embeddings of RST discourse relations at the global

level perform the best

  • Works better on longer documents

39

slide-40
SLIDE 40

Thank you!

elisa@ferracane.com

Leveraging discourse information effectively for authorship attribution