End-to-end Neural Coreference Resolution Kenton Lee Luheng He - - PowerPoint PPT Presentation

end to end neural coreference resolution
SMART_READER_LITE
LIVE PREVIEW

End-to-end Neural Coreference Resolution Kenton Lee Luheng He - - PowerPoint PPT Presentation

End-to-end Neural Coreference Resolution Kenton Lee Luheng He Mike Lewis Luke Zettlemoyer UWNLP Allen Institute for University of Washington Facebook AI Research Artificial Intelligence 1 Coreference Resolution Input document


slide-1
SLIDE 1

End-to-end Neural Coreference Resolution

1

Kenton Lee Luheng He Mike Lewis Luke Zettlemoyer

UWNLP

University of Washington Facebook AI Research Allen Institute for Artificial Intelligence

slide-2
SLIDE 2

Coreference Resolution

2

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document

slide-3
SLIDE 3

Coreference Resolution

3

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document

Cluster #1 A fire in a Bangladeshi garment factory the blaze in the four-story building

slide-4
SLIDE 4

Coreference Resolution

4

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

Cluster #1 A fire in a Bangladeshi garment factory the blaze in the four-story building Cluster #2 a Bangladeshi garment factory the four-story building

Input document

slide-5
SLIDE 5

Coreference Resolution

5

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

Cluster #1 A fire in a Bangladeshi garment factory the blaze in the four-story building Cluster #2 a Bangladeshi garment factory the four-story building Cluster #3 at least 37 people the deceased

Input document

slide-6
SLIDE 6

Two Subproblems

6 A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document

Mention detection

A fire in a Bangladeshi garment factory at least 37 people … the four-story building

Mention clustering

Cluster #1 A fire in a Bangladeshi garment factory the blaze in the four-story building Cluster #2 a Bangladeshi garment factory the four-story building Cluster #3 at least 37 people the deceased

slide-7
SLIDE 7

Previous Approach: Rule-based pipeline

7

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Candidate mentions A fire in a Bangladeshi garment factory garment factory at least 37 people dead and 100 hospitalized … Input document Hand-engineered rules Syntactic parser Mention #1 Mention #2 Coreferent? A fire in a Bangladeshi garment factory garment ✓/✗ garment factory ✓/✗ factory at least 37 people dead and 100 hospitalized ✓/✗ … … ✓/✗

slide-8
SLIDE 8

Previous Approach: Rule-based pipeline

8

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Candidate mentions A fire in a Bangladeshi garment factory garment factory at least 37 people dead and 100 hospitalized … Input document Hand-engineered rules Syntactic parser Mention #1 Mention #2 Coreferent? A fire in a Bangladeshi garment factory garment ✓/✗ garment factory ✓/✗ factory at least 37 people dead and 100 hospitalized ✓/✗ … … ✓/✗

Mention clustering: main source of improvement for many years!

  • Haghighi and Klein (2010)
  • Raghunathan et al. (2010)
  • Clark & Manning (2016)
slide-9
SLIDE 9

Previous Approach: Rule-based pipeline

9

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Candidate mentions A fire in a Bangladeshi garment factory garment factory at least 37 people dead and 100 hospitalized … Input document Hand-engineered rules Syntactic parser Mention #1 Mention #2 Coreferent? A fire in a Bangladeshi garment factory garment ✓/✗ garment factory ✓/✗ factory at least 37 people dead and 100 hospitalized ✓/✗ … … ✓/✗

Relies on parser for:

  • mention detection
  • syntactic features for clustering (e.g. head words)
slide-10
SLIDE 10

10

Our Contribution: End-to-end Approach

  • Joint mention detection and clustering
  • No preprocessing (no parser, no POS-tagger etc.)
slide-11
SLIDE 11

11

Key Challenges

  • Inference: can we do better than naive O(N4) runtime?
  • Data: can we learn with partial labels?
  • Model: can we induce rich features (e.g. head words)?
slide-12
SLIDE 12

12 A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

Naive joint model is O(N4):

Inference challenge: Can we do better than O(N4)?

slide-13
SLIDE 13

13 Span #1 A A fire A fire in …

O(N2) spans in every document

Inference challenge: Can we do better than O(N4)?

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

Naive joint model is O(N4):

slide-14
SLIDE 14

14 Span #1 Span #2 Coreferent? A A fire

✓/✗

A fire A fire in

✓/✗

A fire in A fire in a

✓/✗

… …

✓/✗

O(N4) pairwise decisions

Inference challenge: Can we do better than O(N4)?

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

Naive joint model is O(N4):

slide-15
SLIDE 15

End-to-end Approach

15

  • Consider all possible spans
  • Learn to rank antecedent spans
  • Factored model to prune search space
slide-16
SLIDE 16

16

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out.

Every span independently chooses an antecedent

Input document

Span Ranking

slide-17
SLIDE 17

17

  • Reason over all possible spans
  • Assign an antecedent to every span

Span Antecedent 1 A 2 A fire 3 A fire in … … … M

  • ut

y3 y2 y1 yM

Span Ranking

slide-18
SLIDE 18

18

y3 ∈ {✏, 1, 2}

  • Reason over all possible spans
  • Assign an antecedent to every span

Span Antecedent 1 A 2 A fire 3 A fire in … … … M

  • ut

y3 y2 y1 yM

Span Ranking

slide-19
SLIDE 19

19

y3 ∈ {✏, 1, 2}

: no coreference link

  • Reason over all possible spans
  • Assign an antecedent to every span

Span Antecedent 1 A 2 A fire 3 A fire in … … … M

  • ut

y3 y2 y1 yM

Span Ranking

slide-20
SLIDE 20

20

y3 ∈ {✏, 1, 2}

Coreference link from span 1 to span 3

  • Reason over all possible spans
  • Assign an antecedent to every span

Span Antecedent 1 A 2 A fire 3 A fire in … … … M

  • ut

y3 y2 y1 yM

Span Ranking

slide-21
SLIDE 21

21

y3 ∈ {✏, 1, 2}

Coreference link from span 2 to span 3

  • Reason over all possible spans
  • Assign an antecedent to every span

Span Antecedent 1 A 2 A fire 3 A fire in … … … M

  • ut

y3 y2 y1 yM

Span Ranking

slide-22
SLIDE 22

Example Clustering

22

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document

Span Antecedent ( ) A A fire … … a Bangladeshi garment factory … … the four-story building a Bangladeshi garment factory … …

  • ut

✏ ✏ ✏ ✏

yi

slide-23
SLIDE 23

Example Clustering

23

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document

Span Antecedent ( ) A A fire … … a Bangladeshi garment factory … … the four-story building a Bangladeshi garment factory … …

  • ut

✏ ✏ ✏ ✏

yi

Not a mention

slide-24
SLIDE 24

Example Clustering

24

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document

Span Antecedent ( ) A A fire … … a Bangladeshi garment factory … … the four-story building a Bangladeshi garment factory … …

  • ut

✏ ✏ ✏ ✏

yi

No link with previously occurring span

slide-25
SLIDE 25

Example Clustering

25

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document

Span Antecedent ( ) A A fire … … a Bangladeshi garment factory … … the four-story building a Bangladeshi garment factory … …

  • ut

✏ ✏ ✏ ✏

yi

Predicted coreference link

slide-26
SLIDE 26

26

P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

Span Ranking Model

slide-27
SLIDE 27

P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

27

Independent decision for every span

Span Ranking Model

slide-28
SLIDE 28

P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

28

Pairwise coreference score between span i and span j

s(i, j)

Span Ranking Model

slide-29
SLIDE 29

P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

29

s(i, j) s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

Factor coreference score to enable span pruning:

s(i, j)

Span Ranking Model

slide-30
SLIDE 30

P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

30

s(i, j) s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

Factor coreference score to enable span pruning:

s(i, j)

Is this span a mention?

Span Ranking Model

slide-31
SLIDE 31

P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

31

s(i, j) s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

Factor coreference score to enable span pruning:

s(i, j)

Is span j an antecedent of span i?

Span Ranking Model

slide-32
SLIDE 32

32

s(i, j)

Dummy antecedent has a fixed zero score

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

Factor coreference score to enable span pruning:

s(i, j) P(y1, . . . , yM | D) =

M

Y

i=1

P(yi | D) =

M

Y

i=1

es(i,yi) P

y0∈Y(i) es(i,y0)

Span Ranking Model

slide-33
SLIDE 33

Two-stage Beam Search

33

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

slide-34
SLIDE 34

Two-stage Beam Search

34

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out.

Span A

  • 10

A fire 4 … … a Bangladeshi garment factory 6 … … the four-story building 10 … …

  • ut
  • 5

sm

Input document (N words)

slide-35
SLIDE 35

Two-stage Beam Search

35

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out.

Spans with low mention scores likely to have a negative overall score

Span A

  • 10

A fire 4 … … a Bangladeshi garment factory 6 … … the four-story building 10 … …

  • ut
  • 5

sm

Input document (N words)

slide-36
SLIDE 36

Two-stage Beam Search

36 Span A fire 4 … … a Bangladeshi garment factory 6 … … the four-story building 10 … …

sm

Span A

  • 10

A fire 4 … … a Bangladeshi garment factory 6 … … the four-story building 10 … …

  • ut
  • 5

sm

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

Keep top λN

slide-37
SLIDE 37

Two-stage Beam Search

37 Target Span A fire … a Bangladeshi garment factory … the four-story building …

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

slide-38
SLIDE 38

Two-stage Beam Search

38 Antecedent A fire … a Bangladeshi garment factory … the four-story building … Target Span A fire

  • a Bangladeshi garment factory
  • 10
  • 5

… … …

  • the four-story building

2

  • 3

10

  • 5

… … … … …

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

slide-39
SLIDE 39

39

Factored model enables aggressive pruning

Span #1 Span #2 Coreferent? A A fire

✓/✗

A fire A fire in

✓/✗

A fire in A fire in a

✓/✗

… …

✓/✗

Inference challenge: Can we do better than O(N4)?

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Witnesses say the only exit door was on the ground floor, and that it was locked when the fire broke out. Input document (N words)

Naive joint model is O(N4):

slide-40
SLIDE 40

40

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building……. Input document

Data Challenge: Can we learn with partial labels?

Only clusters with multiple mentions annotated:

slide-41
SLIDE 41

41

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building……. Input document

Singleton mention missing from data

Only clusters with multiple mentions annotated:

Data Challenge: Can we learn with partial labels?

slide-42
SLIDE 42

Learning

42

Marginal log-likelihood objective.

log

M

Y

i=1

X

ˆ y∈Y(i)∩gold(i)

P(ˆ y | D)

slide-43
SLIDE 43

Learning

43

  • Related to Durrett & Klein (2013)

Marginal log-likelihood objective.

log

M

Y

i=1

X

ˆ y∈Y(i)∩gold(i)

P(ˆ y | D)

slide-44
SLIDE 44

Learning

44

  • Related to Durrett & Klein (2013)
  • Model can assign credit/blame to the mention or antecedent factors

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

Marginal log-likelihood objective.

log

M

Y

i=1

X

ˆ y∈Y(i)∩gold(i)

P(ˆ y | D)

slide-45
SLIDE 45

Credit Assignment Example

45

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document left at 37 people

Span i Span j

slide-46
SLIDE 46

46

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document left at 37 people

Span i Span j

Bad mention

Credit Assignment Example

slide-47
SLIDE 47

47

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document left at 37 people

Span i Span j

Blame mention factor for absent link

Credit Assignment Example

slide-48
SLIDE 48

48

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document a Bangladeshi garment factory left at

Span i Span j

Bad mention

Credit Assignment Example

slide-49
SLIDE 49

49

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document a Bangladeshi garment factory left at

Span i Span j

Blame mention factor for absent link

Credit Assignment Example

slide-50
SLIDE 50

50

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document a Bangladeshi garment factory 37 people

Span i Span j

Incompatible mentions

Credit Assignment Example

slide-51
SLIDE 51

51

s(i, j) = ( sm(i) + sm(j) + sa(i, j) j 6= ✏ j = ✏

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building. Input document a Bangladeshi garment factory 37 people

Span i Span j

Blame antecedent factor for absent link

Credit Assignment Example

slide-52
SLIDE 52

52

Missing mentions are latent in the joint model

Data Challenge: Can we learn with partial labels?

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building……. Input document

Only clusters with multiple mentions annotated:

slide-53
SLIDE 53

53

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building……. Input document

Model Challenge: Can we induce rich features?

Lexical and contextual cues are useful:

slide-54
SLIDE 54

54

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building……. Input document

Lexical and contextual cues are useful:

e.g. paraphrased head words

Model Challenge: Can we induce rich features?

slide-55
SLIDE 55

Neural Span Representations

55 General Electric said the Postal Service contacted the company the Postal Service

+

Word & character embeddings Span representation

slide-56
SLIDE 56

56 General Electric said the Postal Service contacted the company the Postal Service

+

Bidirectional LSTM Word & character embeddings Span representation

Neural Span Representations

slide-57
SLIDE 57

57

Bidirectional LSTM Word & character embeddings Span representation

General Electric said the Postal Service contacted the company the Postal Service

Boundary representations

Neural Span Representations

slide-58
SLIDE 58

58 General Electric said the Postal Service contacted the company the Postal Service

+

Bidirectional LSTM Word & character embeddings Head-finding attention Span representation

Attention mechanism to learn headedness

Neural Span Representations

slide-59
SLIDE 59

59

Bidirectional LSTM Word & character embeddings Head-finding attention Span representation

General Electric said the Postal Service contacted the company General Electric

+

Electric said the

+

the Postal Service

+

Service contacted the

+

the company

+

Compute all span representations

Neural Span Representations

slide-60
SLIDE 60

Coreference Architecture

60

P(yi | D)

General Electric the Postal Service the company s(the company, General Electric) s(the company, the Postal Service) s(the company, ✏) = 0

Span representation

slide-61
SLIDE 61

General Electric the Postal Service the company s(the company, General Electric) s(the company, the Postal Service) s(the company, ✏) = 0

Coreference Architecture

61

P(yi | D)

span i

Span representation

slide-62
SLIDE 62

Coreference Architecture

62

P(yi | D)

General Electric the Postal Service the company

Span representation

sm(i)

slide-63
SLIDE 63

Coreference Architecture

63

P(yi | D)

General Electric the Postal Service the company

Span representation

sm(i) sa(i, j)

slide-64
SLIDE 64

Coreference Architecture

64

P(yi | D)

General Electric the Postal Service the company

Span representation

sm(i) s(i, j) sa(i, j)

slide-65
SLIDE 65

Coreference Architecture

65

Span representation

P(yi | D)

General Electric the Postal Service the company s(the company, General Electric) s(the company, the Postal Service) s(the company, ✏) = 0

sm(i) s(i, j) sa(i, j)

slide-66
SLIDE 66

Experimental Setup

66

Dataset: English OntoNotes (CoNLL-2012) Genres: Telephone conversations, newswire, newsgroups, broadcast conversation, broadcast news, weblogs Documents: 2802 training, 343 development, 348 test

slide-67
SLIDE 67

Experimental Setup

67

Dataset: English OntoNotes (CoNLL-2012) Genres: Telephone conversations, newswire, newsgroups, broadcast conversation, broadcast news, weblogs Documents: 2802 training, 343 development, 348 test Additional pruning: Maximum span width, maximum sentence training, suppress spans with inconsistent bracketing, maximum number of antecedents Longest document has 4009 words!

slide-68
SLIDE 68

Experimental Setup

68

Dataset: English OntoNotes (CoNLL-2012) Genres: Telephone conversations, newswire, newsgroups, broadcast conversation, broadcast news, weblogs Documents: 2802 training, 343 development, 348 test Additional pruning: Maximum span width, maximum sentence training, suppress spans with inconsistent bracketing, maximum number of antecedents Features: distance between spans, span width Metadata: speaker information, genre Longest document has 4009 words!

slide-69
SLIDE 69

Coreference Results

69

Test Avg. F1 (%) 50.0 54.0 58.0 62.0 66.0 70.0

slide-70
SLIDE 70

Coreference Results

70

Test Avg. F1 (%) 50.0 54.0 58.0 62.0 66.0 70.0 Durrett & Klein (2013) Björkelund & Kuhn (2014) Martschat & Strube (2015)

62.5 61.6 60.3

Linear models

slide-71
SLIDE 71

Coreference Results

71

Test Avg. F1 (%) 50.0 54.0 58.0 62.0 66.0 70.0 Durrett & Klein (2013) Björkelund & Kuhn (2014) Martschat & Strube (2015) Wiseman et al. (2016) Clark & Manning (2016)

65.7 64.2 62.5 61.6 60.3

Linear models Neural models

slide-72
SLIDE 72

Coreference Results

72

Test Avg. F1 (%) 50.0 54.0 58.0 62.0 66.0 70.0 Durrett & Klein (2013) Björkelund & Kuhn (2014) Martschat & Strube (2015) Wiseman et al. (2016) Clark & Manning (2016)

65.7 64.2 62.5 61.6 60.3

Pipelined models

slide-73
SLIDE 73

Coreference Results

73

Test Avg. F1 (%) 50.0 54.0 58.0 62.0 66.0 70.0 Durrett & Klein (2013) Björkelund & Kuhn (2014) Martschat & Strube (2015) Wiseman et al. (2016) Clark & Manning (2016) Our model (single) Our model (ensemble)

68.8 67.2 65.7 64.2 62.5 61.6 60.3

Pipelined models End-to-end models

slide-74
SLIDE 74

% mention recall 20 40 60 80 100 Spans per word 0.1 0.2 0.3 0.4 0.5

Raghunathan et al. (2010) Our model (actual threshold) Our model (various thresholds)

Mention Recall

74

slide-75
SLIDE 75

% mention recall 20 40 60 80 100 Spans per word 0.1 0.2 0.3 0.4 0.5

Raghunathan et al. (2010) Our model (actual threshold) Our model (various thresholds)

Mention Recall

75

Recall of gold mentions increases as we keep more spans

slide-76
SLIDE 76

% mention recall 20 40 60 80 100 Spans per word 0.1 0.2 0.3 0.4 0.5

Raghunathan et al. (2010) Our model (actual threshold) Our model (various thresholds)

Mention Recall

76

92.7% @ 0.4 spans per word

slide-77
SLIDE 77

% mention recall 20 40 60 80 100 Spans per word 0.1 0.2 0.3 0.4 0.5

Raghunathan et al. (2010) Our model (actual threshold) Our model (various thresholds)

Mention Recall

77

92.7% @ 0.4 spans per word 89.2% @ 0.26 spans per word

slide-78
SLIDE 78

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

Qualitative Analysis

78

slide-79
SLIDE 79

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

79

: Mention in a predicted cluster

Qualitative Analysis

slide-80
SLIDE 80

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

80

: Mention in a predicted cluster : Head-finding attention weight

Qualitative Analysis

slide-81
SLIDE 81

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

81

: Mention in a predicted cluster : Head-finding attention weight

Qualitative Analysis

Attention-based head finder facilitates soft similarity cues

slide-82
SLIDE 82

A fire in a Bangladeshi garment factory has left at least 37 people dead and 100 hospitalized. Most of the deceased were killed in the crush as workers tried to flee the blaze in the four-story building.

82

: Mention in a predicted cluster : Head-finding attention weight

Qualitative Analysis

Good head-finding requires word-order information!

slide-83
SLIDE 83

Head-finding Agreement

83

% of constituent spans with predicted heads that agree with syntactic heads

% agreement 25 50 75 100 Span width 1 2 3 4 5 6 7 8 9 10

slide-84
SLIDE 84

Common Error Case

: Mention in a predicted cluster : Head-finding attention weight

84

The flight attendants have until 6:00 today to ratify labor concessions. The pilots' union and ground crew did so yesterday.

slide-85
SLIDE 85

: Mention in a predicted cluster : Head-finding attention weight

85

The flight attendants have until 6:00 today to ratify labor concessions. The pilots' union and ground crew did so yesterday. Conflating relatedness with paraphrasing

Common Error Case

slide-86
SLIDE 86

Conclusion

86

  • State-of-the-art end-to-end coreference resolver
  • Scalable inference
  • Learns latent mentions and heads
  • https://github.com/kentonl/e2e-coref
slide-87
SLIDE 87

Conclusion

87

  • State-of-the-art end-to-end coreference resolver
  • Scalable inference
  • Learns latent mentions and heads
  • https://github.com/kentonl/e2e-coref
  • Relatively simplistic model:
  • Doesn’t explicitly model clusters
  • Lacks discourse reasoning and world knowledge
  • Still a long way to go!