Semantically Constrained Multilayer Annotation
The Case of Coreference
Jakob Prange, Nathan Schneider, and Omri Abend
Semantically Constrained Multilayer Annotation The Case of - - PowerPoint PPT Presentation
Semantically Constrained Multilayer Annotation The Case of Coreference Jakob Prange, Nathan Schneider, and Omri Abend How to annotate coreference? DMR, August 1, 2019 Prange, Schneider, Abend 2 Did anyone else have these fears ? DMR, August
Jakob Prange, Nathan Schneider, and Omri Abend
DMR, August 1, 2019 Prange, Schneider, Abend 2
DMR, August 1, 2019 Prange, Schneider, Abend 3
DMR, August 1, 2019 Prange, Schneider, Abend 4
DMR, August 1, 2019 Prange, Schneider, Abend 5
DMR, August 1, 2019 Prange, Schneider, Abend 6
DMR, August 1, 2019 Prange, Schneider, Abend 7
DMR, August 1, 2019 Prange, Schneider, Abend 8
What to annotate as mentions?
Syntactic criteria (e.g., all nouns) Semantic criteria (e.g., all events) Singletons?
How to annotate mention spans?
Minimum spans (only head words) Maximum spans (plus args & mods) Hybrid?
How to annotate coreference?
Identity, Bridging, … Unordered clusters
Tricky linguistic phenomena (e.g., coordination)
[Poesio et al., 2016]
[Moosavi and Strube, 2016; Moosavi et al., 2019]
DMR, August 1, 2019 Prange, Schneider, Abend 9
Prange, Schneider, Abend DMR, August 1, 2019 H10
DMR, August 1, 2019 Prange, Schneider, Abend 10
Prange, Schneider, Abend DMR, August 1, 2019 H11
DMR, August 1, 2019 Prange, Schneider, Abend 11
Prange, Schneider, Abend
DMR, August 1, 2019 H12
Prange, Schneider, Abend
DMR, August 1, 2019 H12
Prange, Schneider, Abend
DMR, August 1, 2019 H12
Prange, Schneider, Abend
DMR, August 1, 2019 H13
World Knowledge
Prange, Schneider, Abend
DMR, August 1, 2019 H13
World Knowledge
DMR, August 1, 2019 Prange, Schneider, Abend 14
DMR, August 1, 2019 Prange, Schneider, Abend 15
Did anyone else have these fears ? How did you get over them ?
Prange, Schneider, Abend
DMR, August 1, 2019 H16
Did anyone else have these fears ? How did you get over them ?
Prange, Schneider, Abend
DMR, August 1, 2019 H16
Did anyone else have these fears ? How did you get over them ?
Prange, Schneider, Abend
DMR, August 1, 2019 H16
Did anyone else have these fears ? How did you get over them ?
DMR, August 1, 2019 Prange, Schneider, Abend 17
Prange, Schneider, Abend
DMR, August 1, 2019 H18
Did anyone else have these fears ? How did you get over them ?
Prange, Schneider, Abend
DMR, August 1, 2019 H18
Did anyone else have these fears ? How did you get over them ?
DMR, August 1, 2019 Prange, Schneider, Abend 19
Prange, Schneider, Abend
DMR, August 1, 2019 H20
Did anyone else have these fears ? How did you get over them ?
Prange, Schneider, Abend
DMR, August 1, 2019 H20
Did anyone else have these fears ? How did you get over them ?
Prange, Schneider, Abend
DMR, August 1, 2019 H20
Did anyone else have these fears ? How did you get over them ?
DMR, August 1, 2019 Prange, Schneider, Abend 21
DMR, August 1, 2019 Prange, Schneider, Abend 22
DMR, August 1, 2019 Prange, Schneider, Abend 23
DMR, August 1, 2019 Prange, Schneider, Abend 24
OntoNotes GUM RED UCoref Anchored in syntax syntax tokens semantics Mention criteria syntax syntax semantics semantics Mention spans max max min flexible Events ✗ (✓) ✓ ✓ Singletons ✗ (✓) ✓ ✓ Annotation guidelines point in this direction.
mentions UCoref mentions OntoNotes 40 128 GUM 288 466
DMR, August 1, 2019 Prange, Schneider, Abend 25
referents UCoref referents 20 96 155 291 Numbers of mentions and referents confirm: UCoref covers more than other schemes.
mentions UCoref mentions RED 120 117
DMR, August 1, 2019 Prange, Schneider, Abend 26
referents UCoref referents 82 78 RED is very similar to UCoref in terms of coverage.
Prange, Schneider, Abend
DMR, August 1, 2019 H27
Exact Match
%
25 50 75 100
OntoNotes GUM RED
Menmons Referents
Prange, Schneider, Abend
Parmal Match
25 50 75 100
OntoNotes GUM RED
DMR, August 1, 2019 H27
Iterative greedy 1-to-1 alignment based on Dice coefficient.
Exact Match
%
25 50 75 100
OntoNotes GUM RED
Menmons Referents
Prange, Schneider, Abend
Parmal Match
25 50 75 100
OntoNotes GUM RED
DMR, August 1, 2019 H27
Iterative greedy 1-to-1 alignment based on Dice coefficient.
Exact Match
%
25 50 75 100
OntoNotes GUM RED
Menmons Referents
Prange, Schneider, Abend
Parmal Match
25 50 75 100
OntoNotes GUM RED
DMR, August 1, 2019 H27
Iterative greedy 1-to-1 alignment based on Dice coefficient.
Exact Match
%
25 50 75 100
OntoNotes GUM RED
Menmons Referents
Moosavi et al. [2019] automatically extract min spans.
DMR, August 1, 2019 Prange, Schneider, Abend 28
DMR, August 1, 2019 Prange, Schneider, Abend 29
FrameNet [Baker et al., 1998] RED [O’Gorman et al., 2016] PropBank [Palmer et al., 2005] Decompositional Semantics [White et al., 2016] AMR [Banarescu et al., 2013] Multi-sentence AMR [O’Gorman et al., 2018]
ANCHORING
Prague [Böhmová et al., 2003] Token Sentence Syntax Semantics
Modular Highly modular 1 Layer
OntoNotes [Hovy et al., 2006] GUM [Zeldes, 2017]
Jakob Prange, Nathan Schneider, and Omri Abend
https://github.com/jakpra/UCoref https://arxiv.org/abs/1906.00663
DMR, August 1, 2019 Prange, Schneider, Abend 31
DMR, August 1, 2019 Prange, Schneider, Abend 32
DMR, August 1, 2019 Prange, Schneider, Abend 33
DMR, August 1, 2019 Prange, Schneider, Abend 34
null-instantiation
DMR, August 1, 2019 Prange, Schneider, Abend 35
→ Efficient annotation across languages, even by non-experts
DMR, August 1, 2019 Prange, Schneider, Abend 36
OntoNotes GUM RED Sentences 17 70 24 Tokens 303 1180 302 UCCA units 336 1436 379 Mention candidates 195 911 186
DMR, August 1, 2019 Prange, Schneider, Abend 37
… simplifies manual annotation.
structure (tokens, syntax, …)?
separate structures/layers?
DMR, August 1, 2019 Prange, Schneider, Abend 38