1
Detecting negation scope is easy, except when it isn’t
Federico Fancellu1 Adam Lopez1 Bonnie Webber1 Hangfeng He2
1ILCC, School of Informatics, University of Edinburgh 2School of Electronics Engineering and Computer Science, Peking University
Detecting negation scope is easy, except when it isnt Federico - - PowerPoint PPT Presentation
Detecting negation scope is easy, except when it isnt Federico Fancellu 1 Adam Lopez 1 Bonnie Webber 1 Hangfeng He 2 1 ILCC, School of Informatics, University of Edinburgh 2 School of Electronics Engineering and Computer Science, Peking
1
1ILCC, School of Informatics, University of Edinburgh 2School of Electronics Engineering and Computer Science, Peking University
2
◮ Input: a sentence containing at least one negation marker (or
◮ Task: classify a token as part of the scope of the cue or not
2
◮ Input: a sentence containing at least one negation marker (or
◮ Task: classify a token as part of the scope of the cue or not
2
◮ Input: a sentence containing at least one negation marker (or
◮ Task: classify a token as part of the scope of the cue or not
3
◮ Bi-LSTM for negation scope detection ◮ Performance on par or better than previous heavily-engineered
◮ Tested on Conan-Doyle neg.[Morante et Daelemans, 2012]
4
◮ Several corpora annotated with negation scope
◮ Different annotation decisions ◮ Different domains
◮ Our question: Does it work on these corpora?
◮ BioScope (EN) [Vincze et al., 2009] ◮ 3 sub-corpora (Abstract, Full, Clinical) ◮ SFUProductReview (EN) [Konstantinova et al., 2012] ◮ CNeSp (ZH) [Zou et al., 2015] ◮ 3 sub-corpora (Product, Financial, Scientific)
5
◮ Same bi-LSTM architecture, same features ◮ Add a 4-parameter transition matrix to create the dependency
n
6
◮ Evaluation
◮ Token-level: F1 on tokens correctly classified ◮ Scope-level: Accuracy of full scopes we correctly match
◮ Performance on par or better than previous work
7
8
9
10
11
◮ Easy: predictable by punctuation
◮ Hard: not predictable by punctuation
12
13
◮ Most of the errors are due to the model trying to match
13
◮ Most of the errors are due to the model trying to match
13
◮ Most of the errors are due to the model trying to match
13
◮ Most of the errors are due to the model trying to match
14
14
14
15
◮ We re-annotated 100 randomly selected sentences of 3
16
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
10 20 30 40 50 60 70 80 90 100
punct dev punct tst no punct dev no punct tst
17
◮ GOOD PERFORMANCE FEELS GREAT BUT
◮ Detecting negation scope is easy, except when it
◮ focus detection on those more difficult cases?