Exploring Lexicalized Features for Coreference Resolution Anders Bj - PowerPoint PPT Presentation

System Results Exploring Lexicalized Features for Coreference Resolution Anders Bj¨ orkelund and Pierre Nugues June 24, 2011 Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 1 / 10

System Shared Task System Results Features Overview Pair-wise classifier based on Soon et al. (2001) Syntactic dependencies obtained through an automatic conversion from the constituents Large number of lexical and dependency-based feature templates Automatic feature selection Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 2 / 10

System Shared Task System Results Features System Architecture Preprocessing Mention extraction — All NPs and possessive pronouns Conversion to syntactic dependencies using the LTH converter Pair-wise classifier using logistic regression (LIBLINEAR) Closest-first clustering for pronouns Best-first clustering for nonpronominals Postprocessing (next slide) Recovery of missed mentions using string matching Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 3 / 10

System Shared Task System Results Features Postprocessing Not all mentions are extracted during mention extraction The automatically parsed constituents contain mistakes NML constituents were disregarded during mention extraction Obvious and easy examples include proper nouns Recovering missed mentions: Search the document for spans of one or more proper nouns whose immediate parent was not clustered Try to match this span of proper nouns to all mentions that were clustered by the classifier using string match If match, add this span to corresponding chain Example (NP (NML (NNP Hong) (NNP Kong)) (NN cinema)) Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 4 / 10

System Shared Task System Results Features Features (baseline) Baseline system: Reimplementation of the Soon et al. (2001) system with 12 features, e.g. StringMatch GenderAgreement AnaphorIsPronoun AnaphorIsDefinite ... These features are extracted using hand-crafted rules They can often be simply reframed in terms of dependencies: IsPronoun can be deduced from POS tag of head word IsDefinite can be deduced from surface form of leftmost child of head word Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 5 / 10

System Shared Task System Results Features Feature Templates To enable a systematic search without requiring prior knowledge, we defined additional feature templates Using the dependency graph of the noun phrase: Surface form, POS tag, dependency label of HeadWord, LeftMostChild, RightMostChild, HeadGovernor, HeadLeftSibling, HeadRightSibling Dependency graph paths, i.e. direction of edges and Form, POS, or dependency label A number of variations of semantic role features Total of ca. 60 feature templates (See paper for details) Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 6 / 10

System Shared Task System Results Features Feature Selection Baseline set was the Soon et al (2001) feature set Pool of feature templates including all above and a set of manually selected pairs, e.g. AntecedentHeadForm + AnaphorHeadForm AntecedentHeadLeftMostChild + AnaphorHeadLeftMostChild Greedy forward-backward selection, incrementally adding or removing one feature template from the current set Cross-validated over the training set, in order not to skew it towards the development set Optimized for the CoNLL score Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 7 / 10

System Postprocessing Results Evaluation set Postprocessing (development set) Impact of the postprocessing step: MD MUC BCUB CEAFM CEAFE BLANC No PP 66.56 54.61 65.93 51.91 40.46 69.36 With PP 67.21 55.62 66.29 52.51 40.67 70.00 Increase 0.65 1.01 0.36 0.60 0.21 0.64 Overall beneficial – increased precision and recall across all metrics Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 8 / 10

System Postprocessing Results Evaluation set Results (evaluation set) Results on the test set – Fourth place in the Shared Task R P F1 Mention detection 69.87 68.08 68.96 MUC 60.20 57.10 58.61 BCUB 66.74 64.23 65.46 CEAFM 51.45 51.45 51.45 CEAFE 38.09 41.06 39.52 BLANC 71.99 70.31 71.11 Official CoNLL score 55.01 54.13 54.53 Our system makes no use of global optimization or constraints We believe feature selection was a key ingredient This technique should be replicable to other languages Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 9 / 10

System Postprocessing Results Evaluation set Questions Questions? Anders Bj¨ orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 10 / 10

Exploring Lexicalized Features for Coreference Resolution Anders Bj - PowerPoint PPT Presentation

System Results Exploring Lexicalized Features for Coreference Resolution Anders Bj orkelund and Pierre Nugues June 24, 2011 Anders Bj orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 1

Evaluating Theories of Coreference Resolution Coreference Resolution: The Task Bayer AG has

Latent Structures for Coreference Resolution Sebastian Martschat and Michael Strube Heidelberg

Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring

CORBON 2016: Coreference Resolution Beyond OntoNotes NAACL HLT 2016 Workshop Maciej Ogrodniczuk

Lexicalized Probabilistic Context-Free Grammars Michael Collins, Columbia University Overview

Neural Networks and Coreference Resolution for Slot Filling Heike Adel, Hinrich Sch utze Team

End-to-end Neural Coreference Resolution Kenton Lee, Luheng He, Mike Lewis and Luke Zettlemoyer

CSEP 517 Natural Language Processing Coreference Resolution Luke Zettlemoyer University of

End-to-end Neural Coreference Resolution Kenton Lee Luheng He Mike Lewis Luke

GroRef: Rule-Based Coreference Resolution for Dutch Rob van der Goot, Hessel Haagsma, Dieke Oele

Unsupervised Coreference Resolution in a Nonparametric Bayesian Model Aria Haghighi and Dan

Unsupervised Coreference Resolution in a Nonparametric Bayesian Model Aria Haghighi and Dan Klein

Higher-order Coreference Resolution with Coarse-to-fine Inference Kenton Lee * Luheng He Luke

Interplay of Coreference and Discourse Research and Annotations Anna Nedoluzhko Charles University,

Easy Victories and Uphill Ba4les in Coreference Resolu9on Greg

SIGBI Limited General Meeting 2019 Resolutions 1-6 Resolution 1 Resolution 2 Resolution 3

Bimodal Software Documentation Software Documentation [1985] University of Adelaide 2 Software

? (entity type) Apr 23, 2007 NAACL-HLT 2 1 What Is Relation Extraction? hundreds of

Filtering relevant information from reports on flood Lubo s Popel nsk y Knowledge

Radiative pion capture in 2 H, 3 He and 3 H J. Golak , R. Skibiski, K. Topolnicki, H. Witaa,

Overview Last Time Mid-Way Evaluation Forward Algorithm Quiz & Bonus Points

Cadastral GIS An example from Bolivia Pretoria November 7, 2002 Christiaan Lemmen Foto: Edgar

Probabilistic Frame-Semantic Parsing Noah A. Smith Dipanjan Das Nathan Schneider Desai Chen

Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, June 1996, pp. 79-86.

Sambuz

Useful Links

Newsletter

Mail Us

Exploring Lexicalized Features for Coreference Resolution Anders Bj - PowerPoint PPT Presentation

System Results Exploring Lexicalized Features for Coreference Resolution Anders Bj orkelund and Pierre Nugues June 24, 2011 Anders Bj orkelund and Pierre Nugues Exploring Lexicalized Features for Coreference Resolution June 24, 2011 1

Evaluating Theories of Coreference Resolution Coreference Resolution: The Task Bayer AG has

Latent Structures for Coreference Resolution Sebastian Martschat and Michael Strube Heidelberg

Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring

CORBON 2016: Coreference Resolution Beyond OntoNotes NAACL HLT 2016 Workshop Maciej Ogrodniczuk

Lexicalized Probabilistic Context-Free Grammars Michael Collins, Columbia University Overview

Neural Networks and Coreference Resolution for Slot Filling Heike Adel, Hinrich Sch utze Team

End-to-end Neural Coreference Resolution Kenton Lee, Luheng He, Mike Lewis and Luke Zettlemoyer

CSEP 517 Natural Language Processing Coreference Resolution Luke Zettlemoyer University of

End-to-end Neural Coreference Resolution Kenton Lee Luheng He Mike Lewis Luke

GroRef: Rule-Based Coreference Resolution for Dutch Rob van der Goot, Hessel Haagsma, Dieke Oele

Unsupervised Coreference Resolution in a Nonparametric Bayesian Model Aria Haghighi and Dan

Unsupervised Coreference Resolution in a Nonparametric Bayesian Model Aria Haghighi and Dan Klein

Higher-order Coreference Resolution with Coarse-to-fine Inference Kenton Lee * Luheng He Luke

Interplay of Coreference and Discourse Research and Annotations Anna Nedoluzhko Charles University,

Easy Victories and Uphill Ba4les in Coreference Resolu9on Greg

SIGBI Limited General Meeting 2019 Resolutions 1-6 Resolution 1 Resolution 2 Resolution 3

Bimodal Software Documentation Software Documentation [1985] University of Adelaide 2 Software

? (entity type) Apr 23, 2007 NAACL-HLT 2 1 What Is Relation Extraction? hundreds of

Filtering relevant information from reports on flood Lubo s Popel nsk y Knowledge

Radiative pion capture in 2 H, 3 He and 3 H J. Golak , R. Skibiski, K. Topolnicki, H. Witaa,

Overview Last Time Mid-Way Evaluation Forward Algorithm Quiz &amp; Bonus Points

Cadastral GIS An example from Bolivia Pretoria November 7, 2002 Christiaan Lemmen Foto: Edgar

Probabilistic Frame-Semantic Parsing Noah A. Smith Dipanjan Das Nathan Schneider Desai Chen

Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, June 1996, pp. 79-86.

Sambuz

Useful Links

Newsletter

Mail Us

Overview Last Time Mid-Way Evaluation Forward Algorithm Quiz & Bonus Points