SLIDE 1

AMR Normalization for Fairer Evaluation

Michael Wayne Goodman goodmami@uw.edu Nanyang Technological University, Singapore 2019-09-13

SLIDE 2

Presentation agenda

  • Introduction: AMR, PENMAN, and Smatch
  • Normalization
  • Experiment
  • Conclusion

SLIDE 3

AMR

Abstract Meaning Representation

  • Compact encoding of sentential semantics as a DAG
  • Independent of any syntactic analyses
  • Hand-annotated gold data: some free, most LDC
  • The “Penn Treebank of semantics” (Banarescu et al., 2013)

SLIDE 4

Example

  • “I had let my tools drop from my hands.”

(The Little Prince Corpus, id: lpp_1943.355)

(l / let-01
   :ARG0 (i / i)
   :ARG1 (d / drop-01
      :ARG1 (t / tool :poss i)
      :ARG3 (h / hand :part-of i)))

SLIDE 5

PENMAN Notation

AMR is encoded in PENMAN notation

  • l is node id, let-01 is node label, :ARG0 is edge label
  • Bracketing alone forms a tree
  • Node ids allow re-entrancy
  • Inverted edges (:part-of) allow multiple roots

(l / let-01
   :ARG0 (i / i)
   :ARG1 (d / drop-01
      :ARG1 (t / tool :poss i)
      :ARG3 (h / hand :part-of i)))

SLIDE 6

Triples

PENMAN graphs translate to a conjunction of triples

(l / let-01                 instance(l, let-01) ^
   :ARG0 (i / i)            ARG0(l, i) ^ instance(i, i) ^
   :ARG1 (d / drop-01       ARG1(l, d) ^ instance(d, drop-01) ^
      :ARG1 (t / tool       ARG1(d, t) ^ instance(t, tool) ^
         :poss i)           poss(t, i) ^
      :ARG3 (h / hand       ARG3(d, h) ^ instance(h, hand) ^
         :part-of i)))      part-of(h, i)
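The translation can be sketched as a tiny recursive-descent parser. This is a hand-rolled illustration covering only the syntax subset used on this slide, not the author's penman library (linked on the final slide):

```python
import re

def penman_to_triples(s):
    """Decompose a PENMAN string into (role, source, target) triples."""
    tokens = re.findall(r'[()/]|[^\s()/]+', s)
    pos = 0

    def parse_node():
        nonlocal pos
        pos += 1                               # consume '('
        var = tokens[pos]; pos += 1            # node id, e.g. 'l'
        pos += 1                               # consume '/'
        label = tokens[pos]; pos += 1          # node label, e.g. 'let-01'
        triples = [('instance', var, label)]
        while tokens[pos] != ')':
            role = tokens[pos].lstrip(':'); pos += 1
            if tokens[pos] == '(':             # target is a nested node
                child, sub = parse_node()
                triples.append((role, var, child))
                triples.extend(sub)
            else:                              # target is a node id or constant
                triples.append((role, var, tokens[pos])); pos += 1
        pos += 1                               # consume ')'
        return var, triples

    return parse_node()[1]
```

Running it on the Little Prince example yields the eleven triples shown above, in the same order as the notation.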

SLIDE 7

Back to AMR

What is AMR beyond PENMAN graphs?

  • AMR is the model, PENMAN the encoding scheme
  • Made up of “concepts” (nodes) and “relations” (edges)
  • Verbal concepts taken from OntoNotes (Weischedel et al., 2011), others invented as necessary

  • Defined by the AMR Specification [1] and annotator docs
  • Mostly finite inventory of roles (except :opN, :sntN)
  • Constraints (e.g., no cycles), and valid transformations (inversions, reification)

[1] https://github.com/amrisi/amr-guidelines/blob/master/amr.md

SLIDE 8

Smatch

Smatch is the prevailing evaluation metric for AMR

  • For two AMR graphs, find mappings of node ids
  • Choose the mapping that maximizes matching triples
  • Calculate precision, recall, and F1 (the Smatch score)
  • Example:

(s / see-01                (s / see-01
   :ARG0 (g / girl)           :ARG0 (g / girl)
   :ARG1 (d / dog             :ARG1 (c / cat))
      :quant 2))

Left: 7 triples, Right: 6, Matching: 5
Precision: 5/7 = 0.71; Recall: 5/6 = 0.83; F1 = 0.77
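The score computation can be sketched as a brute-force search over node-id mappings (real Smatch uses hill climbing, since exhaustive search is exponential). Triples are (role, source, target) tuples; representing the TOP triple as the root id paired with its concept is an assumption made here for illustration:

```python
from itertools import permutations

def smatch_score(gold, hyp):
    """Exhaustive Smatch sketch over lists of (role, source, target) triples."""
    gvars = sorted({t[1] for t in gold})   # node ids appear as triple sources
    hvars = sorted({t[1] for t in hyp})
    best = 0
    # try every injective mapping of hypothesis node ids onto gold node ids
    for perm in permutations(gvars, len(hvars)):
        m = dict(zip(hvars, perm))
        mapped = {(r, m.get(a, a), m.get(b, b)) for r, a, b in hyp}
        best = max(best, len(mapped & set(gold)))
    p, r = best / len(hyp), best / len(gold)
    return p, r, 2 * p * r / (p + r)
```

With the left graph above as the hypothesis and the right as gold, the best mapping sends d to c and leaves s and g fixed, matching 5 triples and reproducing the 0.71 / 0.83 / 0.77 scores.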

SLIDE 9

What’s the Problem?

AMR has alternations that are meaning-equivalent according to the specification

  • Some idiosyncratic role inversions, e.g.:
      • :mod <-> :domain
      • :consist-of <-> :consist-of-of
  • Edge reifications, e.g.:

    (a / ...
       :cause (b / ...))

    …can reify :cause to…

    (a / ...
       :ARG1-of (c / cause-01
          :ARG0 (b / ...)))

  • These result in differences in the triples, and thus the Smatch score

SLIDE 10

What’s the Problem?

There is no partial credit for almost-correct triples

Gold             Hyp1             Hyp2
(c / chapter     (c / chapter     (c / chapter)
   :mod 7)          :quant 5)

CAMR             JAMR             AMREager
(c / chapter     (c / chapter     (c / chapter
   :quant 7)        :li 7)           :op1 7)

  • Getting the role wrong (CAMR, JAMR, AMREager) gets the same score as getting both the role and value wrong (Hyp1)
  • Omitting the relation altogether (Hyp2) yields a higher score than having an incorrect relation

SLIDE 11

What’s the Problem?

Some "equivalent" alternations are invalid graphs

Gold             Bad
(c / chapter     (c / chapter
   :mod 7)          :domain-of 5)

  • If :domain-of is inverted, then 5 must be a node id, but it is a constant.

SLIDE 12

Presentation agenda

  • Introduction: AMR, PENMAN, and Smatch
  • Normalization
  • Experiment
  • Conclusion

SLIDE 13

Normalization

Question: Can we address these problems in evaluation by normalizing the triples?

Meaning-preserving normalization:

  • Canonical Role Inversion
  • Edge Reification

Meaning-augmenting normalization:

  • Attribute Reification
  • Structure Preservation

SLIDE 14

Canonical Role Inversion

Replace non-canonical roles with canonical ones

  • :mod-of -> :domain
  • :domain-of -> :mod
  • :consist -> :consist-of-of
  • etc.
  • (Also useful for general data cleaning)
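Over the triples this normalization is a pure rename, for example (the mapping covers only the roles listed above and is illustrative, not exhaustive):

```python
# Non-canonical role -> canonical role, per the list above.
CANONICAL = {
    'mod-of': 'domain',
    'domain-of': 'mod',
    'consist': 'consist-of-of',
}

def canonicalize_roles(triples):
    """Rename roles in (role, source, target) triples; orientation is unchanged."""
    return [(CANONICAL.get(role, role), src, tgt) for role, src, tgt in triples]
```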

SLIDE 15

Edge Reification

Always reify edges

(d / drive-01
   :ARG0 (h / he)
   :manner (c / care-04
      :polarity -))

…becomes…

(d / drive-01
   :ARG0 (h / he)
   :ARG1-of (m / have-manner-91
      :ARG2 (c / care-04
         :ARG1-of (h2 / have-polarity-91
            :ARG2 -))))
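Over triples, each reified edge is replaced by a fresh node carrying the reifying concept. A sketch using the two reifications above (the table and the fresh-id scheme are illustrative assumptions, not the author's exact implementation):

```python
# role -> (reifying concept, role to the source, role to the target)
REIFICATIONS = {
    'manner':   ('have-manner-91',   'ARG1', 'ARG2'),
    'polarity': ('have-polarity-91', 'ARG1', 'ARG2'),
}

def reify_edges(triples):
    out, n = [], 0
    for role, src, tgt in triples:
        if role in REIFICATIONS:
            concept, src_role, tgt_role = REIFICATIONS[role]
            n += 1
            var = f'_e{n}'  # fresh node id (illustrative naming scheme)
            # e.g. manner(d, c) becomes instance(m, have-manner-91) ^
            # ARG1(m, d) ^ ARG2(m, c), i.e. d :ARG1-of m in PENMAN
            out += [('instance', var, concept),
                    (src_role, var, src),
                    (tgt_role, var, tgt)]
        else:
            out.append((role, src, tgt))
    return out
```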

SLIDE 16

Attribute Reification

Make constants into node labels

(c / chapter        (c / chapter
   :mod 7)      ->     :mod (_ / 7))
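In triple terms: when a target is a constant rather than a node id, introduce a fresh node labeled with that constant. A sketch (the fresh-id scheme and the node-id test are illustrative assumptions):

```python
def reify_attributes(triples):
    node_ids = {src for _, src, _ in triples}  # anything that heads a triple is a node
    out, n = [], 0
    for role, src, tgt in triples:
        if role != 'instance' and tgt not in node_ids:
            n += 1
            var = f'_a{n}'  # fresh node id standing in for the former constant
            out += [(role, src, var), ('instance', var, tgt)]
        else:
            out.append((role, src, tgt))
    return out
```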

SLIDE 17

Structure Preservation

Make the tree structure evident in the triples (using the Little Prince example, adding TOP relations)

(l / let-01
   :ARG0 (i / i :TOP l)
   :ARG1 (d / drop-01 :TOP l
      :ARG1 (t / tool :TOP d
         :poss i)
      :ARG3 (h / hand :TOP d
         :part-of i)))
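One way to sketch this: given the parent relation read off the PENMAN nesting, add a TOP triple per non-root node recording its tree parent (the function and its parents argument are illustrative, not the author's implementation):

```python
def preserve_structure(triples, parents):
    """parents: {child_node_id: parent_node_id}, read off the PENMAN tree."""
    return triples + [('TOP', child, parent)
                      for child, parent in sorted(parents.items())]
```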

SLIDE 18

Presentation agenda

  • Introduction: AMR, PENMAN, and Smatch
  • Normalization
  • Experiment
  • Conclusion

SLIDE 19

Experiment Setup

Test the relative effects of normalization on parsing evaluation for multiple parsers

  • Use the Little Prince corpus with gold annotations
  • Parse using JAMR (Flanigan et al., 2016)
  • Parse using CAMR (Wang et al., 2016)
  • Parse using AMREager (Damonte et al., 2017)
  • Normalize each of the four above (various configurations)
  • Compare:
      • Gold-orig × { JAMR-orig, CAMR-orig, AMREager-orig }
      • Gold-norm × { JAMR-norm, CAMR-norm, AMREager-norm }

SLIDE 20

Results

                Normalization   Score
System          I   A   R   S   P     R     F
JAMR                            0.60  0.56  0.58
                ✓               0.60  0.55  0.57
                    ✓           0.61  0.56  0.58
                        ✓       0.63  0.57  0.60
                            ✓   0.59  0.55  0.57
CAMR                            0.67  0.56  0.61
                ✓               0.67  0.56  0.61
                    ✓           0.67  0.55  0.60
                        ✓       0.70  0.57  0.63
                            ✓   0.68  0.58  0.63
AMREager                        0.57  0.52  0.55
                ✓               0.57  0.52  0.55
                    ✓           0.57  0.53  0.55
                        ✓       0.61  0.57  0.59
                            ✓   0.59  0.54  0.56

SLIDE 21

Results

                Normalization   Score
System          (I A R S)       P     R     F
JAMR                            0.60  0.56  0.58
                ✓ ✓             0.63  0.57  0.60
                ✓ ✓             0.64  0.57  0.60
                ✓ ✓ ✓           0.64  0.57  0.60
                ✓ ✓ ✓ ✓         0.61  0.56  0.59
CAMR                            0.67  0.56  0.61
                ✓ ✓             0.69  0.57  0.63
                ✓ ✓             0.70  0.56  0.62
                ✓ ✓ ✓           0.70  0.56  0.62
                ✓ ✓ ✓ ✓         0.70  0.58  0.63
AMREager                        0.57  0.52  0.55
                ✓ ✓             0.61  0.57  0.59
                ✓ ✓             0.60  0.58  0.59
                ✓ ✓ ✓           0.60  0.58  0.59
                ✓ ✓ ✓ ✓         0.61  0.57  0.59

SLIDE 22

Presentation agenda

  • Introduction: AMR, PENMAN, and Smatch
  • Normalization
  • Experiment
  • Conclusion

SLIDE 23

Discussion

  • Normalization slightly increases scores on this dataset
      • mainly due to partial credit
  • Sometimes it does worse
      • making available previously ignored triples
      • more triples -> larger denominator in Smatch
  • Effects on a single system are unimportant
      • rather, the relative effects across multiple systems are interesting
      • although the relative effects in this experiment are slight
  • Role inversion harmed JAMR but not the others
  • AMREager improves compared to the others
  • Next step: try on other corpora (Bio-AMR, LDC, …)

SLIDE 24

Discussion

  • Normalization is not promoted as a postprocessing step (in general)
      • rather, as preprocessing for evaluation
      • thus it allows parser developers to take risks
      • although reduced variation may benefit sequence-based models
  • Similar procedures are possibly useful for non-AMR representations (e.g., EDS, DMRS)

SLIDE 25

Thanks

Thank you!

Software available:

  • Normalization

https://github.com/goodmami/norman

  • PENMAN graph library

https://github.com/goodmami/penman

SLIDE 26

References i

Laura Banarescu, Claire Bonial, Shu Cai, Madalina Georgescu, Kira Griffitt, Ulf Hermjakob, Kevin Knight, Philipp Koehn, Martha Palmer, and Nathan Schneider. 2013. Abstract meaning representation for sembanking. In Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, pages 178–186, Sofia, Bulgaria. Association for Computational Linguistics.

Marco Damonte, Shay B. Cohen, and Giorgio Satta. 2017. An incremental parser for abstract meaning representation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pages 536–546, Valencia, Spain. Association for Computational Linguistics.

SLIDE 27

References ii

Jeffrey Flanigan, Chris Dyer, Noah A. Smith, and Jaime Carbonell. 2016. CMU at SemEval-2016 task 8: Graph-based AMR parsing with infinite ramp loss. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pages 1202–1206.

Chuan Wang, Sameer Pradhan, Xiaoman Pan, Heng Ji, and Nianwen Xue. 2016. CAMR at SemEval-2016 task 8: An extended transition-based AMR parser. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pages 1173–1178, San Diego, California. Association for Computational Linguistics.

SLIDE 28

References iii

Ralph Weischedel, Sameer Pradhan, Lance Ramshaw, Martha Palmer, Nianwen Xue, Mitchell Marcus, Ann Taylor, Craig Greenberg, Eduard Hovy, Robert Belvin, et al. 2011. OntoNotes release 4.0. LDC2011T03, Philadelphia, Penn.: Linguistic Data Consortium.
