Hebrew Dependency Parsing: Initial Results
Yoav Goldberg, Michael Elhadad
IWPT 2009, Paris
Motivation

◮ We want a Hebrew Dependency parser
◮ Initial steps:
  ◮ Know Hebrew
  ◮ Create a Hebrew Dependency Treebank
  ◮ Experiment with existing state-of-the-art systems
◮ Next year:
  ◮ Do better
Know Hebrew

◮ Relatively free constituent order
  ◮ Suitable for a dependency-based representation
  ◮ Mostly SVO, but OVS, VSO also possible. Verbal arguments appear before or after the verb:
    ◮ went from-Israel to-Paris
    ◮ went to-Paris from-Israel
    ◮ to-Paris from-Israel went
    ◮ to-Paris went from-Israel
    . . .
◮ Rich morphology
  ◮ Many word forms
  ◮ Agreement (noun/adj, verb/subj): should help parsing!
◮ Agglutination
  ◮ Many function words are attached to the next token
  ◮ Together with rich morphology ⇒ very high ambiguity
  ◮ Leaves of the tree are not known in advance!
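The last point can be illustrated with a toy segmenter. The prefix letters and word list below are hypothetical stand-ins, not real Hebrew lexicon data; the point is only that one surface token can yield several candidate leaf sequences for the parser.

```python
# Sketch: enumerate possible segmentations of an agglutinated token.
# PREFIXES and LEXICON are made-up toy data, not actual Hebrew resources.

PREFIXES = {"w", "b", "l", "h"}              # e.g. "and", "in", "to", "the"
LEXICON = {"bit", "clm", "cl", "bclm"}       # toy word list

def segmentations(token, prefix=()):
    """Return every split of `token` into zero or more single-letter
    function-word prefixes followed by a lexicon word."""
    results = []
    if token in LEXICON:                     # whole remainder is a word
        results.append(prefix + (token,))
    if token and token[0] in PREFIXES:       # peel off one function word
        results.extend(segmentations(token[1:], prefix + (token[0],)))
    return results

print(segmentations("bclm"))                 # two competing analyses
```

Even this toy lexicon makes the token ambiguous between a bare word and a prefix-plus-word reading, so the tree's leaves are not fixed until segmentation is resolved.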
Hebrew Dependency Treebank
◮ Converted from the Hebrew Constituency Treebank (V2)
  ◮ Some heads are marked in the Treebank
  ◮ For others: (extended) head percolation table from Reut Tsarfaty
◮ 6220 sentences
◮ 34 non-projective sentences
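A head percolation table maps each constituent label to a search direction and an ordered list of preferred child labels. The rules below are hypothetical placeholders, not Tsarfaty's actual table; they only sketch the mechanism used in the conversion.

```python
# Sketch of head-child selection with a head percolation table.
# The table entries are illustrative, not the real extended table.

PERCOLATION = {
    # parent label: (search direction, ordered label preferences)
    "S":  ("left",  ["VP", "S"]),
    "VP": ("left",  ["VB", "VP"]),
    "NP": ("right", ["NN", "NP"]),
}

def head_child(parent_label, child_labels):
    """Return the index of the head child of a constituent."""
    direction, preferences = PERCOLATION[parent_label]
    order = list(range(len(child_labels)))
    if direction == "right":
        order.reverse()
    for label in preferences:          # try preferred labels first
        for i in order:
            if child_labels[i] == label:
                return i
    return order[0]                    # fallback: first child in direction

print(head_child("S", ["NP", "VP", "NP"]))   # → 1
```

Applying such a rule at every constituent, then attaching each non-head child's lexical head to the head child's lexical head, yields the dependency tree.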
Hebrew Dependency Treebank

◮ Choice of heads
  ◮ Prepositions are heads of PPs
  ◮ Relativizers are heads of relative clauses
  ◮ The main verb is the head of an infinitive verb
  ◮ Coordinators are heads of conjunctions ← hard for parsers
Hebrew Dependency Treebank
Dependency labels
◮ Marked in TBv2
  ◮ OBJ
  ◮ SUBJ
  ◮ COMP
◮ Trivially added
  ◮ ROOT
  ◮ suffix-inflections
◮ We are investigating ways of adding more labels
◮ This work focuses on unlabeled dependency parsing.
Experiments
Parameters
◮ Graph-based vs. transition-based?
◮ How important is lexicalization?
◮ Does morphology help?
Parsers
◮ Transition based: MALTPARSER (Joakim Nivre)
  ◮ MALT: MaltParser, out-of-the-box feature set
  ◮ MALTARA: MaltParser, Arabic-optimized feature set (should handle morphology)
◮ Graph based: MSTPARSER (Ryan McDonald)
  ◮ MST1: first-order MST parser
  ◮ MST2: second-order MST parser
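To make the transition-based family concrete, here is a minimal arc-standard derivation executor, the kind of transition system MaltParser's algorithms belong to. This is an illustrative sketch only (MaltParser itself is a Java tool with learned transition classifiers); the transition sequence in the example is hand-picked, not predicted.

```python
# Minimal arc-standard transition system (illustrative sketch).

def parse(n_words, transitions):
    """Apply a sequence of SHIFT / LEFT / RIGHT transitions to words
    1..n_words; return head[i] for each word (0 = root)."""
    heads = {}
    stack, buffer = [], list(range(1, n_words + 1))
    for t in transitions:
        if t == "SHIFT":                  # move next word onto the stack
            stack.append(buffer.pop(0))
        elif t == "LEFT":                 # second-from-top ← top
            dep = stack.pop(-2)
            heads[dep] = stack[-1]
        elif t == "RIGHT":                # second-from-top → top
            dep = stack.pop()
            heads[dep] = stack[-1]
    heads[stack[0]] = 0                   # last item on the stack is the root
    return heads

# "to-Paris went from-Israel": word 2 ("went") heads words 1 and 3.
print(parse(3, ["SHIFT", "SHIFT", "LEFT", "SHIFT", "RIGHT"]))
```

A trained model replaces the hand-picked sequence by a classifier that chooses the next transition from stack/buffer features, which is why feature-set choices (MALT vs. MALTARA) matter.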
Experimental Setup
◮ Oracle setting: use gold morphology/tagging/segmentation
◮ Pipeline setting: use tagger-based morphology/tagging/segmentation
Results
Features    MST1    MST2    MALT    MALT-ARA
−MORPH
  Full Lex  83.60   84.31   80.77   80.32
  Lex 20    82.99   84.52   79.69   79.40
  Lex 100   82.56   83.12   78.66   78.56
+MORPH
  Full Lex  83.60   84.39   80.77   80.73
  Lex 20    83.60   84.77   79.69   79.84
  Lex 100   83.23   83.80   78.66   78.56

Table: oracle token segmentation and POS-tagging.
Features    MST1    MST2    MALT    MALT-ARA
−MORPH
  Full Lex  75.64   76.38   73.03   72.94
  Lex 20    75.48   76.41   72.04   71.88
  Lex 100   74.97   75.49   70.93   70.73
+MORPH
  Full Lex  73.90   74.62   73.03   73.43
  Lex 20    73.56   74.41   72.04   72.30
  Lex 100   72.90   73.78   70.93   70.97

Table: tagger token segmentation and POS-tagging.
Results
◮ Best oracle result: 84.77%
◮ Best real result: 76.41%
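Since this work evaluates unlabeled dependency parsing, the percentages are unlabeled attachment scores (UAS): the fraction of words whose predicted head matches the gold head. A minimal sketch of the metric, with made-up toy trees:

```python
# Sketch: unlabeled attachment score (UAS) over a corpus.
# A tree is a list of head indices, one per word (0 = root); data is made up.

def uas(gold_trees, pred_trees):
    correct = total = 0
    for gold, pred in zip(gold_trees, pred_trees):
        correct += sum(g == p for g, p in zip(gold, pred))
        total += len(gold)
    return 100.0 * correct / total

gold = [[2, 0, 2], [0, 1]]        # two toy sentences
pred = [[2, 0, 1], [0, 1]]        # one head wrong out of five
print(uas(gold, pred))            # → 80.0
```

Note that in the pipeline setting, a wrongly segmented token cannot receive a correct head, which partly explains the oracle/real gap.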
Results

◮ MST2 > MST1 > MALT
  ◮ Simply a better model
  ◮ Partly because of the coordination representation
Results

◮ Lexical items appearing > 20 times ∼ all lexical items
Results With Oracle Morphology
◮ Morphological features don’t really help
Results With Tagger Morphology
◮ Morphological features help MALT a little
◮ Morphological features hurt MST a lot
Where do we go from here?
◮ We have a Hebrew Dependency Treebank
◮ Realistic performance is still too low
◮ Current models don't utilize morphological information well
◮ Can we do better?
◮ Pipeline model hurt performance
◮ Can we do parsing, tagging and segmentation jointly?