Transition-Based Dependency Parsing
Saarbrücken, December 23rd 2011 David Przybilla – davida@coli.uni-saarland.de
Transition-Based Dependency Parsing Saarbrcken, December 23rd 2011 - - PowerPoint PPT Presentation
Transition-Based Dependency Parsing Saarbrcken, December 23rd 2011 David Przybilla davida@coli.uni-saarland.de Outline 1. MaltParser 2. Transition Based Parsing a. Example b. Oracle 3. Integrating Graph and Transition Based 4. Non
Saarbrücken, December 23rd 2011 David Przybilla – davida@coli.uni-saarland.de
variety of languages
input
𝑗
𝑗+1
𝑗+2
𝑙
𝑦
Subj
Subj
Subj
Subj Det
Subj Det Obj
Subj Det Obj
𝑗
𝑗+1
𝑗+2
𝑙
𝑦
𝑦
𝑗
𝑗+1
𝑗+2
𝑙
𝑦 𝑋 𝑙…𝑋 𝑗 𝑋 𝑗+1 … .
𝑙) ≠ 0
to the global optimal
and Stack
Graph
the correct head
the correct head and the correct dependency type
More flexible Word order Rich Morphology More Inflexible Word order, ‘poor’ Morphology English Chinese Czech Turkish Danish Dutch Italian Swedish German
Goal -> Evaluate if Maltparser can do reasonably accurate parsing for a wide variety of languages
across languages In General lower accuracy for languages like Czech and Turkish.
–
There are more non-projective structures in those languages
– Big difference in the amount of annotated data – existence of accurate POS Taggers..
State of the art for Italian, Swedish, Danish, Turkish
(Highest Scoring Graph)
Optimal)
Desitions
representation
the best transition between two
Propagation (Greedy Alg.)
Dependencies
dependents that are :
that are:
Treebank T Malt Parser Transition Based Parser Parse sed d T
MST Parser
Treebank T MST Parser Transition Based Parser Malt Parser
Parse sed d T
𝑗, 𝑘, 𝑚′ is in 𝐻𝑛𝑏𝑚𝑢
(left,right,root..)
is in 𝐻𝑛𝑡𝑢 𝑇0=fist element of the Stack, 𝐶0 =First element of the Buffer
cannot be parsed with this algorithm
projective
distances.
A dependency Tree 𝑈 is Projective:
if for every 𝐵𝑠𝑑 (𝑋
𝑗, 𝑋 𝑘, 𝑠𝑓𝑚) there is a path from 𝑋 𝑗 to 𝑋 𝑙 , if 𝑋 𝑙
is between 𝑋
𝑗 and 𝑋 𝑘
From ‘Scheduled’ 𝑋
2 there is an arc to 𝑋 5 however there is no
way to get to 𝑋
4, 𝑋 3 from 𝑋 2
generate this tree?
𝑙
𝑗
𝑗+1
𝑙
𝑗
𝑗+1
Theoretically
Running Time
Parsing Accuracy
dependency label
– Czech and Slovene more non-porjective arcs in this languages.
projective arcs in the whole set
language
Kübler, Svetoslav Marinov, and Erwin Marsi. Maltparser: a language- independent system for data-driven dependency parsing. Natural Language Engineering, 13(1):1–41, 2007.
based dependency parsers. In Proceedings of ACL-08: HLT, pages 950–958, Columbus, Ohio, June 2008.
In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 351–359, Suntec, Singapore, 2009.
Claypool Publishers, 2009