[PPT] - A Comparison of Chinese Parsers for Stanford Dependencies Wanxiang PowerPoint Presentation

SLIDE 1

A Comparison of Chinese Parsers for Stanford Dependencies

Wanxiang Che,† Valentin I. Spitkovsky‡ and Ting Liu†

†Harbin Institute of Technology ‡Stanford University ACL 2012

July 11, 2012

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 1 / 19

SLIDE 2

Outline

1 Introduction 2 Methodology 3 Results 4 Analysis 5 Conclusion

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 2 / 19

SLIDE 3

Introduction

Outline

1 Introduction 2 Methodology 3 Results 4 Analysis 5 Conclusion

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 3 / 19

SLIDE 4

Introduction

Stanford Dependencies

A simple description of relations between pairs of words in a sentence A kind of semantically-oriented dependency representation Converted from constituent trees by rules 53 binary relations for English, 46 for Chinese

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 4 / 19

SLIDE 5

Introduction

Stanford Dependencies

A simple description of relations between pairs of words in a sentence A kind of semantically-oriented dependency representation Converted from constituent trees by rules 53 binary relations for English, 46 for Chinese

Root-

I saw the man who loves you

root nsubj dobj det rcmod nsubj dobj ROOT SUB VMOD NMOD CLF SUB VMOD

Figure: Stanford dependencies (above) vs. CoNLL style (below)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 4 / 19

SLIDE 6

Introduction

Stanford Dependencies Applications

Intuitive and easy to apply, requires little linguistic expertise

Biomedical text mining (Kim et al., 2009) Textual entailment (Androutsopoulos and Malakasiotis, 2010) Information extraction (Wu and Weld, 2010; Banko et al., 2007) Sentiment analysis (Meena and Prabhakar, 2007; Wu et al., 2011)

Root-

I saw the man who loves you

root nsubj dobj det rcmod nsubj dobj ROOT SUB VMOD NMOD CLF SUB VMOD

Figure: Stanford dependencies (above) vs. CoNLL style (below)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 5 / 19

SLIDE 7

Introduction

Parsing Methods

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 8

Introduction

Parsing Methods

Constituent Parsing (indirect)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 9

Introduction

Parsing Methods

Constituent Parsing (indirect)

Sentence

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 10

Introduction

Parsing Methods

Constituent Parsing (indirect)

Sentence ⇒

IP VP IP VP NP NN 建设 NN 基础 NN 国家 VV 投资 NP NP NN 企业家 ADJP JJ 民营 VV 鼓励 NP NR 中国

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 11

Introduction

Parsing Methods

Constituent Parsing (indirect)

Sentence ⇒

IP VP IP VP NP NN 建设 NN 基础 NN 国家 VV 投资 NP NP NN 企业家 ADJP JJ 民营 VV 鼓励 NP NR 中国

⇒

中国鼓励民营企业家投资国家基础建设 China encourages private entrepreneurs invest national infrastructure construction

root nsubj dobj dep amod dobj nn nn

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 12

Introduction

Parsing Methods

Constituent Parsing (indirect)

Sentence ⇒

IP VP IP VP NP NN 建设 NN 基础 NN 国家 VV 投资 NP NP NN 企业家 ADJP JJ 民营 VV 鼓励 NP NR 中国

⇒

中国鼓励民营企业家投资国家基础建设 China encourages private entrepreneurs invest national infrastructure construction

root nsubj dobj dep amod dobj nn nn

Stanford dependency parser’s original implementation

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 13

Introduction

Parsing Methods

Constituent Parsing (indirect)

Sentence ⇒

IP VP IP VP NP NN 建设 NN 基础 NN 国家 VV 投资 NP NP NN 企业家 ADJP JJ 民营 VV 鼓励 NP NR 中国

⇒

中国鼓励民营企业家投资国家基础建设 China encourages private entrepreneurs invest national infrastructure construction

root nsubj dobj dep amod dobj nn nn

Stanford dependency parser’s original implementation

Dependency Parsing (direct)

Sentence ⇒

中国鼓励民营企业家投资国家基础建设 China encourages private entrepreneurs invest national infrastructure construction

root nsubj dobj dep amod dobj nn nn

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 6 / 19

SLIDE 14

Introduction

Motivation

Which method is better for Chinese Stanford Dependencies?

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 7 / 19

SLIDE 15

Introduction

Motivation

Which method is better for Chinese Stanford Dependencies? Comparison for English (Cer et al., 2010)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 7 / 19

SLIDE 16

Introduction

Motivation

Which method is better for Chinese Stanford Dependencies? Comparison for English (Cer et al., 2010)

Constituent parsers systematically outperform direct methods

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 7 / 19

SLIDE 17

Introduction

Motivation

Which method is better for Chinese Stanford Dependencies? Comparison for English (Cer et al., 2010)

Constituent parsers systematically outperform direct methods Did not explore more sophisticated (higher-order) dependency parsers Did not explore more consistent (n-way jackknifing of) POS tags Small bug in evaluation of MSTParser

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 7 / 19

SLIDE 18

Methodology

Outline

1 Introduction 2 Methodology 3 Results 4 Analysis 5 Conclusion

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 8 / 19

SLIDE 19

Methodology Open Source Parsers

Parsers Information

Open Source Parsers Type Parser Version Algorithm Constituent Berkeley 1.1 PCFG Bikel 1.2 PCFG Charniak

Nov. 2009

PCFG Stanford 2.0 Factored Dependency MaltParser 1.6.1 Arc-Eager Mate 2.0 2nd-order MST MSTParser 0.5 MST

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 9 / 19

SLIDE 20

Methodology Settings

Settings

Corpus Latest Chinese TreeBank (CTB) 7.0 Number of \in Train Dev Test Total files 2,083 160 205 2,448 sentences 46,572 2,079 2,796 51,447 tokens 1,039,942 59,955 81,578 1,181,475

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 10 / 19

SLIDE 21

Methodology Settings

Settings

Corpus Latest Chinese TreeBank (CTB) 7.0 Number of \in Train Dev Test Total files 2,083 160 205 2,448 sentences 46,572 2,079 2,796 51,447 tokens 1,039,942 59,955 81,578 1,181,475 Software and Hardware Parsers: all default options Hardware: Intel’s Xeon E5620 2.40GHz CPU and 24GB RAM

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 10 / 19

SLIDE 22

Methodology Features for Dependency Parsers

Features for Dependency Parsers

POS tags Stanford POS tagger Automatic tags for training data (via 10-way jackknifing)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 11 / 19

SLIDE 23

Methodology Features for Dependency Parsers

Features for Dependency Parsers

POS tags Stanford POS tagger Automatic tags for training data (via 10-way jackknifing) Lemmas The last character of each Chinese word

E.g., bicycle (自行车车车), car (汽车车车) and train (火车车车) are all various kinds of vehicle (车)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 11 / 19

SLIDE 24

Results

Outline

1 Introduction 2 Methodology 3 Results 4 Analysis 5 Conclusion

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 12 / 19

SLIDE 25

Results

Chinese Results

Dev Test Type Parser UAS LAS UAS LAS Time Constituent Berkeley 82.0 77.0 82.9 77.8 45:56 Bikel 79.4 74.1 80.0 74.3 6,861:31 Charniak 77.8 71.7 78.3 72.3 128:04 Stanford 76.9 71.2 77.3 71.4 330:50 Dependency MaltParser (liblinear) 76.0 71.2 76.3 71.2 0:11 MaltParser (libsvm) 77.3 72.7 78.0 73.1 556:51 Mate (2nd-order) 82.8 78.2 83.1 78.1 87:19 MSTParser (1st-order) 78.8 73.4 78.9 73.1 12:17

Bold: best results. Dark Red: worst results. Blue: best results of constituent parsers.

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 13 / 19

SLIDE 26

Analysis

Outline

1 Introduction 2 Methodology 3 Results 4 Analysis 5 Conclusion

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 14 / 19

SLIDE 27

Analysis

Comparison between Mate and Berkeley parsers

Mate is slightly better than Berkeley (but not significantly, p > 0.05)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 15 / 19

SLIDE 28

Analysis

Comparison between Mate and Berkeley parsers

Mate is slightly better than Berkeley (but not significantly, p > 0.05) Performance (F1) comparison on different relations

Relation Count Mate Berkeley nn 7,783 91.3 89.3 dep 4,651 69.4 70.3 nsubj 4,531 87.1 85.5 advmod 4,028 94.3 93.8 dobj 3,990 86.0 85.0 conj 2,159 76.0 75.8 prep 2,091 94.3 94.1 root 2,079 81.2 82.3 nummod 1,614 97.4 96.7 assmod 1,593 86.3 84.1

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 15 / 19

SLIDE 29

Analysis

More Analysis

Feature Effect 10-way jackknifing POS tags for training data Gold Jackknifing Mate 75.4 78.2 Berkeley 77.0 76.5

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 16 / 19

SLIDE 30

Analysis

More Analysis

Feature Effect 10-way jackknifing POS tags for training data Gold Jackknifing Mate 75.4 78.2 Berkeley 77.0 76.5 Lemmas for Mate

77.8 (w/o) vs. 78.2 (with)

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 16 / 19

SLIDE 31

Analysis

More Analysis

Feature Effect 10-way jackknifing POS tags for training data Gold Jackknifing Mate 75.4 78.2 Berkeley 77.0 76.5 Lemmas for Mate

77.8 (w/o) vs. 78.2 (with)

English vs. Chinese Chinese English Berkeley 77.0 87.9 Charniak 71.7 87.8 CJ (Charniak + Reranking) — 89.1 Mate 78.2 88.6

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 16 / 19

SLIDE 32

Conclusion

Outline

1 Introduction 2 Methodology 3 Results 4 Analysis 5 Conclusion

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 17 / 19

SLIDE 33

Conclusion

For Chinese, direct approach comparable to using constituents

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 18 / 19

SLIDE 34

Conclusion

For Chinese, direct approach comparable to using constituents Which parser to use in practice?

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 18 / 19

SLIDE 35

Conclusion

For Chinese, direct approach comparable to using constituents Which parser to use in practice?

Most accurate: Mate parser Fastest: MaltParser (liblinear) Trade-off: Berkeley parser

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 18 / 19

SLIDE 36

Conclusion

For Chinese, direct approach comparable to using constituents Which parser to use in practice?

Most accurate: Mate parser Fastest: MaltParser (liblinear) Trade-off: Berkeley parser

We prefer dependency parsers which more easily admit richer features

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 18 / 19

SLIDE 37

Conclusion

For Chinese, direct approach comparable to using constituents Which parser to use in practice?

Most accurate: Mate parser Fastest: MaltParser (liblinear) Trade-off: Berkeley parser

We prefer dependency parsers which more easily admit richer features

n-way jackknifing of POS tags and lemma features can help

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 18 / 19

SLIDE 38

Conclusion

Thanks and QA

Che, Spitkovsky, and Liu (HIT, Stanford) Comparison of Chinese Parsers July 11, 2012 19 / 19