Graph Based Dependency Parsing Wei Qiu December 15, 2011 . . - - PowerPoint PPT Presentation

graph based dependency parsing
SMART_READER_LITE
LIVE PREVIEW

Graph Based Dependency Parsing Wei Qiu December 15, 2011 . . - - PowerPoint PPT Presentation

. . . . . . Graph Based Dependency Parsing Graph Based Dependency Parsing Wei Qiu December 15, 2011 . . . . . . Graph Based Dependency Parsing Outline Outline 1 Introduction to dependency parsing 2 Graph based dependency


slide-1
SLIDE 1

. . . . . .

Graph Based Dependency Parsing

Graph Based Dependency Parsing

Wei Qiu ฀ ฀ December 15, 2011

slide-2
SLIDE 2

. . . . . .

Graph Based Dependency Parsing Outline

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-3
SLIDE 3

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-4
SLIDE 4

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

What is dependency parsing?

Input: a sentence John saw Mary yesterday. Output: a dependency tree . . John . saw . Mary . yesterday . . .

poss

.

nsubj

.

xcomp

.

dobj

slide-5
SLIDE 5

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Dependency tree

. . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

Directed tree with root The arcs indicate certain grammatical relatoins between words Properties Single-headness:Each word depends on exactly one parent. Connectivity acyclic

slide-6
SLIDE 6

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Projective dependency tree

. . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

The edges above the words don’t cross. Word and its decendents form a substring.

slide-7
SLIDE 7

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Projective dependency tree

. . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

The edges above the words don’t cross. Word and its decendents form a substring.

slide-8
SLIDE 8

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Projective dependency tree

. . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

The edges above the words don’t cross. Word and its decendents form a substring.

slide-9
SLIDE 9

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Projective dependency tree

. . . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

The edges above the words don’t cross. Word and its decendents form a substring.

slide-10
SLIDE 10

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Non-projective dependency tree

John saw Mary yesterday who was a young lady. . . Root . John . saw . Mary . yesterday . who . . was . . a . . young . . lady . . . .

slide-11
SLIDE 11

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Non-projective dependency tree

John saw Mary yesterday who was a young lady. . . Root . John . saw . Mary . yesterday . who . . was . . a . . young . . lady . . . .

slide-12
SLIDE 12

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Non-projective dependency tree

John saw Mary yesterday who was a young lady. . . Root . John . saw . Mary . yesterday . who . . was . . a . . young . . lady . . . .

slide-13
SLIDE 13

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Non-projective dependency tree

John saw Mary yesterday who was a young lady. . . . Root . John . saw . Mary . yesterday . who . . was . . a . . young . . lady . . . .

slide-14
SLIDE 14

. . . . . .

Graph Based Dependency Parsing Introduction to dependency parsing

Data-driven parsing framework

slide-15
SLIDE 15

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-16
SLIDE 16

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

What is graph based dependcy parsing?

Graph-based models: Define a space of candidate dependency trees

  • f input sentene.

Learning: induce a model for scoring a candidate tree Parsing: find a tree with the highest score given the model

slide-17
SLIDE 17

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

What is graph based dependency parsing?

Candidate trees for “John saw mary yesterday” . . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . . . Root . . John . . saw . . Mary . . yesterday . . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . . . Root . . John . . saw . . Mary . . yesterday . . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . .

slide-18
SLIDE 18

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

What is graph based dependency parsing?

Candidate trees for “John saw mary yesterday” . . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . Root . John . saw . Mary . yesterday . .

.

.

PRED

.

.

SBJ

.

.

OBJ

.

.

TMP

.

.

PU

. . . . Root . . John . . saw . . Mary . . yesterday . . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . . . Root . . John . . saw . . Mary . . yesterday . . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. . .

slide-19
SLIDE 19

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

Arc-factored model

. . Root . John . saw . Mary . yesterday . .

.

.

PRED

.

.

SBJ

.

.

OBJ

.

.

TMP

.

.

PU

X: an input sentence Y: a candidate dependency tree xi → xj: a dependency edge from word i to word j Φ(X): the set of possible depenent trees over X Y∗ = arg max

Y∈Φ(X)

score(Y|X) = arg max

Y∈Φ(X)

(xi→xj)∈Y

score(xi → xj)

slide-20
SLIDE 20

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

Arc-factored model

Y∗ = arg max

Y∈Φ(X)

score(Y|X) = arg max

Y∈Φ(X)

(xi→xj)∈Y

score(xi → xj) score(xi → xj) can be either probability or not. Mcdonald2005: score(xi → xj) = ⃗ w ·⃗ f(xi → xj)

slide-21
SLIDE 21

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

Arc-factored model

. Root . John . saw . Mary . yesterday

slide-22
SLIDE 22

. . . . . .

Graph Based Dependency Parsing Graph based dependency parsing

Arc-factored model

. Root . John . saw . Mary . yesterday

slide-23
SLIDE 23

. . . . . .

Graph Based Dependency Parsing Parsing algorithms

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-24
SLIDE 24

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-25
SLIDE 25

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing

. . . Root . John . saw . Mary . yesterday . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

. saw . . . yesteday . . saw . . . Mary . . saw . . . saw . . John Ideas Legal subtree spans on contiguous string. Subtree can be built from smaller subtrees step by step.In each step, always combine only 2 subtrees!(Exactly CYK!)

slide-26
SLIDE 26

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw (John) (saw mary) . John . . . saw . . . Mary . . saw . . John . John . . . Mary . . . Mary . . saw . . John . . .

slide-27
SLIDE 27

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw (John) (saw mary) . John . . . saw . . . Mary . . saw . . John . John . . . Mary . . . Mary . . saw . . John . . .

slide-28
SLIDE 28

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw (John) (saw mary) . John . . . saw . . . Mary . . saw . . John . John . . . Mary . . . Mary . . saw . . John . . .

slide-29
SLIDE 29

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw (John) (saw mary) . John . . . saw . . . Mary . . saw . . John . . . . John . . . . . . . Mary . . . . . . . Mary . . . . . saw . . . . . John . . .

slide-30
SLIDE 30

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw (John saw) (mary) . John . . . Mary . . John . . . saw . . John . . .

slide-31
SLIDE 31

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw (John saw) (mary) . John . . . Mary . . John . . . saw . . John . . .

slide-32
SLIDE 32

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: example

Example John saw mary . John . . . saw . . John . saw . . . saw . . John . saw . . . Mary . . saw . . . . Mary . . . . . . . Mary . . . . . saw . John . . . Mary . . John . . . saw . . John . John . . . saw . . . Mary . . saw . . John . . . . John . . . . . . . Mary . . . . . . . Mary . . . . . saw . . . . . John . . .

slide-33
SLIDE 33

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing

C[s][t][i] = max

s≤q<t,s≤j≤t

{ C[s][q][i] + C[q + 1][t][j] + λ(wi,wj)if j > i C[s][q][j] + C[q + 1][t][i] + λ(wj,wi)if j < i

slide-34
SLIDE 34

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Naive CYK-like parsing: why is it slow?

The time complexity is O(n5)! . s . q . t . i . j . s . t . i Heads are in the middle, we need extra indices.

slide-35
SLIDE 35

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Extention: Eisner’s algorithm

Intuition For each word, building left dependents is independent of building right dependents. Get rid of the inner indices of heads.

slide-36
SLIDE 36

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

slide-37
SLIDE 37

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

.

Figure: E[s][t][0][1]

.

Figure: E[s][t][1][1]

.

Figure: E[s][t][0][0]

.

Figure: E[s][t][1][0]

slide-38
SLIDE 38

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

. . . John . saw . Mary . . E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws))

slide-39
SLIDE 39

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

. . . John . saw . Mary . E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws))

slide-40
SLIDE 40

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

. . . John . saw . Mary . E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws)) . E[s][t][1][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(ws,wt)) . . . John . saw . Mary . E s t maxs

q t E s q

E q t . E s t maxs

q t E s q

E q t

slide-41
SLIDE 41

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

. . . John . saw . Mary . E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws)) . E[s][t][1][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(ws,wt)) . . . John . saw . Mary . . E[s][t][1][0] = maxs<q≤t(E[s][q][1][1] + E[q][t][1][0]) . E[s][t][0][0] = maxs≤q<t(E[s][q][0][0] + E[q][t][0][1])

slide-42
SLIDE 42

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

. . . John . saw . Mary . E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws)) . E[s][t][1][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(ws,wt)) . . . John . saw . Mary . E[s][t][1][0] = maxs<q≤t(E[s][q][1][1] + E[q][t][1][0]) . E s t maxs

q t E s q

E q t

slide-43
SLIDE 43

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Projective dependency parsing

Eisner’s algorithm

. . . John . saw . Mary . E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws)) . E[s][t][1][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(ws,wt)) . . . John . saw . Mary . E[s][t][1][0] = maxs<q≤t(E[s][q][1][1] + E[q][t][1][0]) . E[s][t][0][0] = maxs≤q<t(E[s][q][0][0] + E[q][t][0][1])

slide-44
SLIDE 44

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-45
SLIDE 45

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Maximum spanning tree

. Root . John . saw . Mary . yesterday

slide-46
SLIDE 46

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Maximum spanning tree

. Root . John . saw . Mary . yesterday

slide-47
SLIDE 47

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm

Ideas Greedy: always try to select the edges with highest weight. Contract: if cirlces occur, always try to break the circle with least value lost. Recursive: repeat this procedure until get the MST.

slide-48
SLIDE 48

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 9 . 10 . 30 . 30 . . 11 . 20 . 3

slide-49
SLIDE 49

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 9 . 10 . 30 . 30 . . 11 . 20 . 3 For each node, select incoming arc with highest weight. If there is no circle, done!

slide-50
SLIDE 50

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 9 . 10 . 30 . 30 . . 11 . 20 . 3 . wjs

slide-51
SLIDE 51

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 9 . 10 . 30 . 30 . . 11 . 20 . 3 . wjs No trick for the outgoing arc from the new node. Select the arc with the highest weight.

slide-52
SLIDE 52

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 10 . 30 . 20 . wjs

slide-53
SLIDE 53

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 40 . . 20 . wjs

slide-54
SLIDE 54

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 10 . 30 . 20 . wjs

slide-55
SLIDE 55

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 29 . 10 . 30 . . wjs

slide-56
SLIDE 56

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 9 . 40 . . 20 . wjs

slide-57
SLIDE 57

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 40 . 30 . . 31 . . wjs

slide-58
SLIDE 58

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 40 . 30 . . 31 . . wjs

slide-59
SLIDE 59

. . . . . .

Graph Based Dependency Parsing Parsing algorithms Non-projective depency parsing(MST)

Chu-Liu-Edmonds algorithm: example

. root . John . saw . Mary . 10 . 30 . 30 . wjs

slide-60
SLIDE 60

. . . . . .

Graph Based Dependency Parsing Learning framework

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-61
SLIDE 61

. . . . . .

Graph Based Dependency Parsing Learning framework

Local learning

Given training data (X, Y) . . . Root . John . . saw . . Mary . . yesterday . . .

.

.

PRED

.

.

SBJ

.

.

OBJ

.

.

TMP

.

.

PU Word pair Link label Instance weight Features John-saw L 1 W1 John, W2 saw, W1W2 John saw, Pos1 noun, Pos2 verb,. . . saw-Mary R 1 W1 saw, W2 Mary, W1W2 saw Mary, Pos1 verb, Pos2 noun,. . . John-Mary N 1 W1 John, W2 Mary, W1W2 saw Mary, Pos1 noun, Pos2 noun,. . . . . .

Table: Training instances

slide-62
SLIDE 62

. . . . . .

Graph Based Dependency Parsing Learning framework

Local learning

linear classfier → link classifier For each word pair in a sentence: No arc, left arc, right arc? Each arc is scored separetely without knowledge of other arcs

slide-63
SLIDE 63

. . . . . .

Graph Based Dependency Parsing Learning framework

Online large-margin training

Intuition Not feed all of the training data once. Update ⃗ w step by step instead. Average on the sequence of ⃗ w∗

slide-64
SLIDE 64

. . . . . .

Graph Based Dependency Parsing Evaluation

Outline

1 Introduction to dependency parsing 2 Graph based dependency parsing 3 Parsing algorithms

Projective dependency parsing Non-projective depency parsing(MST)

4 Learning framework 5 Evaluation

slide-65
SLIDE 65

. . . . . .

Graph Based Dependency Parsing Evaluation

Evaluation methods

Simply use(labeled) dependency accuracy . . Root . John . saw . Mary . . yesterday . . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

1 3 Root PRED 3 2 saw SBJ 3 4 saw OBJ 3 5 saw TMP 3 6 saw PU

Table: Gold

slide-66
SLIDE 66

. . . . . .

Graph Based Dependency Parsing Evaluation

Evaluation methods

Simply use(labeled) dependency accuracy . . Root . John . saw . Mary . . yesterday . . . .

PRED

.

SBJ

.

OBJ

.

TMP

.

PU

1 3 Root PRED 3 2 saw SBJ 3 4 saw OBJ 4 5 saw TMP 3 6 saw PU

Table: Parsed result

slide-67
SLIDE 67

. . . . . .

Graph Based Dependency Parsing Evaluation

Evaluation methods

1 3 Root PRED 3 2 saw SBJ 3 4 saw OBJ 3 5 saw TMP 3 6 saw PU

Table: Gold

1 3 Root PRED 3 2 saw SBJ 3 4 saw OBJ 4 5 saw TMP 3 6 saw PU

Table: Parsed

accuracy = 4 5 = 0.8 No need to use F-meausre. Other metrics: Complete right tree . . .

slide-68
SLIDE 68

. . . . . .

Graph Based Dependency Parsing Evaluation

Evaluation results

Czech-A Czech-B Accuracy Complete Accuracy Complete COLL1999 82.8

  • N&N2005

80.0 31.8

  • McD2005

83.3 31.3 74.8 0.0 Single-best MIRA 84.1 32.2 81.0 14.9 Factored MIRA 84.4 32.3 81.5 14.3

Table: Dependency parsing results for Czech. Czech-B is the subset of Czech-A containing only sentences with at least one non-projective dependency.

slide-69
SLIDE 69

. . . . . .

Graph Based Dependency Parsing Evaluation

Evaluation results

English Accuracy Complete McD2005 90.9 37.5 Singe-best MIRA 90.2 33.2 Factored MIRA 90.2 32.3

Table: Dependency parsing results for English using spanning tree algorithms.

slide-70
SLIDE 70

. . . . . .

Graph Based Dependency Parsing Evaluation

. . Thank . you . ! . . . Any . questions . ? .

slide-71
SLIDE 71

. . . . . .

Graph Based Dependency Parsing Appendix

Eisner’s algorithm

S = w0w1 · · · wn Arc weight parameters λwi,wj ∈ Λ Instaniate E[n][n][2][2] ∈ R Initialization: E[s][s][d][c] = 0 for m: 1 . . . n do for s: 1 . . . n do t = s + m if t > n then break end if E[s][t][0][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(wt,ws)) E[s][t][1][1] = maxs≤q<t(E[s][q][1][0] + E[q + 1][t][0][0] + λ(ws,wt)) E[s][t][0][0] = maxs≤q<t(E[s][q][0][0] + E[q][t][0][1]) E[s][t][1][0] = maxs<q≤t(E[s][q][1][1] + E[q][t][1][0]) end for end for

slide-72
SLIDE 72

. . . . . .

Graph Based Dependency Parsing Appendix

MIRA learning algorithm

Training data: T = (xt, yt)T

t=1

⃗ w0 = 0; ⃗ (v) = 0; i = 0 for n : 1 · · · N do for t : 1 · · · T do min ||⃗ wi+1 − ⃗ wi|| s.t. s(xt, yt) − s(xt, y

′) ≥ L(yt, y ′), ∀y ′ ∈ Φ(xt)

⃗ v = ⃗ v + ⃗ wi+1 i = i + 1 end for end for ⃗ w =

⃗ v N∗T