Collins Parsing
Victor, Yùdōng Zhōu
Collins Parsing Victor, Ydng Zhu Outline Introduction Basic Model - - PowerPoint PPT Presentation
Collins Parsing Victor, Ydng Zhu Outline Introduction Basic Model Representation Calculation Three generative models Models Practice issues Evaluation 2 Introduction Michael Collins PhD Thesis, 1999
Victor, Yùdōng Zhōu
2
3
and probabilities, find best parsing tree
4
An Example
5
6
Notation: AF(j) = (hj, Rj)
where w1=Smith, w5=announced D = {(AF(1),AF(2)...AF(m)} P(T|S)=P(B,D|S)= P(B|S)* P(D|S,B)
7
8
9
Shaw, based in Dalton, Ga., has annual sales of about $1.18 billion, and has economies of scale and lower raw-material costs that are expected to boost the profitability of Armstrong's brands, sold under the Armstrong and Evans-Black names .
10
11
Generative Model Discrimitive Model joint probability distribution
conditional distribution
Representation:
12
*Pr(STOP|S,VP,bought)
13
*Pl(NP(Marks)|S,VP,bought) *Pl(NP(week)|S,VP,bought) *Pl(STOP|S,VP,bought)
=PR(Ri(ri)|P,h,H) In Previous Formula =PR(Ri(ri)|P,h,H, distancer(i-1) )
14
15
16
Plc(LC|P,H,h) and Prc(RC|P,H,h)
PR(Ri(ri)|P,h,H, distancer(i-1), RC ) ……
17
Plc({NP-C}|S,VP,bought) * Prc({ }|S,VP,bought) * Pl(NP-C(Marks)|S,VP,bought, {NP-C} )* Pl(NP(week)|S,VP,bought, { } ) * Pl(STOP|S,VP,bought, { }) * Pr(STOP|S,VP,bought, { }) Plc({NP-C,NP-C}|S,VP,bought) will be quite small Thus achieve the correct parse
18
Brooks Brothers)
Brooks Brothers from TRACE)
19
where G is Head, Left or Right
20
21
22
number of correct constituents in proposed parse number of constituents in proposed parse
number of correct constituents in proposed parse number of constituents in treebank parse
violate constituent boundaries with a constituent in the treebank parse.
23
24
precision/recall 93.3%/90.1%
25