SLIDE 6 Direct Projection
Given sentence pair (E, F) and a set of syntactic relations for E, where E = e1, ..., en is an English sentence and F = f1, ..., fm is its non-English parallel, syntactic relations R(x, y) are projected from English as follows:
- one-to-one – ei aligned with a unique fx and ej aligned with a unique fy – then
R(ei, ej) ⇒ R(fx, fy)
- unaligned English – ej not aligned with any word in F – create new empty word fy so
that for any ei aligned with a unique fx, R(ei, ej) ⇒ R(fx, fy) and R(ej, ei) ⇒ R(fy, fx)
- one-to-many – ei aligned with fx, ..., fy – then create new empty fz, parent of
fx, ..., fy, and set ei to align to fz instead
uniquely aligned to – then keep the head of aligned to , and delete other alignments
- many-to-many – decompose: fjrst one-to-many, then many-to-one
- unaligned foreign – leave them out of the projected tree
Projection of Trees across Parallel Texts
3/14