Dependency Parsing CMSC 723 / LING 723 / INST 725 M ARINE C ARPUAT - - PowerPoint PPT Presentation

dependency parsing
SMART_READER_LITE
LIVE PREVIEW

Dependency Parsing CMSC 723 / LING 723 / INST 725 M ARINE C ARPUAT - - PowerPoint PPT Presentation

Dependency Parsing CMSC 723 / LING 723 / INST 725 M ARINE C ARPUAT marine@cs.umd.edu Slides credit: Joakim Nivre & Ryan McDonald Agenda Formalizing dependency graphs Formalizing transition-based parsing Graph-based


slide-1
SLIDE 1

Dependency Parsing

CMSC 723 / LING 723 / INST 725 MARINE CARPUAT

marine@cs.umd.edu

Slides credit: Joakim Nivre & Ryan McDonald

slide-2
SLIDE 2

Agenda

  • Formalizing dependency graphs
  • Formalizing transition-based parsing

– Graph-based – Transition-based most material based on Kubler, McDonald & Nivre

slide-3
SLIDE 3

Dependencies

  • Typed: Label indicating relationship between words
  • Untyped: Only which words depend

I saw a girl with a telescope

nsubj prep dobj det det pobj

I saw a girl with a telescope

slide-4
SLIDE 4

Data-driven dependency parsing

Goal: learn a good predictor of dependency graphs Input: x Output: dependency graph/tree G Can be framed as a structured prediction task

  • very large output space
  • with interdependent labels
slide-5
SLIDE 5

FOR ORMA MALIZ IZING ING DE DEPE PENDENC NDENCY Y REPR PRES ESENT ENTATIO TIONS NS

slide-6
SLIDE 6

Dependency Graphs

slide-7
SLIDE 7

Dependency Graph Notation

slide-8
SLIDE 8

Properties of Dependency Trees

slide-9
SLIDE 9
slide-10
SLIDE 10

Non-Projectivity

  • Most theoretical frameworks do not assume projectivity
  • Non-projective structures are needed to represent

– Long-distance dependencies – Free word order

slide-11
SLIDE 11

GR GRAP APH-BASED BASED PAR ARSING ING

slide-12
SLIDE 12

Directed Spanning Trees

slide-13
SLIDE 13

Maximum Spanning Tree

  • Assume we have an arc factored model

i.e. weight of graph can be factored as sum or product of weights of its arcs

  • Chu-Liu-Edmonds algorithm can find the

maximum spanning tree for us!

– Greedy recursive algorithm – Naïve implementation: O(n^3)

slide-14
SLIDE 14

Chu-Liu-Edmonds illustrated

slide-15
SLIDE 15

Chu-Liu-Edmonds illustrated

slide-16
SLIDE 16

Chu-Liu-Edmonds illustrated

slide-17
SLIDE 17

Chu-Liu-Edmonds illustrated

slide-18
SLIDE 18

Chu-Liu-Edmonds illustrated

slide-19
SLIDE 19
slide-20
SLIDE 20

Arc weights as linear classifiers

slide-21
SLIDE 21

Example of classifier features

slide-22
SLIDE 22

How to score a graph G using features?

Arc-factored model assumption By definition of arc weights as linear classifiers

slide-23
SLIDE 23

How can we learn the classifier from data?

slide-24
SLIDE 24

TR TRAN ANSITIO ITION-BASED BASED DE DEPE PENDENC NDENCY Y PAR ARSE SER

slide-25
SLIDE 25

Transition-based parsing

slide-26
SLIDE 26

Transition-based parsing

slide-27
SLIDE 27

Deterministic parsing with an oracle

slide-28
SLIDE 28

Stack-based transition system

slide-29
SLIDE 29

Transitions & Preconditions

slide-30
SLIDE 30

Let’s try it out…

slide-31
SLIDE 31

A few steps illustrated…

slide-32
SLIDE 32

A few steps illustrated…