Institute for Information Transmission Problems Russian Academy of - - PowerPoint PPT Presentation

institute for information transmission problems russian
SMART_READER_LITE
LIVE PREVIEW

Institute for Information Transmission Problems Russian Academy of - - PowerPoint PPT Presentation

Experiments on human incremental parsing Leonid Mityushin Leonid Iomdin Laboratory of Computational Linguistics Institute for Information Transmission Problems Russian Academy of Sciences 1 ETAP multilingual multifunctional


slide-1
SLIDE 1

Experiments on human incremental parsing Leonid Mityushin Leonid Iomdin Laboratory of Computational Linguistics Institute for Information Transmission Problems Russian Academy of Sciences

1

slide-2
SLIDE 2

ETAP – multilingual multifunctional linguistic processor dependency parsing, machine translation, semantic analysis, question answering Languages Russian (~110,000 word lexicon) English ( ~90,000 word lexicon) French, German, Spanish, Korean, Arabic SynTagRus – Russian dependency treebank ~1,100,000 words

2

slide-3
SLIDE 3

Incremental text comprehension At any moment, the reader/listener has a complete or almost complete linguistic and pragmatic interpretation of the part

  • f the text perceived up to that moment.

This interpretation, as a rule, does not change after new parts of the text have been perceived.

3

slide-4
SLIDE 4

The aim of this work is to evaluate whether this is true for human comprehension

  • f the syntactic structure of a sentence.

4

slide-5
SLIDE 5

ETAP syntactic model Nodes of a dependency tree = words

  • f the sentence (not punctuation marks).

Syntactic links = directed arcs between words, labelled with names of syntactic relations. Russian syntax: about 70 syntactic relations.

5

slide-6
SLIDE 6

partial syntactic structure 1 . . . . . . K−1 K K+1 . . . left context active right context word initial segment

6

slide-7
SLIDE 7

100 percent confident incremental parsing is impossible. Size of the right context = 0 Initial segment: John met her . . . . – ??? John met her yesterday. John met her yesterday John met her sister yesterday. John met her sister yesterday

7

slide-8
SLIDE 8

Tentative links Given the initial segment John met her . . . . (with a zero right context), we create a syntactic link met her but mark it as tentative (the other links are called final).

8

slide-9
SLIDE 9

partial syntactic structure 1 . . . . . . K−1 K K+1 . . . left context active right context word initial segment

9

slide-10
SLIDE 10

We presume that processing a sentence always results in creating its correct complete dependency tree.

10

slide-11
SLIDE 11

Performance indicators number of corrections number of created tentative links

11

slide-12
SLIDE 12

London Orbital is a 117 mile long motorway, encircling almost all of Greater London. size of the right context = 1

12

slide-13
SLIDE 13

London Orb Orbital al is ...... 1 London -- 2 Orbi bita tal is

  • * --> 2 |

2 --> 1 London |

  • 13
slide-14
SLIDE 14

TENTATIVE LINKS

  • create and insert | -->

create | --> insert | --> remove | -->

  • CORRECTION OF FINAL LINKS
  • insert | -->

remove | -->

  • 14
slide-15
SLIDE 15

London Orb Orbital al is ...... 1 London -- 2 Orbi bita tal is

  • * --> 2 |

2 --> 1 London |

  • 15
slide-16
SLIDE 16

London Orb Orbital al is ...... 1 London -- 2 Orbi bita tal is

  • * --> 2 |

2 --> 1 London | co compo pos

  • 16
slide-17
SLIDE 17

London Orbital is is a ...... 1 London <-. compos 2 Orbital –-' -- 3 is is a

  • * --> 3 |

3 --> 2 Orbital |

  • 17
slide-18
SLIDE 18

London Orbital is is a ...... 1 London <-. compos 2 Orbital –-' -- 3 is is a

  • * --> 3 |

3 --> 2 Orbital | pred edic ic

  • 18
slide-19
SLIDE 19

London Orbital is a 117 ...... 1 London <-. compos 2 Orbital --' <-. predic 3 is --' -- 4 a 117

  • * --> 4 |

4 --> 3 is |

  • 19
slide-20
SLIDE 20

London Orbital is a 11 117 mile ...... 1 London <-. compos 2 Orbital --' <-. predic 3 is --' -- 4 a -- 5 117 117 mile

  • * --> 5 |

5 --> 3 is | 5 --> 4 a |

  • 20
slide-21
SLIDE 21

London Orbital is a 117 mi mile le long ...... 1 London <-. compos 2 Orbital --' <-. predic 3 is --' -- 4 a -- 5 117 -- 6 mile le long

21

slide-22
SLIDE 22

London Orbital is a 117 mi mile le long ......

  • * --> 6 |

6 --> 3 is | 6 --> 4 a | 6 --> 5 117 |

  • 22
slide-23
SLIDE 23

London Orbital is a 117 mi mile le long ......

  • * --> 6 |

6 --> 3 is | 6 --> 4 a | 6 --> 5 117 | qu quan antit

  • 23
slide-24
SLIDE 24

London Orbital is a 117 mile lon

  • ng

motorway ...... 1 London <-. compos 2 Orbital --' <-. predic 3 is --' -- 4 a -- 5 117 <-. quantit 6 mile --' -- 7 long ng motorway

24

slide-25
SLIDE 25

London Orbital is a 117 mile lon

  • ng

motorway ......

  • * --> 7 |

7 --> 3 is | 7 --> 4 a | 7 --> 6 mile |

  • 25
slide-26
SLIDE 26

London Orbital is a 117 mile lon

  • ng

motorway ......

  • * --> 7 |

7 --> 3 is | 7 --> 4 a | 7 --> 6 mile | res restr

  • 26
slide-27
SLIDE 27

London Orbital is a 117 mile long moto torway ay, encircling ...... 1 London <-. compos 2 Orbital --' <-. predic 3 is --' -- 4 a -- 5 117 <-. quantit 6 mile --' <-. restr 7 long --' -- 8 moto torw rway, encircling

27

slide-28
SLIDE 28

London Orbital is a 117 mile long mo moto torway ay, encircling ......

  • * --> 8 |

8 --> 3 is | 8 --> 4 a | 8 --> 7 long |

  • 28
slide-29
SLIDE 29

London Orbital is a 117 mile long mo moto torway ay, encircling ......

  • * --> 8 | 3

3 copu pula lat 8 --> 3 is | 8 --> 4 a | det determ 8 --> 7 long | mod modif

  • 29
slide-30
SLIDE 30

London Orbital is a 117 mile long motorway, enci circ rcling ng almost ...... 1 London <-. compos 2 Orbital --'<-. predic 3 is --' --. -- 4 a <-. | determ 5 117 <-. | | quantit 6 mile --'<-. | | restr 7 long <-.--' | | modif 8 motorway, --' --'<-' copulat 9 enci circ rcling ng almost

30

slide-31
SLIDE 31

London Orbital is a 117 mile long motorway, enci circ rcling ng almost ......

  • * --> 9 |

9 --> 8 motorway |

  • 31
slide-32
SLIDE 32

London Orbital is a 117 mile long motorway, enci circ rcling ng almost ......

  • * --> 9 | 8

mo modif 9 --> 8 motorway |

  • 32
slide-33
SLIDE 33

and so on ...

33

slide-34
SLIDE 34

Three series of experiments were conducted for the sizes of the right context 0, 1 and 2, with 100 sentences processed in each series. The role of the subjects was played by the authors of this paper. The sentences for the experiments were selected at random from the two sets of sentences offered as training material for the competition "Automatic Gapping Resolution for Russian“. Only non-elliptical sentences were used.

34

slide-35
SLIDE 35

right total tentative created number context number links in tentative of correc-

  • f links the trees links tions

1627 34 = 2.2% 75 = 4.6% 3 1 1741 21 = 1.2% 34 = 2.0% 0 2 1607 8 = 0.5% 13 = 0.8%

35

slide-36
SLIDE 36

Thank you!

36