
Institute for Information Transmission Problems Russian Academy of - PowerPoint PPT Presentation



  1. Experiments on human incremental parsing. Leonid Mityushin, Leonid Iomdin. Laboratory of Computational Linguistics, Institute for Information Transmission Problems, Russian Academy of Sciences.

  2. ETAP: a multilingual, multifunctional linguistic processor (dependency parsing, machine translation, semantic analysis, question answering). Languages: Russian (~110,000-word lexicon), English (~90,000-word lexicon), French, German, Spanish, Korean, Arabic. SynTagRus: a Russian dependency treebank of ~1,100,000 words.

  3. Incremental text comprehension. At any moment, the reader/listener has a complete or almost complete linguistic and pragmatic interpretation of the part of the text perceived up to that moment. This interpretation, as a rule, does not change after new parts of the text have been perceived.

  4. The aim of this work is to evaluate whether this is true for human comprehension of the syntactic structure of a sentence.

  5. ETAP syntactic model. Nodes of a dependency tree = words of the sentence (not punctuation marks). Syntactic links = directed arcs between words, labelled with names of syntactic relations. Russian syntax: about 70 syntactic relations.
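A minimal sketch of the structure described on this slide: words are nodes addressed by position, and each link is a directed, labelled arc from head to dependent. The class and function names are assumptions made for illustration, not part of ETAP; the links are the ones the slides later build for the first eight words of the London Orbital example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Link:
    head: int        # 1-based position of the governing word
    dependent: int   # 1-based position of the dependent word
    relation: str    # name of the syntactic relation


def heads(links):
    """Map each dependent to its head, rejecting words with two heads."""
    result = {}
    for link in links:
        if link.dependent in result:
            raise ValueError(f"word {link.dependent} has two heads")
        result[link.dependent] = link.head
    return result


def roots(n_words, links):
    """Words without an incoming link; a complete tree has exactly one."""
    governed = heads(links)
    return [i for i in range(1, n_words + 1) if i not in governed]


# Words 1-8: London Orbital is a 117 mile long motorway
links = [
    Link(2, 1, "compos"), Link(3, 2, "predic"), Link(8, 4, "determ"),
    Link(6, 5, "quantit"), Link(7, 6, "restr"), Link(8, 7, "modif"),
    Link(3, 8, "copulat"),
]
print(roots(8, links))  # [3]
```

Storing only (head, dependent, relation) triples keeps the one-head-per-word property easy to check.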

  6. [Diagram: the initial segment consists of the left context (words 1 ... K−1, carrying the partial syntactic structure), the active word K, and the right context (words K+1 ...).]
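The window in this diagram can be sketched as a generator that walks through the sentence and, for each active word K, returns the left context, the active word, and a right context of fixed size. The function name `windows` and its parameter are hypothetical; punctuation is left attached to the words for simplicity.

```python
def windows(words, right_context_size):
    """Yield (left_context, active_word, right_context) for each position K."""
    for k in range(len(words)):
        left = words[:k]
        active = words[k]
        right = words[k + 1 : k + 1 + right_context_size]
        yield left, active, right


words = "London Orbital is a 117 mile long motorway".split()
steps = list(windows(words, right_context_size=1))
print(steps[1])  # (['London'], 'Orbital', ['is'])
```

With `right_context_size=0` the right context is always empty, which corresponds to the "John met her" situation discussed in the slides.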

  7. Incremental parsing with 100 percent confidence is impossible. Size of the right context = 0. Initial segment: John met her ... ??? Two continuations are possible: "John met her yesterday." (her depends on met) and "John met her sister yesterday." (her depends on sister).

  8. Tentative links. Given the initial segment John met her ... (with a zero right context), we create a syntactic link met → her but mark it as tentative (the other links are called final).
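A toy sketch of the idea, under stated assumptions: a link carries a `tentative` flag, and the next word either confirms it or replaces it. The `resolve` rule below is hard-coded for this single example and is not ETAP logic; all names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Link:
    head: str
    dependent: str
    tentative: bool = False


def resolve(link, next_word):
    """Toy rule for this one example only (an assumption):
    'sister' takes over 'her'; any other word confirms the link."""
    if next_word == "sister":
        return Link(head="sister", dependent=link.dependent)
    return Link(link.head, link.dependent, tentative=False)


guess = Link("met", "her", tentative=True)
print(resolve(guess, "yesterday"))  # link met -> her becomes final
print(resolve(guess, "sister"))     # link is replaced by sister -> her
```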

  9. [Diagram: the initial segment consists of the left context (words 1 ... K−1, carrying the partial syntactic structure), the active word K, and the right context (words K+1 ...).]

  10. We presume that processing a sentence always results in creating its correct complete dependency tree.

  11. Performance indicators: number of corrections; number of created tentative links.
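One way to tally these two indicators is to count events in a log of the annotator's actions. The slides do not define a log format, so the event names below are hypothetical, and "correction" is assumed to mean a correction of a final link. The sample log is invented for illustration only.

```python
from collections import Counter

# Hypothetical event names; the slides do not define a log format.
CREATE_TENTATIVE = "create_tentative"
CORRECT_FINAL = "correct_final"


def indicators(log):
    """Count the two performance indicators in a list of event names."""
    counts = Counter(log)
    return {
        "created_tentative_links": counts[CREATE_TENTATIVE],
        "corrections": counts[CORRECT_FINAL],
    }


# Invented session, for illustration only.
log = ["create_final", CREATE_TENTATIVE, "create_final",
       CORRECT_FINAL, CREATE_TENTATIVE]
print(indicators(log))
```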

  12. London Orbital is a 117 mile long motorway, encircling almost all of Greater London. Size of the right context = 1.

  13. London Orbital is ......

          1 London               --
          2 Orbital is
          -----------------------------------
          * --> 2           |
          2 --> 1 London    |
          -----------------------------------

  14. Operations on tentative links: create and insert; create; insert; remove. Correction of final links: insert; remove.

  15. London Orbital is ......

          1 London               --
          2 Orbital is
          -----------------------------------
          * --> 2           |
          2 --> 1 London    |
          -----------------------------------

  16. London Orbital is ......

          1 London               --
          2 Orbital is
          -----------------------------------
          * --> 2           |
          2 --> 1 London    | compos
          -----------------------------------

  17. London Orbital is a ......

          1 London      <-.      compos
          2 Orbital     --'      --
          3 is a
          -----------------------------------
          * --> 3           |
          3 --> 2 Orbital   |
          -----------------------------------

  18. London Orbital is a ......

          1 London      <-.      compos
          2 Orbital     --'      --
          3 is a
          -----------------------------------
          * --> 3           |
          3 --> 2 Orbital   | predic
          -----------------------------------

  19. London Orbital is a 117 ......

          1 London      <-.      compos
          2 Orbital     --'<-.   predic
          3 is             --'   --
          4 a 117
          -----------------------------------
          * --> 4           |
          4 --> 3 is        |
          -----------------------------------

  20. London Orbital is a 117 mile ......

          1 London      <-.      compos
          2 Orbital     --'<-.   predic
          3 is             --'   --
          4 a                    --
          5 117 mile
          -----------------------------------
          * --> 5           |
          5 --> 3 is        |
          5 --> 4 a         |
          -----------------------------------
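In the slides shown so far, the box under each tree appears to list, for the active word, only those left-context words that have no head yet. This is an inference from the slides rather than a stated rule; under that assumption the list can be sketched as follows (the function name `candidates` is hypothetical).

```python
def candidates(active, head_of):
    """Left-context positions (1-based) that do not yet have a head."""
    return [i for i in range(1, active) if i not in head_of]


# After slide 19: London is governed by Orbital (2), Orbital by "is" (3).
head_of = {1: 2, 2: 3}
print(candidates(5, head_of))  # [3, 4]
```

The printed positions match the two entries offered for the active word 117 on this slide ("is" and "a").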

  21. London Orbital is a 117 mile long ......

          1 London      <-.      compos
          2 Orbital     --'<-.   predic
          3 is             --'   --
          4 a                    --
          5 117                  --
          6 mile long

  22. London Orbital is a 117 mile long ......

          -----------------------------------
          * --> 6           |
          6 --> 3 is        |
          6 --> 4 a         |
          6 --> 5 117       |
          -----------------------------------

  23. London Orbital is a 117 mile long ......

          -----------------------------------
          * --> 6           |
          6 --> 3 is        |
          6 --> 4 a         |
          6 --> 5 117       | quantit
          -----------------------------------

  24. London Orbital is a 117 mile long motorway ......

          1 London      <-.      compos
          2 Orbital     --'<-.   predic
          3 is             --'   --
          4 a                    --
          5 117         <-.      quantit
          6 mile        --'      --
          7 long motorway

  25. London Orbital is a 117 mile long motorway ......

          -----------------------------------
          * --> 7           |
          7 --> 3 is        |
          7 --> 4 a         |
          7 --> 6 mile      |
          -----------------------------------

  26. London Orbital is a 117 mile long motorway ......

          -----------------------------------
          * --> 7           |
          7 --> 3 is        |
          7 --> 4 a         |
          7 --> 6 mile      | restr
          -----------------------------------

  27. London Orbital is a 117 mile long motorway, encircling ......

          1 London      <-.      compos
          2 Orbital     --'<-.   predic
          3 is             --'   --
          4 a                    --
          5 117         <-.      quantit
          6 mile        --'<-.   restr
          7 long           --'   --
          8 motorway, encircling

  28. London Orbital is a 117 mile long motorway, encircling ......

          -----------------------------------
          * --> 8           |
          8 --> 3 is        |
          8 --> 4 a         |
          8 --> 7 long      |
          -----------------------------------

  29. London Orbital is a 117 mile long motorway, encircling ......

          -----------------------------------
          * --> 8           | 3 copulat
          8 --> 3 is        |
          8 --> 4 a         | determ
          8 --> 7 long      | modif
          -----------------------------------

  30. London Orbital is a 117 mile long motorway, encircling almost ......

          1 London      <-.            compos
          2 Orbital     --'<-.         predic
          3 is             --'   --.   --
          4 a                 <-.  |   determ
          5 117         <-.     |  |   quantit
          6 mile        --'<-.  |  |   restr
          7 long        <-.--'  |  |   modif
          8 motorway,   --'   --'<-'   copulat
          9 encircling almost

  31. London Orbital is a 117 mile long motorway, encircling almost ......

          -----------------------------------
          * --> 9           |
          9 --> 8 motorway  |
          -----------------------------------

  32. London Orbital is a 117 mile long motorway, encircling almost ......

          -----------------------------------
          * --> 9           | 8 modif
          9 --> 8 motorway  |
          -----------------------------------

  33. and so on ...

  34. Three series of experiments were conducted, for right context sizes 0, 1 and 2, with 100 sentences processed in each series. The role of the subjects was played by the authors of this paper. The sentences for the experiments were selected at random from the two sets of sentences offered as training material for the competition "Automatic Gapping Resolution for Russian". Only non-elliptical sentences were used.

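  35. Results by size of the right context:

      | Right context | Total number of links | Tentative links in the trees | Created tentative links | Number of corrections |
      |---|---|---|---|---|
      | 0 | 1627 | 34 = 2.2% | 75 = 4.6% | 3 |
      | 1 | 1741 | 21 = 1.2% | 34 = 2.0% | 0 |
      | 2 | 1607 | 8 = 0.5% | 13 = 0.8% | 0 |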
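The percentages can be recomputed with a short sketch, assuming each one is taken relative to the total number of links in the same row (the slide does not say so explicitly). Under that assumption five of the six figures are reproduced; 34 of 1627 comes out as 2.1% rather than the 2.2% shown on the slide, which may reflect a different denominator or rounding in the original.

```python
# (total links, tentative links in the trees, created tentative links)
rows = {0: (1627, 34, 75), 1: (1741, 21, 34), 2: (1607, 8, 13)}


def share(part, total):
    """Percentage of `part` in `total`, rounded to one decimal place."""
    return f"{100 * part / total:.1f}%"


for context, (total, in_trees, created) in rows.items():
    print(context, share(in_trees, total), share(created, total))
```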

  36. Thank you!
