Insights into non-projectivity in Hindi Prashanth Mannem, Himani - - PowerPoint PPT Presentation

insights into non projectivity in hindi
SMART_READER_LITE
LIVE PREVIEW

Insights into non-projectivity in Hindi Prashanth Mannem, Himani - - PowerPoint PPT Presentation

Insights into non-projectivity in Hindi Prashanth Mannem, Himani Chaudhry and Akshar Bharati LTRC, IIIT Hyderabad, India 50032 { prashanth,himani } @research.iiit.ac.in Aug 4, 2009 Introduction Non-projectivity Graph properties Linguistic


slide-1
SLIDE 1

Insights into non-projectivity in Hindi

Prashanth Mannem, Himani Chaudhry and Akshar Bharati

LTRC, IIIT Hyderabad, India 50032 {prashanth,himani}@research.iiit.ac.in

Aug 4, 2009

slide-2
SLIDE 2

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Outline

1 Introduction

HyDT

2 Non-projectivity

Non-projectivity in HyDT Non-projectivity Analysis

3 Graph properties

HyDT’s graph properties

4 Linguistic Analysis

Classes

5 Summary

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-3
SLIDE 3

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Introduction

◮ Hindi is a verb final, flexible word order language

◮ raama baazaara gayaa thaa

Ram market go.PAST be.PAST

◮ baazaara gayaa thaa raama ◮ raama gayaa thaa baazaara ◮ baazaara raama gayaa thaa

◮ Hyderabad Dependency Treebank (HyDT) for Hindi

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-4
SLIDE 4

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Introduction

◮ Hindi is a verb final, flexible word order language

◮ raama baazaara gayaa thaa

Ram market go.PAST be.PAST

◮ baazaara gayaa thaa raama ◮ raama gayaa thaa baazaara ◮ baazaara raama gayaa thaa

◮ Hyderabad Dependency Treebank (HyDT) for Hindi

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-5
SLIDE 5

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References HyDT

HyDT - Hyderabad Dependency Treebank

◮ Paninian Grammar

◮ Syntactic cues help in determining the type of relation

◮ Sentences annotated with

◮ POS tags ◮ Minimal constituents (chunks) and their heads ◮ Relations between chunks (inter-chunk) ◮ Intra-chunk dependencies left unspecified ◮ Trees can be expanded if needed

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-6
SLIDE 6

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References HyDT

Example

◮ meraa baDaa bhaaii bahuta phala khaataa hai

my big brother lots-of fruits eat PRES.

◮ (( meraa baDaa bhaaii ))NP (( bahuta phala ))NP (( khaataa hai ))VG ◮ (( meraa PRP baDaa JJ bhaaii NN ))NP (( bahuta QF phala NN ))NP ((

khaataa VM hai VAUX ))VG

(( khaataa_VM hai_VAUX ))

NP

(( bahuta_QF phala_NN ))

NP VG

(( meraa_PRP baDzaa_JJ bhaaii_NN ))

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-7
SLIDE 7

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References HyDT

Paninian Grammatical Model

◮ A dependency grammar based approach ◮ Inspired by inflectionally rich language (Sanskrit) ◮ Better suited for handling Indian Languages ◮ Provides syntactico-semantic analysis of language ◮ Various linguistic phenomena handled seamlessly ◮ The grammar facilitates analysis of the intended meaning as an

’expression’ of what the speaker wants to communicate (vivaksha) (Bharati et al., 1995)

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-8
SLIDE 8

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References HyDT

Dependency relations

◮ karaka relations: Direct participants (karaka) of the action denoted by

the verb

◮ 6 basic karakas: karta(subject/agent/doer), karma (object/patient),

karana (instrument), sampradaan (beneficiary), apaadaan (source), adhikarana (location in place/time/other)

◮ Other than karaka relations: purpose, genitive, reason etc... ◮ Relations which are not strictly ’dependency relation’ but are used to

represent ’co-ordination’ and ’complex predicates’

◮ 40 labels in all

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-9
SLIDE 9

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Outline

1 Introduction

HyDT

2 Non-projectivity

Non-projectivity in HyDT Non-projectivity Analysis

3 Graph properties

HyDT’s graph properties

4 Linguistic Analysis

Classes

5 Summary

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-10
SLIDE 10

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Non-projectivity

be.PAST (( thaa )) (( unheM )) kaa )) (( bahuta )) (( shauka )) (( phutbaala He had huge liking for football GEN. football I.OBL huge liking ◮ Every word in the span of relation has to be dominated by the head in

that relation for it to be projective.

◮ Otherwise, the relation is non-projective. ◮ In a flat representation, crossing arcs indicate non-projectivity

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-11
SLIDE 11

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity in HyDT

HyDT and non-projectivity

◮ 1865 sentences, 16620 chunks, 35787 words ◮ 14% sentences have non-projective structures ◮ 1.87% of inter-chunk relations are non-projective ◮ 0.87% if intra-chunk relations are also considered ◮ In PDT 2.0 (Czech), 23% (out of 73088) of the sentences are

non-projective

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-12
SLIDE 12

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity in HyDT

HyDT and non-projectivity

◮ 1865 sentences, 16620 chunks, 35787 words ◮ 14% sentences have non-projective structures ◮ 1.87% of inter-chunk relations are non-projective ◮ 0.87% if intra-chunk relations are also considered ◮ In PDT 2.0 (Czech), 23% (out of 73088) of the sentences are

non-projective

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-13
SLIDE 13

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity in HyDT

Why is non-projectivity important as a constraint

◮ Poses problems in parsing with respect to both accuracy and efficiency ◮ Need special algorithms to handle non-projectivity ◮ Bharati et al. (2008) showed that a major chunk of errors in their

Hindi parser is due to non-projectivity

◮ A need to analyse non-projectivity in Hindi for a better insight into

such constructions

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-14
SLIDE 14

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity in HyDT

Why is non-projectivity important as a constraint

◮ Poses problems in parsing with respect to both accuracy and efficiency ◮ Need special algorithms to handle non-projectivity ◮ Bharati et al. (2008) showed that a major chunk of errors in their

Hindi parser is due to non-projectivity

◮ A need to analyse non-projectivity in Hindi for a better insight into

such constructions

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-15
SLIDE 15

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity in HyDT

Why is non-projectivity important as a constraint

◮ Poses problems in parsing with respect to both accuracy and efficiency ◮ Need special algorithms to handle non-projectivity ◮ Bharati et al. (2008) showed that a major chunk of errors in their

Hindi parser is due to non-projectivity

◮ A need to analyse non-projectivity in Hindi for a better insight into

such constructions

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-16
SLIDE 16

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity Analysis

Non-projectivity analysis

◮ From two perspectives ◮ Graph properties constraining non-projectivity (Kuhlmann and Nivre,

2006; Nivre, 2006)

◮ Like gap degree, edge degree, planarity, well-nestedness ◮ These constraints give an idea of the extent of non-projectivity

◮ Linguistic phenomenon giving rise to non-projectivity

◮ Provides better understanding and gives insight into what kind of

constructions lead to non-projectivity

◮ Can be used as features for better learning

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-17
SLIDE 17

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity Analysis

Non-projectivity analysis

◮ From two perspectives ◮ Graph properties constraining non-projectivity (Kuhlmann and Nivre,

2006; Nivre, 2006)

◮ Like gap degree, edge degree, planarity, well-nestedness ◮ These constraints give an idea of the extent of non-projectivity

◮ Linguistic phenomenon giving rise to non-projectivity

◮ Provides better understanding and gives insight into what kind of

constructions lead to non-projectivity

◮ Can be used as features for better learning

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-18
SLIDE 18

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Non-projectivity Analysis

Non-projectivity analysis

◮ From two perspectives ◮ Graph properties constraining non-projectivity (Kuhlmann and Nivre,

2006; Nivre, 2006)

◮ Like gap degree, edge degree, planarity, well-nestedness ◮ These constraints give an idea of the extent of non-projectivity

◮ Linguistic phenomenon giving rise to non-projectivity

◮ Provides better understanding and gives insight into what kind of

constructions lead to non-projectivity

◮ Can be used as features for better learning

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-19
SLIDE 19

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Outline

1 Introduction

HyDT

2 Non-projectivity

Non-projectivity in HyDT Non-projectivity Analysis

3 Graph properties

HyDT’s graph properties

4 Linguistic Analysis

Classes

5 Summary

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-20
SLIDE 20

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Graph constraints restricting non-projectivity

◮ Gap degree ◮ Edge degree ◮ Planarity ◮ Well-nestedness

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-21
SLIDE 21

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Gap degree

◮ Gap is a pair of adjacent nodes in the projection of a node but not

adjacent in the sentence

◮ Gap degree of a node is the number of gaps in the projection of a

node

◮ Gap degree of a sentence is the maximum among gap degrees of

nodes in the sentence

a f b d e c

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-22
SLIDE 22

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Edge degree

◮ The number of connected components in the span of an edge which

are not dominated by the outgoing node in the edge

◮ Edge degree of a sentence is the maximum among edge degrees of

edges in the sentence

a f b d e c

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-23
SLIDE 23

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Planarity and Well-nestedness

◮ A dependency graph is planar if edges do not cross when drawn above

the sentence

a b c d ◮ A dependency graph is well-nested if no two disjoint subgraphs interleave. ◮ Two subgraphs are disjoint if neither of their roots dominates the other ◮ They interleave if their projections overlap a f c b d e

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-24
SLIDE 24

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References HyDT’s graph properties

HyDT w.r.t the graph properties

Property Count Percentage All structures 1865 Gap degree Gd(0) 1603 85.9% Gd(1) 259 13.89% Gd(2) 0% Gd(3) 3 0.0016% Edge degree Ed(0) 1603 85.9% Ed(1) 254 13.6% Ed(2) 6 0.0032% Ed(3) 1 0.0005% Ed(4) 1 0.0005% Projective 1603 85.9% Planar 1639 87.9% Non-projective 36 1.93% & planar Well-nested 1865 100%

Table: Results on HyDT

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-25
SLIDE 25

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Outline

1 Introduction

HyDT

2 Non-projectivity

Non-projectivity in HyDT Non-projectivity Analysis

3 Graph properties

HyDT’s graph properties

4 Linguistic Analysis

Classes

5 Summary

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-26
SLIDE 26

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Linguistic phenomena leading to non-projectivity

◮ Classes

◮ Relative co-relative constructions ◮ Extraposed relative clause constructions ◮ Intra-clausal non-projectivity ◮ Paired connectives ◮ ki complement clauses ◮ Genitive relation split by a verb modifier ◮ Phrase split a co-ordinating structure ◮ Shared argument splitting the non-finite clause ◮ Others

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-27
SLIDE 27

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Analysis of the non-projective classes

◮ Cues to identify non-projectivity ◮ Rigidity

◮ Reorderability of the constructions retaining the gross meaning ◮ Gross meaning − Meaning of the sentence not taking the discourse

and topic-focus into consideration

◮ What is the best projective approximation possible by reordering? ◮ Is this projective construction more natural compared to the

non-projective one?

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-28
SLIDE 28

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Relative co-relative constructions

[then] About one−fourth of the night was over when he felt unconscious (( taba )) (( raata )) (( lagabhaga )) (( chauthaaii )) (( Dhala__chukii thii )) (( jaba )) (( unheM ))

  • ne−fourth

about night then when be.PAST (( behoshii sii aaii ))

  • ver

come.PAST like unconsciouness he.OBL

◮ Cues: relative co-relatives like jaba-taba (when-then), jo-vo

(which-that), jahaaM-vahaaM (where-there), jisa-usa (which-that)

◮ Not rigid ◮ Can be made projective by reordering ◮ Hard to say which among the projective & non-projective ones is more

natural

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-29
SLIDE 29

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Extraposed relative clause constructions

(( kiyaa thaa )) work (( jahaaM )) (( kaama )) (( mai ne )) where . publish.PASTbe.PAST I ERG. did.PAST be.PAST This letter was published in the press at Mumbai where I worked (( yaha patra )) (( mumbai ke )) (( usa (( capathaa thaa )) presa me )) This letter mumbai GEN. in press that

◮ NP and the relative clause are separated by the verb group ◮ Cues: Relative pronoun following a verb group ◮ Not rigid ◮ Extraposed relative clause can be moved next to the noun phrase to

make it projective

◮ Resulting projective construction is less natural than the original

non-projective one

◮ Most common non-projective class

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-30
SLIDE 30

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Paired connectives

(( rupayoM thii )) (( to )) (( bataanaa chahiye thaa )) (( yadi )) if then should kii )) (( zaruurata ko )) (( mujha money.OBL GEN. be.PAST need be.PAST I.OBL DAT. tell.INF If [you] needed money then [you] should have told me

◮ Cues: Paired connectives like agar-to (if -then), yadi-to (if -then) ◮ Can be reordered and is not rigid ◮ The phrase that comes after to followed by yadi clause and then to

◮ to is optional here

◮ Resulting projective construction is not a natural one

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-31
SLIDE 31

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

ki complement clause constructions

(( tyaaga )) (( paate the )) (( na )) (( usako )) (( ki )) (( shauka )) (( aisaa )) kaa )) (( phutbaala (( unheM )) (( thaa )) I.OBL football GEN. such liking be.PAST that it.OBL not give−up be.PAST able−to He had such [a] liking for football that he was not able to give it up

◮ Cues: ki comes after words like yaha (this), aisaa (such), is

tarah (such), itana (this much)

◮ Takes the pattern yaha-its property-VP-ki clause

◮ Rigid ◮ If VP has a transitive verb, then the ki clause and the referent both

modify the verb, making it projective

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-32
SLIDE 32

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Genitive relation split by a verb modifier

be.PAST (( thaa )) (( unheM )) kaa )) (( bahuta )) (( shauka )) (( phutbaala He had huge liking for football GEN. football I.OBL huge liking

◮ No obvious cues ◮ Is not rigid ◮ Move the verb modifier out of the genitive phrase to make it

projective

◮ Projective one is more natural

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-33
SLIDE 33

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Phrase splitting a co-ordinating structure

DAT. (( vaha )) (( 1795 meM )) (( milaa )) it After this Jaman Shah [got it] and then, in 1795 Shah Shuja got it (( isake baada )) (( jamaana shah )) (( aura )) (( phira )) (( shaaha shujaa ko )) Jaman Shah this.OBL after and then 1795 in Shah Shuja get.PAST

◮ Cues: NONE ◮ Adverb occuring in the middle of a co-ordinating structure ◮ Is not rigid ◮ Projective one is more natural

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-34
SLIDE 34

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Shared argument splitting the non-finite clause

be.PAST (( prakaashaka ke saamane )) (( hama )) (( usa lekhakiiya asmitaa ko )) (( sagarva )) (( rakhakara )) (( karate the )) (( baata )) We used to talk after placing that writer’s identity proudly before the publisher that writer’s identity ACC. we proudly publisher GEN. before place.PRT talk do.IMPF

◮ Cues: NONE ◮ Is not rigid ◮ Projective one is more natural

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-35
SLIDE 35

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Others

◮ Few very rare and not natural phenomena ◮ Annotation errors ◮ Inconsistent NULL placement

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-36
SLIDE 36

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References Classes

Non-projective classes in HyDT

Non-projective Class Count % Relative co-relatives constructions 18 6.8 % Extraposed realtive clause constructions 101 38.0 % Intra-clausal non-projectivity 12 4.5 % Paired connectives 33 12.4 % ki complement clauses 52 19.5 % Genitive relation split by a verb modifier 10 3.8 % Phrase splitting a co-ordinating structure 4 1.5 % Shared argument splitting the non-finite clause 10 3.8 % Others 26 9.8 % Table: Non-projectivity class distribution in HyDT

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-37
SLIDE 37

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Outline

1 Introduction

HyDT

2 Non-projectivity

Non-projectivity in HyDT Non-projectivity Analysis

3 Graph properties

HyDT’s graph properties

4 Linguistic Analysis

Classes

5 Summary

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-38
SLIDE 38

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Summary

◮ Analysed non-projectivity in HyDT from two perspectives ◮ Gap degree and edge degree ≤ 1 ensures 99.99% coverage ◮ Non-projective structures classified into 8 categories ◮ Around 75% of the non-projective cases can be identified using strong

lexical cues

◮ Parsers can make use of this information and determine non-projective

arcs directly

◮ The rest are hard to recognize and need extra information (world

knowledge!) to identify non-projectivity in them

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-39
SLIDE 39

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Summary

◮ Analysed non-projectivity in HyDT from two perspectives ◮ Gap degree and edge degree ≤ 1 ensures 99.99% coverage ◮ Non-projective structures classified into 8 categories ◮ Around 75% of the non-projective cases can be identified using strong

lexical cues

◮ Parsers can make use of this information and determine non-projective

arcs directly

◮ The rest are hard to recognize and need extra information (world

knowledge!) to identify non-projectivity in them

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-40
SLIDE 40

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Summary

◮ Analysed non-projectivity in HyDT from two perspectives ◮ Gap degree and edge degree ≤ 1 ensures 99.99% coverage ◮ Non-projective structures classified into 8 categories ◮ Around 75% of the non-projective cases can be identified using strong

lexical cues

◮ Parsers can make use of this information and determine non-projective

arcs directly

◮ The rest are hard to recognize and need extra information (world

knowledge!) to identify non-projectivity in them

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-41
SLIDE 41

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Summary

◮ Analysed non-projectivity in HyDT from two perspectives ◮ Gap degree and edge degree ≤ 1 ensures 99.99% coverage ◮ Non-projective structures classified into 8 categories ◮ Around 75% of the non-projective cases can be identified using strong

lexical cues

◮ Parsers can make use of this information and determine non-projective

arcs directly

◮ The rest are hard to recognize and need extra information (world

knowledge!) to identify non-projectivity in them

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-42
SLIDE 42

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Summary

◮ Analysed non-projectivity in HyDT from two perspectives ◮ Gap degree and edge degree ≤ 1 ensures 99.99% coverage ◮ Non-projective structures classified into 8 categories ◮ Around 75% of the non-projective cases can be identified using strong

lexical cues

◮ Parsers can make use of this information and determine non-projective

arcs directly

◮ The rest are hard to recognize and need extra information (world

knowledge!) to identify non-projectivity in them

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-43
SLIDE 43

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Summary

◮ Analysed non-projectivity in HyDT from two perspectives ◮ Gap degree and edge degree ≤ 1 ensures 99.99% coverage ◮ Non-projective structures classified into 8 categories ◮ Around 75% of the non-projective cases can be identified using strong

lexical cues

◮ Parsers can make use of this information and determine non-projective

arcs directly

◮ The rest are hard to recognize and need extra information (world

knowledge!) to identify non-projectivity in them

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-44
SLIDE 44

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

Thank You

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-45
SLIDE 45

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

References

◮ R. Begum, S. Husain, A. Dhwaj, D. Sharma, L. Bai, and R. Sangal.

  • 2008. Dependency annotation scheme for Indian Languages. In

Proceedings of The Third International Joint Conference on Natural Language Processing (IJCNLP), Hyderabad, India.

◮ Akshar Bharati, Vineet Chaitanya, and Rajeev Sangal. 1995. Natural

Language Processing: A Paninian Perspective. Prentice-Hall of India.

◮ Akshar Bharati, Rajeev Sangal, and Dipti Sharma. 2005. Shakti

analyser: Ssf representation. Technical report, International Institute

  • f Information Technology, Hyderabad, India.

◮ Akshar Bharati, Samar Husain, Bharat Ambati, Sambhav Jain, Dipti

Sharma, and Rajeev Sangal. 2008a. Two semantic features make all the difference in parsing accuracy. In Proceedings of the 6th International Conference on Natural Language Processing (ICON-08), Pune, India.

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-46
SLIDE 46

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

References

◮ Marco Kuhlmann and Mathias Mohl. 2007. Mildly contextsensitive

dependency languages. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 160167, Prague, Czech Republic, June. Association for Computational Linguistics.

◮ Marco Kuhlmann and Joakim Nivre. 2006. Mildly nonprojective

dependency structures. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 507514, Sydney, Australia,

  • July. Association for Computational Linguistics.

◮ Ryan McDonald and Joakim Nivre. 2007. Characterizing the errors of

data-driven dependency parsing models. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 122131, Prague, Czech Republic, June. Association for Computational Linguistics.

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi

slide-47
SLIDE 47

Introduction Non-projectivity Graph properties Linguistic Analysis Summary References

References

◮ Joakim Nivre, Johan Hall, Sandra Kubler, Ryan McDonald, Jens

Nilsson, Sebastian Riedel, and Deniz Yuret. 2007. The CoNLL 2007 shared task on dependency parsing. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, pages 915932, Prague, Czech Republic, June. Association for Computational Linguistics.

◮ Joakim Nivre. 2006. Constraints on non-projective dependency

  • parsing. In In Proceedings of European Association of Computational

Linguistics (EACL), pages 7380.

  • P. Mannem, H. Chaudhry and A. Bharati

IIIT,Hyderabad Insights into non-projectivity in Hindi