Extending the DCU-250 Gold Standard f-structure Bank H. B echara - - PowerPoint PPT Presentation

extending the dcu 250 gold standard f structure bank
SMART_READER_LITE
LIVE PREVIEW

Extending the DCU-250 Gold Standard f-structure Bank H. B echara - - PowerPoint PPT Presentation

Outline Motivation Background Methodology Evaluation Conclusion and Future Work Extending the DCU-250 Gold Standard f-structure Bank H. B echara hbechara@computing.dcu.ie 1/29 Hanna B echara Internship Report Outline Motivation


slide-1
SLIDE 1

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Extending the DCU-250 Gold Standard f-structure Bank

  • H. B´

echara hbechara@computing.dcu.ie

1/29 Hanna B´ echara Internship Report

slide-2
SLIDE 2

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Outline

1

Motivation

2

Background

3

Methodology

4

Evaluation

5

Conclusion and Future Work

2/29 Hanna B´ echara Internship Report

slide-3
SLIDE 3

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Outline

1

Motivation

2

Background

3

Methodology

4

Evaluation

5

Conclusion and Future Work

3/29 Hanna B´ echara Internship Report

slide-4
SLIDE 4

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Motivation

  • Produce an ATB-based LFG gold resource for parsing

evaluation similar to DCU’s previous work on English, German, Chinese, etc.

  • Extend the existing Arabic LFG Gold Standard, from 250

annotated sentences to 500. A larger variety of grammatical phenomena A more comprehensive reference A more general sample for evaluation

4/29 Hanna B´ echara Internship Report

slide-5
SLIDE 5

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Outline

1

Motivation

2

Background

3

Methodology

4

Evaluation

5

Conclusion and Future Work

5/29 Hanna B´ echara Internship Report

slide-6
SLIDE 6

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Arabic Grammar

Some Particularities of Arabic Grammar Sentences can be very long (longest sentence is 384, average sentence 30) Word Order is quite flexible Dropping subjects, objects, relative pronouns (pro-drop) Word endings can overlap for noun cases

6/29 Hanna B´ echara Internship Report

slide-7
SLIDE 7

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Penn Arabic Treebank (ATB)

23,611 parse-annotated sentences in Modern Standard Arabic (Maamouri and Bies 2004) Buckwalter Transliteration: Strictly one-to-one transliteration from Arabic to Latin characters (ASCII) Part of Speech Tags (Noun, Verb, Prep) Phrasal Tags (NP, VP, PP) Functional Tags (OBJ, SUBJ, ADJ)

7/29 Hanna B´ echara Internship Report

slide-8
SLIDE 8

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Penn Arabic Treebank (ATB)

8/29 Hanna B´ echara Internship Report

slide-9
SLIDE 9

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Arabic Annotation Algorithm

The Arabic Annotation Algorithm aims to convert the c-structure provided by the Penn Arabic Treebank into an f-structure. It is a recursive process which annotates eah node of a tree with f-structure information used to generate proper f-structures

9/29 Hanna B´ echara Internship Report

slide-10
SLIDE 10

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Arabic Annotation Algorithm

10/29 Hanna B´ echara Internship Report

slide-11
SLIDE 11

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Outline

1

Motivation

2

Background

3

Methodology

4

Evaluation

5

Conclusion and Future Work

11/29 Hanna B´ echara Internship Report

slide-12
SLIDE 12

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Methodology

Random Selection of 250 new sentences from the Penn Arabic Treebank Application of the Arabic Annotation Algorithm Combination of old and new Sets for Full Evaluation.

12/29 Hanna B´ echara Internship Report

slide-13
SLIDE 13

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Methodology

Correction Method Surface Improvements (manual, semi-automatic and automatic)

Noun Cases Functional Tags Improper Constructions

Annotation Improvements (manual, semi-automatic and automatic)

Adjunct Tags Pro-Drop Resolving Clashes

13/29 Hanna B´ echara Internship Report

slide-14
SLIDE 14

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Surface Changes

Noun Case Ambiguity Arabic has three noun cases which are generally differentiated morphologically based on word endings. Generally: Nominative (NOM): -u Accusative (ACC): -a Genitive (GEN): - i However, there are particular instances where both the genitive and accusative endings are the same. Case Female Plurals Male Plurals Duals Nominative

  • AtN
  • uwon
  • An

Genitive

  • AtK
  • iyon
  • ayon

Accusative

  • AtK
  • iyon
  • ayon

The morphological analyser assigns these words the tag: ACCGEN This Tag occurs 162 times in the 500 sentences.

14/29 Hanna B´ echara Internship Report

slide-15
SLIDE 15

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Surface Changes

Noun Case Ambiguity (Automatic) Habash and Rambow, 2007: Determining Case in Arabic: Learning Complex Linguistic Behaviour Requires Complex Linguistic Features. We explore the local subtree’s current node, mother node, and sister nodes. ACC: ADJ, CONJ, OBJ, TPC, PRD of subordinating conjunction GEN: ADJ, CONJ, PP, NP-adjuncts (Idafa construction)

15/29 Hanna B´ echara Internship Report

slide-16
SLIDE 16

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Surface Changes

Missing Functional Tags (Semi-automatic) When the word is unreadable, the analyser fails to assign a part of speech tag. A word becomes unreadable when it is improperly alliterated, usually due to missing vowels. Examples: fsTynyA xTAb AstrAtyjyA The morphological analyser assigns these words the tag: NO FUNC This Tag occurs 82 times in the 500 sentences.

16/29 Hanna B´ echara Internship Report

slide-17
SLIDE 17

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Surface Improvements

Improper Sentence Construction (Manual) Problems that arise from the Parser’s confusion and/or tokenisation. Example: fa+sa+nalEab+u (then+will+we+play) Example: Helping the elderly and the poor and the handicapped and feeding the hungry.

17/29 Hanna B´ echara Internship Report

slide-18
SLIDE 18

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Annotation Improvements

Specifying Adjuncts

Appositions Adjective Types: attributive, predicative. Adverbs Prepositional Phrases: temporal, directional, locative, etc. Titles: Lexicalising 52 Titles (Mr, Miss, Dr, Sir, Prince, Queen, President, etc)

18/29 Hanna B´ echara Internship Report

slide-19
SLIDE 19

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Annotation Improvements

Appositions (ATB Guidelines) Names in apposition are an exception to the ’all adjuncts on same level’ rule: an extra NP level is added in the tree (NP (NP (NP head noun) (XP any adjunct)) (NP appositive name)

19/29 Hanna B´ echara Internship Report

slide-20
SLIDE 20

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Annotation Improvements

Demonstrative Pronouns h‘*ihi + Al+tagoyiyrAt+i + tata$Abak+u + *Akirat+u+hA + fiy + h‘*A + Al+faDA’+i + Al+HaDAriy +i these + the+changes + be interwoven + remembering+its + in + this + the+space + the+cultural Remembering these changes is interwoven with this cultural space NP modified by quantificational NP akalot+u + Al+dajAjap+a + niSofa+hA ate + the+chicken + half +its I ate half of the chicken NP modified by numerical NP qaraot+u + Al+kitAb+a + Ei$rina + SafoHap+F + min+hu read + the+book + twenty + page from+it I read twenty pages of the book

20/29 Hanna B´ echara Internship Report

slide-21
SLIDE 21

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Annotation Improvements

Specifying Adjuncts

Appositions Adjective Types: attributive, predicative. Adverbs Prepositional Phrases: temporal, directional, locative, etc. Titles: Lexicalising 52 Titles (Mr, Miss, Dr, Sir, Prince, Queen, President, etc)

21/29 Hanna B´ echara Internship Report

slide-22
SLIDE 22

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Annotation Improvements

Resolving Clashes

A problem of Heads: Predicates preceding subjects in nominal sentences. A problem of Traces: Phonetically Empty WHNP A problem of Subjects: Every Sentence needs a subject.

Resolving Traces: Passive constructions (S (VP *uhila (NP-SBJ-1 Aljumhuwru) (NP-OBJ-1 *))) *uhil+a + Al+jumohuwr+u shocked + the+audience The audience was shocked Pro-drop

22/29 Hanna B´ echara Internship Report

slide-23
SLIDE 23

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Outline

1

Motivation

2

Background

3

Methodology

4

Evaluation

5

Conclusion and Future Work

23/29 Hanna B´ echara Internship Report

slide-24
SLIDE 24

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Interannotator Agreement

Calculating Agreement An evaluation set of 50 sentences including all the problems

  • utlined earlier and annotated using the Arabic Annotation

Algorithm has been selected. The automatic annotations were corrected by two separate annotators and agreement was calculated based on Artstein and Poesio’s coefficients for Pi, S, and Kappa.

24/29 Hanna B´ echara Internship Report

slide-25
SLIDE 25

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Calculating Agreement

S: All Categories are equally likely (Bennett, Alpert, and Goldstein 1954) π: Random assignment of categories to items is governed by the distribution of items among categories in the actual world. (Scott 1955) κ: If coders were operating by chance alone, we would get a separate distribution for each coder. (Cohen 1950)

25/29 Hanna B´ echara Internship Report

slide-26
SLIDE 26

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Interannotator Agreement

Results Agreement for the Evaluation Set S 0.98608303 Pi 0.9843478 Kappa 0.98124266 Agreement for Specific Cases Traces 0.866666 Noun Case 1.0

26/29 Hanna B´ echara Internship Report

slide-27
SLIDE 27

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Outline

1

Motivation

2

Background

3

Methodology

4

Evaluation

5

Conclusion and Future Work

27/29 Hanna B´ echara Internship Report

slide-28
SLIDE 28

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Conclusion

Summing Up Selected 250 new parsed sentences from the Penn Arabic Treebank Applied the Arabic Annotation Algorithm to the 250 new sentences Merged the 250 with the existing gold standard Isolated clashes and particularities that the Annotation Algorithm missed Improved the Gold Standard and the Annotation Algorithm

28/29 Hanna B´ echara Internship Report

slide-29
SLIDE 29

Outline Motivation Background Methodology Evaluation Conclusion and Future Work

Future Work

Solve coordination instances with no apparent conjunction or punctuation. Standardising how to deal with numbers. Extending the Gold Standard even further

29/29 Hanna B´ echara Internship Report