Supersense Tagging for Arabic: The MT-in-the-Middle Attack
Nathan Schneider Behrang Mohit Chris Dyer Kemal Oflazer Noah A. Smith
1
Supersense Tagging for Arabic: The MT-in-the-Middle Attack Nathan - - PowerPoint PPT Presentation
Supersense Tagging for Arabic: The MT-in-the-Middle Attack Nathan Schneider Behrang Mohit Chris Dyer Kemal Oflazer Noah A. Smith 1 Gameplan Supersense(Tagging Baselines MT0in0the0Middle Analysis Outlook 3 Supersense(Tagging A
Nathan Schneider Behrang Mohit Chris Dyer Kemal Oflazer Noah A. Smith
1
Supersense(Tagging Baselines MT0in0the0Middle Analysis Outlook
3
(partitioning of WordNet synsets)
26 noun categories (Ciaramita & Johnson 2003)
4
Pierre Vinken , 61 years old , will join the board as a nonexecutive director N
PERSON SOCIAL GROUP PERSON TIME
(Schneider et al. 2012)
(Schneider et al. 2012)
(Ciaramita & Altun 2006)
5
77% F1 in-domain 65k words—1/6 the size of SemCor
Arabic WordNet entries + OntoNotes NEs
nouns in our corpus
6
P R F1 Ann-A 32 16 21.6 Ann-B 29 15 19.4
model
Kirkpatrick et al. 2010)
P R F1 Ann-A 20 16 17.5 Ann-B 14 10 11.6 [evaluating on Arabic Wikipedia test set— 18 articles, 40k words]
7
c d e c
( تﺎﻧوﺮﺘﻜﻟﻹا ) ﺔﺒﻟﺎﺴﻟا تﺎﻨﺤﺸﻟا ﻦﻣ ﺔﺑﺎﺤﺳ ﻦﻣ ةرﺬﻟا نﻮﻜﺘﺗ . ﻂﺳﻮﻟا ﻲﻓ اﺪﺟ ةﺮﻴﻐﺻ ﺔﻨﺤﺸﻟا ﺔﺒﺟﻮﻣ ةاﻮﻧ لﻮﺣ مﻮﲢ
(cf. Zitouni & Florian 2008; Rahman & Ng 2012)
NIST 2012 GWord
8
The(corn(is(composed(of(negative(shipments(((electronics()( cloud(hovering(over(the(nucleus(of(a(very(small(positive( shipment(in(the(center(.
PLANT COGNITION BODY ARTIFACT LOCATION ARTIFACT
8
The(corn(is(composed(of(negative(shipments(((electronics()( cloud(hovering(over(the(nucleus(of(a(very(small(positive( shipment(in(the(center(.
PLANT COGNITION BODY ARTIFACT LOCATION ARTIFACT
8
The(corn(is(composed(of(negative(shipments(((electronics()( cloud(hovering(over(the(nucleus(of(a(very(small(positive( shipment(in(the(center(.
PLANT COGNITION BODY ARTIFACT LOCATION ARTIFACT
9
matching:
P R F1 Ann-A 32 16 21.6 Ann-B 29 15 19.4
P R F1 Ann-A 37 31 33.8 Ann-B 38 32 34.6
9
P R F1 Ann-A 37 31 33.8 Ann-B 38 32 34.6
P R F1 Ann-A 35 36 35.5 Ann-B 36 36 36.0
English supersense tagging, and projection
10
English system from QCRI
consistently better (by 2–4 points). Why?
11
BLEU METEOR TER QCRI 32.86 32.10 0.46 cdec 28.84 31.38 0.49
necessarily measure preservation of coarse lexical semantics
adequacy for noun phrases
acceptability judgments for a sample of sentences
12
(cf. Carpuat 2013: SSSST)
90.0% for cdec
genoa lynx for GNU Linux
13
does slightly better at it
projection
QCRI gives phrase alignments
14
(noisily) for a language so long as it can be automatically translated to English
alignments
15
16