Supervised Learning of Complete Morphological Paradigms
Greg Durrett and John DeNero
UC Berkeley / Google
Morphological Inflection
Zug: train (de), 8 forms total
  NOM, SING: Zug
  NOM, PLU: Züge
  GEN, SING: Zuges
  GEN, PLU: Züge
  ...
Rule u
  NOM, SING: u
  NOM, PLU: ü
  GEN, SING: u
  GEN, PLU: ü
  ...
Rule g
  NOM, SING: g
  NOM, PLU: ge
  GEN, SING: ges
  GEN, PLU: ge
  ...
Schnellzug: express train (de)
  NOM, SING: Schnellzug
  NOM, PLU: Schnellzüge
  GEN, SING: Schnellzuges
  GEN, PLU: Schnellzüge
  ...
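The Schnellzug example can be reproduced with a small sketch. It assumes a rule is a mapping from attribute labels to replacement strings, applied at a known span of the base form. The names `apply_rules`, `RULE_U`, and `RULE_G` are illustrative and do not come from the talk.

```python
# Hypothetical representation: a rule maps each attribute setting to the
# string that replaces a span of the base form.
RULE_U = {"NOM,SING": "u", "NOM,PLU": "ü", "GEN,SING": "u", "GEN,PLU": "ü"}
RULE_G = {"NOM,SING": "g", "NOM,PLU": "ge", "GEN,SING": "ges", "GEN,PLU": "ge"}


def apply_rules(base, anchored_rules):
    """Rewrite `base` using (start, end, rule) triples sorted by start."""
    attributes = anchored_rules[0][2].keys()
    table = {}
    for attr in attributes:
        pieces, pos = [], 0
        for start, end, rule in anchored_rules:
            pieces.append(base[pos:start])   # unchanged text before the span
            pieces.append(rule[attr])        # rule output for this attribute
            pos = end
        pieces.append(base[pos:])            # unchanged tail
        table[attr] = "".join(pieces)
    return table


base = "Schnellzug"
u_at = base.rindex("u")          # the span rewritten by the u rule
g_at = len(base) - 1             # the final g, rewritten by the g rule
table = apply_rules(base, [(u_at, u_at + 1, RULE_U), (g_at, g_at + 1, RULE_G)])
for attr, form in table.items():
    print(f"{attr}: {form}")
```

Because the rules only touch the spans u and g, the same two rules inflect both Zug and Schnellzug.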
Morphological Inflection
valkuainen: egg white (fi), back vowels
  NOM, SING: valkuainen
  GEN, SING: valkuaisen
  PART, SING: valkuaista
  ...
syyllinen: culprit (fi), front vowels
  NOM, SING: syyllinen
  GEN, SING: syyllisen
  PART, SING: syyllistä
  ...
Suffix morphology can depend on context in a complex way
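The dependence on context can be illustrated with a toy rule for the two words on this slide. This is only a sketch: it assumes a simplified vowel harmony test (any of a, o, u anywhere in the word selects the back-vowel suffix), and the function name `partitive_singular` is hypothetical. Real Finnish has further cases, such as compounds, that this ignores.

```python
BACK_VOWELS = set("aou")


def partitive_singular(word):
    """Toy rule for nominals ending in -nen: swap -nen for -s, add -ta or -tä."""
    if not word.endswith("nen"):
        raise ValueError("this sketch only covers words ending in -nen")
    stem = word[:-3] + "s"
    # The suffix vowel is chosen by vowels that may sit far from the suffix.
    suffix = "ta" if BACK_VOWELS & set(word) else "tä"
    return stem + suffix


for word in ["valkuainen", "syyllinen"]:
    print(word, "->", partitive_singular(word))
```

The choice between -ta and -tä depends on characters several positions away from the suffix, which is why a predictor needs features beyond the immediate word ending.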
Morphological Inflection
Given: base form (word)
Predict: inflection table (variants)
  Attributes(1): Inflected form(1)
  Attributes(2): Inflected form(2)
  ...
Language/POS independence
Handle multi-part paradigms
Rich way to predict inflections
Outline
Rule extraction
  Kluft ..., Zug ... (>1000) -> g ..., u ... (<50)
Paradigm prediction
  (Zug ..., u ...)
  Base form -> Inflection table predictor
Wiktionary Data
Thousands of full inflection tables for many languages and parts of speech
http://en.wiktionary.org/wiki/Zug
2763 more...
Previous Work
Dreyer and Eisner (2011) develop a semi-supervised Bayesian model
Wicentowski (2004) predicts one variant at a time (the lemma)
Rule u
  NOM, SING: u
  NOM, PLU: ü
  GEN, SING: u
  GEN, PLU: ü
  ...
Rule Extraction
dringen: to insist (de), 27 forms total
  1P, PRES, SING: dringe
  2P, PRES, SING: dringst
  3P, PRES, SING: dringt
  1P, PAST, SING: drang
  ...
Segmented base form: dr i ng en
Rule i
  1P, PRES, SING: i
  2P, PRES, SING: i
  3P, PRES, SING: i
  1P, PAST, SING: a
  ...
Rule en
  1P, PRES, SING: e
  2P, PRES, SING: st
  3P, PRES, SING: t
  1P, PAST, SING: -
  ...
Alignment to Base Form
Each inflected form is aligned to the base form (D = deletion, S = substitution)
  INFINITIVE:     d r i n g e n
  1P, PRES, SING: d r i n g e      D
  2P, PRES, SING: d r i n g s t    S S
  3P, PRES, SING: d r i n g t      D S
  1P, PAST, SING: d r a n g        D D S
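The D and S markers on this slide match what a standard edit-distance alignment produces. The sketch below uses plain unit costs for substitution, deletion, and insertion. That cost scheme is an assumption, since the slides do not state the costs used, and the function name `align` is illustrative.

```python
def align(base, form):
    """Return edit operations turning `base` into `form`.

    '=' copy, 'S' substitute, 'D' delete from base, 'I' insert into form.
    """
    n, m = len(base), len(form)
    # cost[i][j] = edit distance between base[:i] and form[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (base[i - 1] != form[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)
    # Trace back from the bottom-right corner to recover one optimal path.
    ops, i, j = [], n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0
                and cost[i][j] == cost[i - 1][j - 1] + (base[i - 1] != form[j - 1])):
            ops.append("=" if base[i - 1] == form[j - 1] else "S")
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append("D")
            i -= 1
        else:
            ops.append("I")
            j -= 1
    return ops[::-1]


base = "dringen"
for form in ["dringe", "dringst", "dringt", "drang"]:
    changes = [op for op in align(base, form) if op != "="]
    print(form, " ".join(changes))
```

Several optimal alignments can exist for one pair, so the order of the printed operations depends on tie-breaking in the traceback. The number of deletions and substitutions per form is the same for every optimal alignment of these four pairs.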
Concatenation
  INFINITIVE:     d r i n g e n
  1P, PRES, SING: d r i n g e
  2P, PRES, SING: d r i n g s t
  3P, PRES, SING: d r i n g t
  1P, PAST, SING: d r a n g
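One way to read this step is that the per-form alignments are combined to find which spans of the base form change in any inflected form, and each such span becomes a rule. The sketch below follows that reading under stated assumptions: it uses Python's `difflib.SequenceMatcher` as a stand-in aligner, merges adjacent changed characters into one span, and attaches inserted text to the preceding base character. The function names are hypothetical and none of these choices are confirmed by the slides.

```python
from difflib import SequenceMatcher


def project(base, form):
    """Per base character: the form text aligned to it, and a changed flag."""
    out = [""] * len(base)
    changed = [False] * len(base)
    matcher = SequenceMatcher(None, base, form, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            for k in range(i2 - i1):
                out[i1 + k] += form[j1 + k]
        elif tag == "insert":
            # Attach inserted text to the preceding base character
            # (or to the first one when the insertion is at the start).
            k = max(i1 - 1, 0)
            out[k] += form[j1:j2]
            changed[k] = True
        else:  # "replace" or "delete": the whole base span is rewritten
            out[i1] += form[j1:j2]
            for k in range(i1, i2):
                changed[k] = True
    return out, changed


def extract_rules(base, forms):
    """Return {(start, end): [output per form]} for every changed span."""
    projections = [project(base, f) for f in forms]
    changed = [any(p[1][k] for p in projections) for k in range(len(base))]
    rules, k = {}, 0
    while k < len(base):
        if not changed[k]:
            k += 1
            continue
        start = k
        while k < len(base) and changed[k]:
            k += 1
        rules[(start, k)] = ["".join(p[0][start:k]) for p in projections]
    return rules


forms = ["dringe", "dringst", "dringt", "drang"]
rules = extract_rules("dringen", forms)
for (start, end), outputs in rules.items():
    print("dringen"[start:end], "->", outputs)
```

For dringen this yields one rule on the span i and one on the span en, with an empty string where the slides write "-".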
Paradigm Prediction
w i n d e n: to wind (de)
Candidate rules
  i1:  i  -> i, i, i, a, ...
  en1: en -> e, st, t, -, ...
  en2: en -> e, est, et, -, ...
  n1:  n  -> -, st, t, te, ...
Lattice over the base form (= leaves a character unchanged)
  w: =
  i: =, i1
  n: =, n1
  d: =
  e: =
  n: =, n1
  en (as one span): en1, en2
Paths through this lattice are hypotheses
Model with semi-Markov CRF (Sarawagi and Cohen 2004)
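The hypothesis space for winden can be enumerated directly. The sketch below lists every path through the lattice, where each step either copies one character or applies a rule whose input matches at that position. It only enumerates paths and does not score them, so it is not a semi-Markov CRF. The names `RULE_INPUTS` and `paths` are illustrative.

```python
# Rule inputs as read from the slide; outputs are omitted because only the
# lattice structure is illustrated here.
RULE_INPUTS = {"i1": "i", "n1": "n", "en1": "en", "en2": "en"}


def paths(base, pos=0):
    """Yield every segmentation of base[pos:] into copies and rule spans."""
    if pos == len(base):
        yield []
        return
    # Option 1: copy one character unchanged.
    for rest in paths(base, pos + 1):
        yield [("=", base[pos])] + rest
    # Option 2: apply any rule whose input matches at this position.
    for name, source in RULE_INPUTS.items():
        if base.startswith(source, pos):
            for rest in paths(base, pos + len(source)):
                yield [(name, source)] + rest


hypotheses = list(paths("winden"))
print(len(hypotheses))  # 16
for path in hypotheses[:3]:
    print(path)
```

Rules such as en1 cover two characters in a single step, which is why a segment-level (semi-Markov) model fits this lattice better than a per-character one.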
Prediction Features
w i n d e n, rule i1 applied at i
binden, verbinden, ...
Example feature: [i1] nd
Rule identity conjoined with 1- through 4-grams at offsets up to +/-5
Coarse features shared between different rules
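A minimal sketch of the n-gram features follows. It assumes the offset is measured from the start of the rule's span to the start of the n-gram, and that n-grams running past the word boundary are skipped. Both choices, the feature string format, and the name `rule_features` are assumptions, since the slide does not define them.

```python
def rule_features(base, start, rule_id, max_n=4, max_offset=5):
    """Features for applying `rule_id` at position `start` of `base`."""
    feats = []
    for offset in range(-max_offset, max_offset + 1):
        for n in range(1, max_n + 1):
            lo = start + offset
            hi = lo + n
            if lo < 0 or hi > len(base):
                continue  # skip n-grams that fall outside the word
            feats.append(f"{rule_id}|off={offset}|{base[lo:hi]}")
    return feats


shared = "i1|off=1|nd"
for word, start in [("winden", 1), ("binden", 1), ("verbinden", 4)]:
    print(word, shared in rule_features(word, start, "i1"))
```

The feature pairing i1 with a following "nd" fires for winden, binden, and verbinden alike, so evidence from the latter two can carry over to an unseen base form.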
Learning
The gold inflection table of every training example can be produced using our rules
Optimize conditional log-likelihood of the correct rule sequence
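The objective can be written out for a toy case. The sketch assumes a log-linear model in which each hypothesis is a list of feature names and its score is the sum of their weights. It normalizes by enumerating the hypotheses, whereas a real system would sum over lattice paths with dynamic programming. All names and weights here are illustrative.

```python
import math


def log_likelihood(weights, gold, hypotheses):
    """log p(gold | base form) under a log-linear model over hypotheses.

    `gold` must be one of `hypotheses`; each is a list of feature names.
    """
    def score(feats):
        return sum(weights.get(f, 0.0) for f in feats)

    scores = [score(h) for h in hypotheses]
    top = max(scores)
    # Log-sum-exp, shifted by the maximum score for numerical stability.
    log_z = top + math.log(sum(math.exp(s - top) for s in scores))
    return score(gold) - log_z


hypotheses = [["i1", "en1"], ["i1", "en2"], ["=", "en1"]]
gold = hypotheses[0]
print(log_likelihood({}, gold, hypotheses))
print(log_likelihood({"i1": 1.0, "en1": 1.0}, gold, hypotheses))
```

With all weights at zero the three hypotheses are equally likely. Raising the weights of features on the gold rule sequence increases its conditional log-likelihood, which is what training optimizes.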
Wiktionary Evaluation Setup
                      # lines/table   Train size   Test size
German nouns                8            2364         200
German verbs               27            1627         200
Spanish verbs              57            3655         200
Finnish nouns/adjs         28            6000         200
Finnish verbs              53            6849         200
The 200 most common inflection tables are excluded from the test set, since these can simply be memorized
Wiktionary Results
Average inflected form accuracy
  NAÏVE (suffixing baseline): 74.39
  FACTORED (separate predictor for each inflected form): 94.41
  THIS WORK: 94.93
  ORACLE (the best we can do with our rules): 99.81
Average whole table accuracy
  [Bar chart, axis 25 to 100; series NAÏVE, THIS WORK, ORACLE, FACTORED; bar labels 78.6, 86.1, 99.4, 40.0]
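The two metrics differ in what counts as a hit. The sketch below computes both for a toy pair of tables. It pools forms across all tables, which is an assumption: the slides do not say how the averages are taken. The function name `accuracies` and the deliberately wrong prediction are illustrative.

```python
def accuracies(gold_tables, predicted_tables):
    """Return (inflected form accuracy, whole table accuracy)."""
    form_hits = form_total = table_hits = 0
    for gold, pred in zip(gold_tables, predicted_tables):
        correct = sum(pred.get(attr) == form for attr, form in gold.items())
        form_hits += correct
        form_total += len(gold)
        # A table only counts when every one of its forms is right.
        table_hits += correct == len(gold)
    return form_hits / form_total, table_hits / len(gold_tables)


gold = [
    {"NOM,SING": "Zug", "NOM,PLU": "Züge", "GEN,SING": "Zuges", "GEN,PLU": "Züge"},
    {"NOM,SING": "Schnellzug", "NOM,PLU": "Schnellzüge",
     "GEN,SING": "Schnellzuges", "GEN,PLU": "Schnellzüge"},
]
pred = [dict(gold[0]), dict(gold[1])]
pred[1]["GEN,SING"] = "Schnellzugs"   # one toy error in the second table

form_acc, table_acc = accuracies(gold, pred)
print(form_acc, table_acc)  # 0.875 0.5
```

A single wrong form costs one eighth of the form accuracy but half of the table accuracy here, which is why whole table accuracy is the stricter measure.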
Dreyer and Eisner (2011)
Evaluate on German verbs in CELEX
Results averaged over 10 random train/test splits
Small train sets (50 or 100 observed tables)
Large test sets (5415 verbs)
Hierarchical Bayesian model of inflection
Type-level transducers of variants can be trained in a supervised fashion
Additionally incorporate unlabeled text with a token-level model
CELEX Results
[Chart: inflected form accuracy (axis 85 to 100) against training set size (50, 100, 200 example tables); series THIS WORK, DE11+CORPUS, DE11; data labels 90.5, 89.9, 90.9, 92.2, 91.5, 92.2, 93.2]
Conclusion
Morphological inflection rules can be learned from supervised data, which is widely available
Structured prediction of entire tables at once is effective for inflecting unseen base forms
Code and Wiktionary data are available at http://eecs.berkeley.edu/~gdurrett
Thank you!
Accuracy Breakdown
[Bar chart: inflected form accuracy (axis 80 to 100) for DE verbs, DE nouns, ES verbs, FI verbs, FI nouns/adjs; bar labels 96.19, 88.94, 93.41, 96.43, 99.67]