Supervised Learning of Complete Morphological Paradigms Greg - - PowerPoint PPT Presentation

supervised learning of complete morphological paradigms
SMART_READER_LITE
LIVE PREVIEW

Supervised Learning of Complete Morphological Paradigms Greg - - PowerPoint PPT Presentation

Supervised Learning of Complete Morphological Paradigms Greg Durrett and John DeNero UC Berkeley / Google Morphological Inflection train (de) Zug Morphological Inflection N OM, S ING: Zug N OM, P LU: Zge train (de) Zug G EN, S ING:


slide-1
SLIDE 1

Supervised Learning of Complete Morphological Paradigms

Greg Durrett and John DeNero

UC Berkeley / Google

slide-2
SLIDE 2

Morphological Inflection

Zug

train (de)

slide-3
SLIDE 3

Morphological Inflection

Zug

train (de)

(8 total) NOM, SING: Zug NOM, PLU: Züge GEN, SING: Zuges GEN, PLU: Züge

... ...

slide-4
SLIDE 4

Morphological Inflection

Zug

train (de)

(8 total) NOM, SING: Zug NOM, PLU: Züge GEN, SING: Zuges GEN, PLU: Züge

... ...

u NOM, SING: u NOM, PLU: ü GEN, SING: u GEN, PLU: ü

... ...

slide-5
SLIDE 5

Morphological Inflection

Zug

train (de)

(8 total) NOM, SING: Zug NOM, PLU: Züge GEN, SING: Zuges GEN, PLU: Züge

... ...

u NOM, SING: u NOM, PLU: ü GEN, SING: u GEN, PLU: ü

... ...

g NOM, SING: g NOM, PLU: ge GEN, SING: ges GEN, PLU: ge

... ...

slide-6
SLIDE 6

express train (de)

u NOM, SING: u NOM, PLU: ü GEN, SING: u GEN, PLU: ü

... ...

g NOM, SING: g NOM, PLU: ge GEN, SING: ges GEN, PLU: ge

... ...

Schnellzug NOM, SING: Schnellzug NOM, PLU: Schnellzüge GEN, SING: Schnellzuges GEN, PLU: Schnellzüge

... ...

Morphological Inflection

slide-7
SLIDE 7

valkuainen NOM, SING: valkuainen GEN, SING: valkuaisen PART, SING: valkuaista

... ... egg white (fi)

syyllinen

culprit (fi)

NOM, SING: syyllinen GEN, SING: syyllisen PART, SING: syyllistä

... ...

Morphological Inflection

slide-8
SLIDE 8

valkuainen NOM, SING: valkuainen GEN, SING: valkuaisen PART, SING: valkuaista

... ... egg white (fi)

syyllinen

culprit (fi)

NOM, SING: syyllinen GEN, SING: syyllisen PART, SING: syyllistä

... ...

Morphological Inflection

slide-9
SLIDE 9

valkuainen NOM, SING: valkuainen GEN, SING: valkuaisen PART, SING: valkuaista

... ... egg white (fi)

syyllinen

culprit (fi)

back vowels NOM, SING: syyllinen GEN, SING: syyllisen PART, SING: syyllistä

... ...

Morphological Inflection

slide-10
SLIDE 10

valkuainen NOM, SING: valkuainen GEN, SING: valkuaisen PART, SING: valkuaista

... ... egg white (fi)

syyllinen

culprit (fi)

front vowels back vowels NOM, SING: syyllinen GEN, SING: syyllisen PART, SING: syyllistä

... ...

Morphological Inflection

slide-11
SLIDE 11

valkuainen NOM, SING: valkuainen GEN, SING: valkuaisen PART, SING: valkuaista

... ... egg white (fi)

syyllinen

culprit (fi)

Suffix morphology can depend on context in a complex way front vowels back vowels NOM, SING: syyllinen GEN, SING: syyllisen PART, SING: syyllistä

... ...

Morphological Inflection

slide-12
SLIDE 12

Base form (Word) Given

Morphological Inflection

slide-13
SLIDE 13

Base form (Word) Attributes(1): Inflected form(1) Attributes(2): Inflected form(2)

... ...

Given Predict Inflection table (variants)

Morphological Inflection

slide-14
SLIDE 14

Base form (Word) Attributes(1): Inflected form(1) Attributes(2): Inflected form(2)

... ...

Given Predict Inflection table (variants) Language/POS independence

Morphological Inflection

slide-15
SLIDE 15

Base form (Word) Attributes(1): Inflected form(1) Attributes(2): Inflected form(2)

... ...

Given Predict Inflection table (variants) Handle multi-part paradigms Language/POS independence

Morphological Inflection

slide-16
SLIDE 16

Base form (Word) Attributes(1): Inflected form(1) Attributes(2): Inflected form(2)

... ...

Given Predict Inflection table (variants) Handle multi-part paradigms Language/POS independence Rich way to predict inflections

Morphological Inflection

slide-17
SLIDE 17

Outline

slide-18
SLIDE 18

Outline

Rule extraction Kluft ... ... Zug ... ... (>1000)

slide-19
SLIDE 19

Outline

Rule extraction g ... ... u ... ... Kluft ... ... Zug ... ... (>1000) (<50)

slide-20
SLIDE 20

Outline

Rule extraction Paradigm prediction

,

) (

g ... ... u ... ... Kluft ... ... Zug ... ... Zug ... ... u ... ... (>1000) (<50)

slide-21
SLIDE 21

Outline

Rule extraction Paradigm prediction

,

) (

g ... ... u ... ... Kluft ... ... Zug ... ... Zug ... ... u ... ... Base form Inflection table predictor (>1000) (<50)

slide-22
SLIDE 22

Wiktionary Data

Thousands of full inflection tables for many languages and parts of speech

slide-23
SLIDE 23

Wiktionary Data

Thousands of full inflection tables for many languages and parts of speech

http://en.wiktionary.org/wiki/Zug

slide-24
SLIDE 24

Wiktionary Data

Thousands of full inflection tables for many languages and parts of speech 2763 more...

http://en.wiktionary.org/wiki/Zug

slide-25
SLIDE 25

Previous Work

Dreyer and Eisner (2011) develops a semi- supervised Bayesian model

slide-26
SLIDE 26

Previous Work

Dreyer and Eisner (2011) develops a semi- supervised Bayesian model Wicentowski (2004) predicts one variant at a time (the lemma)

slide-27
SLIDE 27

Previous Work

Dreyer and Eisner (2011) develops a semi- supervised Bayesian model Wicentowski (2004) predicts one variant at a time (the lemma) u NOM, SING: u NOM, PLU: ü GEN, SING: u GEN, PLU: ü

... ...

slide-28
SLIDE 28

Rule Extraction

slide-29
SLIDE 29

Rule Extraction

dringen 1P

, PRES, SING: dringe

2P

, PRES, SING: dringst

3P

, PRES, SING: dringt

1P

, PAST, SING: drang

... ... to insist (de)

(27 total)

slide-30
SLIDE 30

Rule Extraction

dr i ngen 1P

, PRES, SING: dringe

2P

, PRES, SING: dringst

3P

, PRES, SING: dringt

1P

, PAST, SING: drang

... ... to insist (de)

i (27 total) 1P

, PRES, SING: i

2P

, PRES, SING: i

3P

, PRES, SING: i

1P

, PAST, SING: a

... ...

slide-31
SLIDE 31

Rule Extraction

dr i ng en 1P

, PRES, SING: dringe

2P

, PRES, SING: dringst

3P

, PRES, SING: dringt

1P

, PAST, SING: drang

... ... to insist (de)

i en (27 total) 1P

, PRES, SING: i

2P

, PRES, SING: i

3P

, PRES, SING: i

1P

, PAST, SING: a

... ...

1P

, PRES, SING: e

2P

, PRES, SING: st

3P

, PRES, SING: t

1P

, PAST, SING: -

... ...

slide-32
SLIDE 32

INFINITIVE

d r i n g e n d r i n g e

1P

, PRES, SING

Alignment to Base Form

slide-33
SLIDE 33

INFINITIVE

d r i n g e n d r i n g e

1P

, PRES, SING

Alignment to Base Form

D

slide-34
SLIDE 34

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

slide-35
SLIDE 35

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE

d r i n g e n d r i n g s t

2P

, PRES, SING

slide-36
SLIDE 36

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE

d r i n g e n d r i n g s t

2P

, PRES, SING

S S

slide-37
SLIDE 37

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

slide-38
SLIDE 38

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE

d r i n g e n d r i n g t

3P

, PRES, SING

D S

slide-39
SLIDE 39

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

slide-40
SLIDE 40

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE

d r i n g e n d r a n g

1P

, PAST, SING

D D S

slide-41
SLIDE 41

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Alignment to Base Form

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

slide-42
SLIDE 42

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

Concatenation

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

slide-43
SLIDE 43

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-44
SLIDE 44

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-45
SLIDE 45

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-46
SLIDE 46

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-47
SLIDE 47

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-48
SLIDE 48

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-49
SLIDE 49

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

slide-50
SLIDE 50

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

d r i n g e d r i n g s t d r i n g t d r a n g

slide-51
SLIDE 51

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

d r i n g e d r i n g s t d r i n g t d r a n g

slide-52
SLIDE 52

INFINITIVE d r i n g e n

d r i n g e

1P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g s t

2P

, PRES, SING

INFINITIVE d r i n g e n

d r i n g t

3P

, PRES, SING

INFINITIVE d r i n g e n

d r a n g

1P

, PAST, SING

d r i n g e n

Concatenation

d r i n g e d r i n g s t d r i n g t d r a n g

slide-53
SLIDE 53

(<50) (>1000)

Outline

Rule extraction Paradigm prediction

,

) (

g ... ... u ... ... Kluft ... ... Zug ... ... Zug ... ... u ... ... Base form Inflection table predictor

slide-54
SLIDE 54

Paradigm Prediction

w i n d e n

to wind (de)

slide-55
SLIDE 55

Paradigm Prediction

w i n d e n

i i i i a ... i1 to wind (de) en e st t

  • ...

en1

slide-56
SLIDE 56

Paradigm Prediction

w i n d e n

i i i i a ... i1 en e est et

  • ...

en2 n

  • st

t te ... n1 to wind (de) en e st t

  • ...

en1

slide-57
SLIDE 57

Paradigm Prediction

w i n d e n

i i i i a ... i1 en e est et

  • ...

en2 n

  • st

t te ... n1 i1 = = = en2 to wind (de) en e st t

  • ...

en1

slide-58
SLIDE 58

Paradigm Prediction

w i n d e n

i i i i a ... i1 en e est et

  • ...

en2 n

  • st

t te ... n1 i1 = = = = = = en2 to wind (de) en e st t

  • ...

en1

slide-59
SLIDE 59

Paradigm Prediction

w i n d e n

i i i i a ... i1 en e est et

  • ...

en2 n

  • st

t te ... n1 i1 n1 = = = = = = en2 to wind (de) en e st t

  • ...

en1

slide-60
SLIDE 60

Paradigm Prediction

w i n d e n

i i i i a ... i1 en e est et

  • ...

en2 n

  • st

t te ... n1 i1 n1 = = = = = = n1 en1 en2 to wind (de) en e st t

  • ...

en1

slide-61
SLIDE 61

Paradigm Prediction

w i n d e n

i1 n1 = = = = = = n1 en1 en2

slide-62
SLIDE 62

Paradigm Prediction

w i n d e n

i1 n1 = = = = = = n1 en1 en2

Paths through this lattice are hypotheses

slide-63
SLIDE 63

Paradigm Prediction

w i n d e n

i1 n1 = = = = = = n1 en1 en2

Model with semi-Markov CRF (Sarawagi and Cohen 2004) Paths through this lattice are hypotheses

slide-64
SLIDE 64

w i n d e n

i1

Prediction Features

slide-65
SLIDE 65

w i n d e n

i1

binden verbinden ...

Prediction Features

slide-66
SLIDE 66

w i n d e n

i1

Prediction Features

binden verbinden ...

slide-67
SLIDE 67

w i n d e n

i1

[i1] nd

Prediction Features

binden verbinden ...

slide-68
SLIDE 68

w i n d e n

i1

Rule identity conjoined with 1- through 4-grams at offsets up to +/-5 [i1] nd

Prediction Features

binden verbinden ...

slide-69
SLIDE 69

w i n d e n

i1

Rule identity conjoined with 1- through 4-grams at offsets up to +/-5 Coarse features shared between different rules [i1] nd

Prediction Features

binden verbinden ...

slide-70
SLIDE 70

Learning

The gold inflection table of every training example can be produced using our rules

slide-71
SLIDE 71

Learning

The gold inflection table of every training example can be produced using our rules Optimize conditional log-likelihood of the correct rule sequence

slide-72
SLIDE 72

Base form Inflection table predictor u ... ... (<50) (>1000)

Outline

Rule extraction Paradigm prediction

,

) (

g ... ... u ... ... Kluft ... ... Zug ... ... Zug ... ...

slide-73
SLIDE 73

Wiktionary Evaluation Setup

# lines/table Train size Test size German nouns 8 2364 200 German verbs 27 1627 200 Spanish verbs 57 3655 200 Finnish nouns/adjs 28 6000 200 Finnish verbs 53 6849 200

slide-74
SLIDE 74

Wiktionary Evaluation Setup

# lines/table Train size Test size German nouns 8 2364 200 German verbs 27 1627 200 Spanish verbs 57 3655 200 Finnish nouns/adjs 28 6000 200 Finnish verbs 53 6849 200

Most common 200 inflection tables are not chosen for the test set; we can memorize these

slide-75
SLIDE 75

25 50 75 100

74.39

Wiktionary Results

Suffixing baseline

Average inflected form accuracy

NAÏVE THIS WORK ORACLE FACTORED

slide-76
SLIDE 76

25 50 75 100

94.93 74.39

Average inflected form accuracy

Wiktionary Results

NAÏVE THIS WORK ORACLE FACTORED

slide-77
SLIDE 77

25 50 75 100

94.93 74.39 99.81 The best we can do with

  • ur rules

Average inflected form accuracy

Wiktionary Results

NAÏVE THIS WORK ORACLE FACTORED

slide-78
SLIDE 78

25 50 75 100

94.93 74.39

NAÏVE

99.81 94.41 Separate predictor for each inflected form

Average inflected form accuracy

Wiktionary Results

THIS WORK ORACLE FACTORED

slide-79
SLIDE 79

25 50 75 100

78.6 86.1 99.4 40.0

Average whole table accuracy

Wiktionary Results

NAÏVE THIS WORK ORACLE FACTORED

slide-80
SLIDE 80

Dreyer and Eisner (2011)

slide-81
SLIDE 81

Dreyer and Eisner (2011)

Evaluate on German verbs in CELEX Results averaged over 10 random train/test splits Small train sets (50 or 100 observed tables) Large test sets (5415 verbs)

slide-82
SLIDE 82

Dreyer and Eisner (2011)

Evaluate on German verbs in CELEX Results averaged over 10 random train/test splits Small train sets (50 or 100 observed tables) Large test sets (5415 verbs) Hierarchical Bayesian model of inflection Type-level transducers of variants can be trained in a supervised fashion Additionally incorporate unlabeled text with a token-level model

slide-83
SLIDE 83

CELEX Results

85 90 95 100 50 100 200

Training set size (example tables) I n fl e c t e d f

  • r

m a c c u r a c y

90.5 89.9 90.9 92.2 91.5 92.2

THIS WORK DE11+CORPUS DE11

slide-84
SLIDE 84

CELEX Results

85 90 95 100 50 100 200

I n fl e c t e d f

  • r

m a c c u r a c y

90.5 89.9 90.9 92.2 91.5 92.2

THIS WORK

93.2

Training set size (example tables) DE11+CORPUS DE11

slide-85
SLIDE 85

Conclusion

Morphological inflection rules can be learned from supervised data, which is widely available

slide-86
SLIDE 86

Conclusion

Morphological inflection rules can be learned from supervised data, which is widely available Structured prediction of entire tables at once is effective for inflecting unseen base forms

slide-87
SLIDE 87

Conclusion

Morphological inflection rules can be learned from supervised data, which is widely available Structured prediction of entire tables at once is effective for inflecting unseen base forms Code and Wiktionary data is available at http://eecs.berkeley.edu/~gdurrett

slide-88
SLIDE 88

Conclusion

Morphological inflection rules can be learned from supervised data, which is widely available

Thank you!

Structured prediction of entire tables at once is effective for inflecting unseen base forms Code and Wiktionary data is available at http://eecs.berkeley.edu/~gdurrett

slide-89
SLIDE 89

Accuracy Breakdown

80 85 90 95 100 DE verbs DE nouns ES verbs FI verbs FI nouns/adjs

Inflected form accuracy

96.19 88.94 93.41 96.43 99.67

THIS WORK