A Multi-purpose Bayesian Model for Word-Based Morphology (PowerPoint presentation)



SLIDE 1

Introduction The Model Experiments Conclusion

A Multi-purpose Bayesian Model for Word-Based Morphology

Maciej Janicki

University of Leipzig

September 17, 2015

Maciej Janicki A Multi-purpose Bayesian Model for Word-Based Morphology

SLIDE 2

Morphology in NLP

wahrscheinlichster
wahr-schein-lich-st-er
wahr⟨ADJ⟩-schein⟨NN⟩-lich⟨SUFF ADJ⟩-st⟨SUP⟩-er⟨M.SG.NOM⟩

SLIDE 3

Morphology in NLP

wahrscheinlichster
wahr-schein-lich-st-er
wahr⟨ADJ⟩-schein⟨NN⟩-lich⟨SUFF ADJ⟩-st⟨SUP⟩-er⟨M.SG.NOM⟩

provided: morpheme segmentation (with or without tags)
needed:
  • is a given string a valid word?
  • lemma, possible tags (PoS, inflectional)
  • other word features

SLIDE 4

Whole Word Morphology

    phon: /kæt/
    synt: N, sg
    sem:

SLIDE 5

Whole Word Morphology

    phon: /kæt/        phon: /kæts/
    synt: N, sg        synt: N, pl
    sem:               sem:

SLIDE 6

Whole Word Morphology

    phon: /X/            phon: /Xs/
    synt: N, sg   ←→     synt: N, pl
    sem: ♠               sem: many ♠

  • concentrates on relations between words
  • no “absolute structure/analysis”
  • not decomposable
  • allows for non-concatenative operations
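The schema above can be read as a whole-word rewrite rule /X/ → /Xs/, mapping one complete word onto another. A minimal sketch of such rules in Python (my own illustration, not the author's implementation; it handles a single variable X, so it covers the suffixation examples here but not every non-concatenative pattern):

```python
import re

def make_rule(lhs: str, rhs: str):
    """Compile a whole-word rule such as X -> Xs or Xe -> Xen.

    'X' stands for an arbitrary non-empty string; the rule maps the
    whole source word onto the whole target word.
    """
    pattern = re.compile("^" + lhs.replace("X", "(.+)") + "$")

    def apply(word):
        m = pattern.match(word)
        if m is None:
            return None          # rule not applicable to this word
        return rhs.replace("X", m.group(1))

    return apply

plural = make_rule("X", "Xs")    # /X/ -> /Xs/
print(plural("cat"))             # cats
```

Because the rule sees the whole word rather than a segmentation, the same machinery covers stem-internal changes (e.g. a rule written over a pattern like XeY), which is the point of the word-based view.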

SLIDE 7

Unigram distribution

Let u(w) be the unigram-based probability of the word w. Then

    Pr(L) = Pr(|L|) · |L|! · ∏_{w ∈ L} u(w)

(each word drawn independently from the unigram distribution)

    ...
    sprache    2.17 · 10^−11
    sprachen   1.88 · 10^−12
    ...
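As a sketch, the log of this product can be computed directly. The two unigram probabilities are the ones shown above; the size prior Pr(|L|) is left as a caller-supplied function, and the uniform choice below is my own illustrative stand-in:

```python
import math

# Toy unigram probabilities (the two values from the slide).
u = {"sprache": 2.17e-11, "sprachen": 1.88e-12}

def log_pr_lexicon(lexicon, log_pr_size):
    """log Pr(L) = log Pr(|L|) + log |L|! + sum_{w in L} log u(w)."""
    n = len(lexicon)
    return (log_pr_size(n)
            + math.lgamma(n + 1)                    # log |L|!
            + sum(math.log(u[w]) for w in lexicon))

# e.g. with a dummy uniform prior over lexicon sizes 1..100:
score = log_pr_lexicon(["sprache", "sprachen"], lambda n: -math.log(100))
```

Working in log space avoids underflow: the raw product of many 10^−11-sized factors would round to zero in floating point.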

SLIDE 8

Introducing rules

Let a morphological rule r : /Xe/ → /Xen/ be known. r applies from left to right with probability π_r = 0.53 (its productivity).

    ...
    sprache      2.17 · 10^−11
      → sprachen   0.53
    ...

(sprachen derived by r)

SLIDE 9

Introducing rules

If instead sprachen is not derived by r, then sprache pays the factor (1 − π_r) = 0.47 for not applying an applicable rule, and sprachen is drawn independently from the unigram distribution:

    ...
    sprache    2.17 · 10^−11 · 0.47
    sprachen   1.88 · 10^−12
    ...

(sprachen not derived by r)
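The two alternatives can be compared numerically. This is a toy calculation with the slide's numbers, not code from the paper:

```python
def pair_probability(u_base, u_derived, pi_r, derived_by_rule):
    """Joint contribution of a base word and a candidate derived word.

    If the derivation edge is present, the derived word contributes the
    rule's productivity pi_r instead of its unigram probability; if it
    is absent, the base word pays (1 - pi_r) for not applying an
    applicable rule, and the derived word is drawn independently.
    """
    if derived_by_rule:
        return u_base * pi_r
    return u_base * (1 - pi_r) * u_derived

p_with = pair_probability(2.17e-11, 1.88e-12, 0.53, True)
p_without = pair_probability(2.17e-11, 1.88e-12, 0.53, False)
assert p_with > p_without   # deriving 'sprachen' is far more probable
```

This asymmetry is what drives learning: explaining a word through a productive rule is many orders of magnitude cheaper than drawing it from scratch.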

SLIDE 10

Lexicon as directed graph

[Figure: the lexicon as a directed graph over the words machen, machst, macht, machte, machtest, machbar, machbaren, machbare]
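A minimal adjacency-list sketch of such a lexicon graph. The edge set and rule labels below are my own guess from the word forms, since the slide shows only the nodes:

```python
# Nodes are words; each edge (parent -> child) records the rule used.
# This particular derivation tree is a plausible guess, not given data.
lexicon = {
    "machen":  [("machst", "/Xen/ -> /Xst/"),
                ("macht", "/Xen/ -> /Xt/"),
                ("machbar", "/Xen/ -> /Xbar/")],
    "macht":   [("machte", "/Xt/ -> /Xte/")],
    "machte":  [("machtest", "/Xte/ -> /Xtest/")],
    "machbar": [("machbare", "/X/ -> /Xe/"),
                ("machbaren", "/X/ -> /Xen/")],
}

def roots(graph):
    """Words that are not derived from any other word."""
    children = {c for edges in graph.values() for c, _ in edges}
    return [w for w in graph if w not in children]

print(roots(lexicon))  # ['machen']
```

Each word has at most one incoming edge, so the graph is a forest: every word is either a root (drawn from the unigram distribution) or derived from exactly one base.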

SLIDE 11

Learning

Model components:
  • L – lexicon (graph)
  • R – set of rules with their productivities

Defined: P(L|R) and P(R). Find:

    R̂ = arg max_R P(R|L)
       = arg max_R P(L|R) P(R) / P(L)
       = arg max_R P(L|R) P(R)

(P(L) does not depend on R, so it can be dropped from the arg max.)
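In practice this means candidate rule sets are compared by the unnormalized log-posterior log P(L|R) + log P(R). A hypothetical scorer (the candidate names and scores below are made up for illustration):

```python
def map_objective(log_likelihood, log_prior):
    """MAP score log P(L|R) + log P(R) for one candidate rule set R.

    The evidence P(L) is constant in R, so it never needs computing.
    """
    return log_likelihood + log_prior

def best_rule_set(candidates):
    """candidates: iterable of (R, log P(L|R), log P(R)) triples."""
    return max(candidates, key=lambda t: map_objective(t[1], t[2]))[0]

# Toy example: R2 fits the data better but has a worse prior.
R_hat = best_rule_set([("R1", -120.0, -10.0), ("R2", -118.0, -15.0)])
```

The prior term P(R) is what keeps the model from memorizing one rule per word pair: a larger rule set fits better but costs more under the prior.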

SLIDE 12

Learning (cont.)

Supervised learning: given L, find R.
  • extract rules from pairs of related words
  • ML estimation of rule productivities

SLIDE 13

Learning (cont.)

Unsupervised learning: given V(L) (the words), find E(L) (the edges) and R.

First, find all reasonable edges:
  • find pairs of string-similar words
  • extract rules
  • choose the 10k most frequent rules
  • create a “full” graph of all possible edges

SLIDE 14

Learning (cont.)

Then, alternate ML estimation of E(L) and R (“hard EM”):
  • “guess” an initial R
  • repeat until convergence:
      – find the best E(L) given V(L) and R (optimal branching)
      – find the best R given V(L) and E(L) (ML estimation)
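The alternation can be sketched as follows. Picking each word's best parent independently stands in for the optimal-branching step; the real algorithm operates on the whole graph (e.g. via Chu-Liu/Edmonds), and the helper functions here are placeholders for the model's actual scoring and estimation:

```python
def hard_em(words, initial_rules, candidate_parents, estimate_rules,
            max_iter=10):
    """Alternating ('hard EM') optimization sketch.

    candidate_parents(word, rules) -> iterable of (parent, score);
    estimate_rules(edges) -> new rule set with ML productivities.
    """
    rules = initial_rules
    edges = {}
    for _ in range(max_iter):
        # Branching-like step: best derivation per word under current R.
        new_edges = {}
        for w in words:
            options = list(candidate_parents(w, rules))
            if options:
                new_edges[w] = max(options, key=lambda p: p[1])[0]
        # Estimation step: re-fit rule productivities to chosen edges.
        rules = estimate_rules(new_edges)
        if new_edges == edges:
            break                 # edge assignment stabilized
        edges = new_edges
    return edges, rules
```

Because each step can only increase the objective and the edge assignment is discrete, the loop terminates once the branching stops changing.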

SLIDE 15

Lexicon expansion: task definition

  • unsupervised training on 50k-word lists (German, Polish)
  • generate new words in order of increasing cost
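Generating words in order of increasing cost suggests a best-first search over rule applications. A heap-based sketch; the rule/cost representation is my own illustration (a natural cost would be the negative log of a rule's productivity):

```python
import heapq

def expand_lexicon(known_words, rules, n_new):
    """Generate up to n_new unseen words in order of increasing cost.

    rules: list of (apply_fn, cost) pairs, where apply_fn maps a word
    to a derived word or None if the rule is not applicable.
    """
    heap = [(0.0, w) for w in known_words]   # known words cost nothing
    heapq.heapify(heap)
    seen = set(known_words)
    generated = []
    while heap and len(generated) < n_new:
        cost, word = heapq.heappop(heap)
        if word not in known_words:
            generated.append(word)           # emitted in cost order
        for apply_rule, rule_cost in rules:
            new = apply_rule(word)
            if new is not None and new not in seen:
                seen.add(new)
                heapq.heappush(heap, (cost + rule_cost, new))
    return generated
```

The priority queue guarantees that cheap (productive-rule) derivations surface before long chains of unlikely ones, which is what makes precision-at-k curves like the next slide's meaningful.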

SLIDE 16

Lexicon expansion: results

[Figure: precision (%) of generated words vs. number of words generated (up to 1 · 10^5), for Polish and German]

SLIDE 17

Lemmatization and Tagging: task definition

Given a word, determine its lemma and PoS/inflectional tag.

Training data:
  • supervised: word–lemma pairs
  • unsupervised: a set of words and a set of lemmas (without alignment)

Variants:
  • +/− Lem: are the lemmas of all unknown words included in the training data?
  • +/− Tags: is the tag of the target word given?

Baselines:
  • unsupervised: alignment based on least edit distance
  • supervised: Maximum Entropy classifier based on letter N-grams
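The unsupervised baseline can be sketched as a nearest-lemma lookup under Levenshtein distance (my own minimal implementation, not the paper's code):

```python
def edit_distance(a, b):
    """Plain Levenshtein distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                  # deletion
                           cur[j - 1] + 1,               # insertion
                           prev[j - 1] + (ca != cb)))    # substitution
        prev = cur
    return prev[-1]

def lemmatize_by_edit_distance(word, lemmas):
    """Baseline: pick the closest lemma by edit distance."""
    return min(lemmas, key=lambda l: edit_distance(word, l))

print(lemmatize_by_edit_distance("sprachen", ["sprache", "machen"]))
# sprache
```

This baseline has no notion of rules or productivity, which is why the model's graph-based alignment can beat it, especially when tags or lemma lists are withheld.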

SLIDE 18

Lemmatization and Tagging: results

Unsupervised:

    Language   Data        Results                     Baseline
               Lem  Tags   Lem    Tags   Lem+Tags     Lem    Tags   Lem+Tags
    German      +    +     93%    100%     93%         84%    –       –
    German      +    –     80%     46%     45%         76%    –       –
    German      –    +     76%    100%     76%         44%    –       –
    German      –    –     61%     34%     28%         43%    –       –
    Polish      +    +     84%    100%     84%         80%    –       –
    Polish      +    –     80%     61%     59%         67%    –       –
    Polish      –    +     80%    100%     80%         41%    –       –
    Polish      –    –     79%     61%     55%         40%    –       –

Supervised:

    Language   Data        Results                     Baseline
               Lem  Tags   Lem    Tags   Lem+Tags     Lem    Tags   Lem+Tags
    German      +    +     97%    100%     97%         89%    97%     89%
    German      +    –     92%     38%     38%         19%    20%     19%
    German      –    +     90%    100%     90%         89%    97%     89%
    German      –    –     57%     20%     19%         19%    20%     19%
    Polish      +    +     94%    100%     94%         83%    94%     83%
    Polish      +    –     93%     56%     56%         33%    36%     33%
    Polish      –    +     88%    100%     88%         83%    94%     83%
    Polish      –    –     68%     40%     38%         33%    36%     33%

SLIDE 19

Inflection: results

Task definition: given a lemma and tag, output the correct inflected form.
Baseline: Maximum Entropy classifier based on letter N-grams.

Results:

    Language   Result   Baseline
    German      84%       83%
    Polish      86%       84%

SLIDE 20

Conclusion

  • focus on relations between words, rather than segmentation
  • non-concatenative morphology included
  • many training possibilities: unsupervised, supervised, manual editing
  • one model for multiple tasks
