POS Tagging with HMMs (L645 / B659, Dept. of Linguistics, Indiana University, Fall 2015)



SLIDE 1

POS Tagging

L645 / B659

Dept. of Linguistics, Indiana University

Fall 2015


SLIDE 2

Def. Part of Speech Tagging

POS Tagging = Assigning word class information to words

Example:

    the          man    bought  a           book
    determiner   noun   verb    determiner  noun
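For concreteness, here is a minimal sketch of automatic tagging with NLTK (the toolkit is my choice for illustration; the slides do not prescribe one). NLTK's default tagger uses Penn Treebank tags, so its output labels differ from the plain word-class names above.

```python
# Minimal sketch: POS tagging the example sentence with NLTK's default tagger.
# Assumes NLTK is installed and its tokenizer/tagger models have been downloaded,
# e.g. via nltk.download('punkt') and nltk.download('averaged_perceptron_tagger').
import nltk

tokens = nltk.word_tokenize("the man bought a book")
print(nltk.pos_tag(tokens))
# Expected: [('the', 'DT'), ('man', 'NN'), ('bought', 'VBD'), ('a', 'DT'), ('book', 'NN')]
```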


SLIDE 3

Linguistic Questions

◮ How do we divide the text into individual word tokens?
◮ How do we choose a tagset to represent all words?
◮ How do we select appropriate tags for individual words?


SLIDE 4

Tagsets

Size of tagsets

◮ English:

    TOSCA                 32
    Penn Treebank         36
    BNC C5                61
    Brown                 77
    LOB                  132
    London-Lund Corpus   197
    TOSCA-ICE            270

◮ Romanian: 614
◮ Hungarian: ca. 2,100


SLIDE 5

Penn Treebank Tagset

    CC    Coord. conjunction           RB    Adverb
    CD    Cardinal number              RBR   Adverb, comparative
    DT    Determiner                   RBS   Adverb, superlative
    EX    Existential there            RP    Particle
    FW    Foreign word                 SYM   Symbol
    IN    Prep. / subord. conj.        TO    to
    JJ    Adjective                    UH    Interjection
    JJR   Adjective, comparative       VB    Verb, base form
    JJS   Adjective, superlative       VBD   Verb, past tense
    LS    List item marker             VBG   Verb, gerund / present part.
    MD    Modal                        VBN   Verb, past part.
    NN    Noun, singular or mass       VBP   Verb, non-3rd p., sing. pres.
    NNS   Noun, plural                 VBZ   Verb, 3rd p. sing. pres.
    NP    Proper noun, singular        WDT   Wh-determiner
    NPS   Proper noun, plural          WP    Wh-pronoun
    PDT   Predeterminer                WP$   Possessive wh-pronoun
    POS   Possessive ending            WRB   Wh-adverb
    PRP   Personal pronoun             ,     Comma
    PRP$  Possessive pronoun           .     Sentence-final punctuation


SLIDE 6

Annotating POS Tags

Two fundamentally different approaches (a minimal sketch of both follows below):

◮ Start from scratch: find characteristics of words or their context (i.e., rules) that indicate the word class
  ◮ e.g., if a word ends in "ion", tag it as a noun
◮ Accumulate a lexicon and disambiguate words that have more than one tag
  ◮ e.g., possible categories for "about": preposition, adverb, particle
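A minimal sketch of both ideas (the rule and the lexicon entries here are made up for illustration, not taken from the slides):

```python
# Approach 1: rule-based, derive a tag from characteristics of the word itself.
def rule_based_tag(word):
    if word.endswith("ion"):      # e.g. "decision" -> noun
        return "NN"
    return "UNKNOWN"

# Approach 2: lexicon-based, look up the ambiguity class and disambiguate later.
LEXICON = {"about": ["IN", "RB", "RP"]}   # preposition, adverb, particle

def ambiguity_class(word):
    return LEXICON.get(word, ["UNKNOWN"])

print(rule_based_tag("decision"))   # NN
print(ambiguity_class("about"))     # ['IN', 'RB', 'RP']
```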


SLIDE 7

Automatic POS Tagging

Assumption: local context is sufficient

Examples:

◮ for the man: noun or verb?
◮ we will man: noun or verb?
◮ I can put: verb base form or past?
◮ re-cap real quick: adjective or adverb?


SLIDE 8

Bigram Tagging

◮ Basic assumption: a word's POS tag depends only on the word itself and on the POS tag of the previous word
◮ Use a lexicon to retrieve the ambiguity class for each word
  ◮ e.g., word: beginning, ambiguity class: [JJ, NN, VBG]
  ◮ For unknown words: use heuristics, e.g. all open-class POS tags
◮ Disambiguation: look for the most likely path through the possibilities (see the sketch below)
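A sketch of how the search space could be set up (the toy lexicon and open-class list are assumptions for illustration): each word contributes its ambiguity class, and unknown words fall back to all open-class tags.

```python
# Sketch: building the tag possibilities that bigram tagging must disambiguate among.
TOY_LEXICON = {
    "the": ["DT"],
    "beginning": ["JJ", "NN", "VBG"],
    "flies": ["NNS", "VBZ"],
}
OPEN_CLASS = ["NN", "NNS", "VB", "VBD", "JJ", "RB"]   # heuristic fallback for unknown words

def possibilities(tokens):
    return [TOY_LEXICON.get(tok, OPEN_CLASS) for tok in tokens]

print(possibilities(["the", "beginning", "frobnicated"]))
# [['DT'], ['JJ', 'NN', 'VBG'], ['NN', 'NNS', 'VB', 'VBD', 'JJ', 'RB']]
```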


SLIDE 9

Bigram Tagging – Example

Candidate tags per word (S = start state, E = end state):

           time   flies   like   an    arrow
    S  ->  NN     VBZ     IN     DT    NN     ->  E
           VB     NNS     VB
           JJ             RB


SLIDE 10

Bigram Tagging – Probabilities

P(t1 … t5) = P(t1 | S) · P(w1 | t1) · P(t2 | t1) · P(w2 | t2) · … · P(t5 | t4) · P(w5 | t5) · P(E | t5)

(Note: this is actually P(t1 … t5 | w1 … w5).)

Transition probabilities: the P(ti | ti−1) factors (shown in green on the original slide)
Lexical probabilities: the P(wi | ti) factors (shown in blue on the original slide)


SLIDE 11

Bigram Tagging – Probability Table

Probabilities (in %, i.e., on a 0 to 100 scale):

    Lexical                        Transition                   Transition
    P(time | NN)   =  7.0727       P(NN | S)   =  0.6823        P(IN | NNS) = 21.8302
    P(time | VB)   =  0.0005       P(VB | S)   =  0.5294        P(VB | VBZ) =  0.7002
    P(time | JJ)   =               P(JJ | S)   =  0.8033        P(VB | NNS) = 11.1406
    P(flies | VBZ) =  0.4754       P(VBZ | NN) =  3.9005        P(RB | VBZ) = 15.0350
    P(flies | NNS) =  0.1610       P(VBZ | VB) =  0.0566        P(RB | NNS) =  6.4721
    P(like | IN)   =  2.6512       P(VBZ | JJ) =  2.0934        P(DT | IN)  = 31.4263
    P(like | VB)   =  2.8413       P(NNS | NN) =  1.6076        P(DT | VB)  = 15.2649
    P(like | RB)   =  0.5086       P(NNS | VB) =  0.6566        P(DT | RB)  =  5.3113
    P(an | DT)     =  1.4192       P(NNS | JJ) =  2.4383        P(NN | DT)  = 38.0170
    P(arrow | NN)  =  0.0215       P(IN | VBZ) =  8.5862        P(E | NN)   =  0.2069
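Using these numbers, the score of one tag path can be multiplied out directly. The sketch below scores the path time/NN flies/VBZ like/IN an/DT arrow/NN from the example (values divided by 100 to convert from percent):

```python
# Scoring a single tag path with the table above (a sketch; only this one path is computed).
from math import prod

factors_percent = [
    0.6823,    # P(NN | S)
    7.0727,    # P(time | NN)
    3.9005,    # P(VBZ | NN)
    0.4754,    # P(flies | VBZ)
    8.5862,    # P(IN | VBZ)
    2.6512,    # P(like | IN)
    31.4263,   # P(DT | IN)
    1.4192,    # P(an | DT)
    38.0170,   # P(NN | DT)
    0.0215,    # P(arrow | NN)
    0.2069,    # P(E | NN)
]

score = prod(p / 100 for p in factors_percent)
print(f"{score:.3e}")   # roughly 1.5e-19
```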


SLIDE 12

Bigram Tagging – Counter-Examples

◮ start before
  ◮ start before the course or start before he is done
◮ real quick
  ◮ re-cap real quick or a real quick lunch
◮ barely changed
  ◮ he was barely changed or he barely changed his contents
◮ that beginning
  ◮ that beginning part or that beginning frightened the students or with that beginning early, he was forced ...


SLIDE 13

Maximum Likelihood Estimation

Simplest way to calculate such probabilities from a corpus:

    PMLE(tn | tn−1) = C(tn−1 tn) / C(tn−1)

    PMLE(wn | tn) = C(wn tn) / C(tn)

◮ Uses relative frequency
◮ Maximizes the probability of the corpus
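A minimal sketch of these counts on a toy tagged corpus (the corpus here is made up; any list of (word, tag) pairs would do):

```python
from collections import Counter

tagged = [("the", "DT"), ("man", "NN"), ("bought", "VBD"), ("a", "DT"), ("book", "NN")]
tags = [t for _, t in tagged]

tag_count      = Counter(tags)                 # C(t)
word_tag_count = Counter(tagged)               # C(w t)
bigram_count   = Counter(zip(tags, tags[1:]))  # C(t_{n-1} t_n)

def p_mle_transition(tag, prev_tag):
    """P_MLE(tag | prev_tag) = C(prev_tag tag) / C(prev_tag)."""
    return bigram_count[(prev_tag, tag)] / tag_count[prev_tag]

def p_mle_lexical(word, tag):
    """P_MLE(word | tag) = C(word tag) / C(tag)."""
    return word_tag_count[(word, tag)] / tag_count[tag]

print(p_mle_transition("NN", "DT"))   # 1.0: every DT in the toy corpus is followed by NN
print(p_mle_lexical("book", "NN"))    # 0.5: one of the two NN tokens is "book"
```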


SLIDE 14

Maximum Likelihood Estimation (2)

◮ Not a great estimator: zero probabilities for unseen events make them impossible
◮ Need a smoothing or discounting method to give minimal probabilities to unseen events
◮ Simplest possibility: learn from hapax legomena (words that appear only once), as sketched below
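One way to read the hapax idea (my own minimal sketch, not necessarily the exact method intended in the course): words seen only once in training behave most like unseen words, so the tag distribution over hapax legomena can stand in for the lexical probabilities of unknown words.

```python
# Sketch: estimate how tags distribute over hapax legomena (words occurring exactly once).
from collections import Counter

def hapax_tag_distribution(tagged_corpus):
    """tagged_corpus: iterable of (word, tag) pairs.
    Returns P(tag | hapax word), usable as a stand-in distribution for unknown words."""
    pairs = list(tagged_corpus)
    word_freq = Counter(word for word, _ in pairs)
    hapax_tags = Counter(tag for word, tag in pairs if word_freq[word] == 1)
    total = sum(hapax_tags.values())
    return {tag: count / total for tag, count in hapax_tags.items()}
```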


SLIDE 15

Motivating Hidden Markov Models

Thinking back to Markov models: we are now given a sequence of words and want to find the POS tags

◮ The underlying sequence of POS tags can be thought of as generating the words in the sentence
◮ Each state in the Markov model can be a POS tag
◮ We don't know the correct state sequence, hence a Hidden Markov Model (HMM)

This requires an additional emission matrix, linking words with POS tags (cf. P(arrow|NN))


SLIDE 16

Example HMM

Assume DET, N, and VB as hidden states, with this transition matrix (A):

            DET     N      VB
    DET    0.01   0.89   0.10
    N      0.30   0.20   0.50
    VB     0.67   0.23   0.10

... emission matrix (B):

           dogs   bit    the    chased   a      these   cats   ...
    DET    0.0    0.0    0.33   0.0      0.33   0.33    0.0    ...
    N      0.2    0.1    0.0    0.0      0.0    0.0     0.15   ...
    VB     0.1    0.6    0.0    0.3      0.0    0.0     0.0    ...

... and initial probability matrix (π):

    DET   0.7
    N     0.2
    VB    0.1
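The same model written out as arrays (a sketch; because the "..." columns of B are omitted here, its rows do not all sum to exactly 1):

```python
import numpy as np

states = ["DET", "N", "VB"]
vocab  = ["dogs", "bit", "the", "chased", "a", "these", "cats"]

A = np.array([            # transition matrix: rows = from-state, columns = to-state
    [0.01, 0.89, 0.10],   # DET
    [0.30, 0.20, 0.50],   # N
    [0.67, 0.23, 0.10],   # VB
])

B = np.array([            # emission matrix: rows = state, columns = word
    [0.0, 0.0, 0.33, 0.0, 0.33, 0.33, 0.0],   # DET
    [0.2, 0.1, 0.0,  0.0, 0.0,  0.0,  0.15],  # N
    [0.1, 0.6, 0.0,  0.3, 0.0,  0.0,  0.0],   # VB
])

pi = np.array([0.7, 0.2, 0.1])   # initial state probabilities for DET, N, VB
```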


SLIDE 17

Using Example HMM

In order to generate words, we:

  1. Choose a tag/state from π
  2. Choose an emitted word from the relevant row of B
  3. Choose a transition from the relevant row of A
  4. Repeat #2 & #3 until we hit a stopping point (a code sketch of this loop follows below)

◮ keeping track of probabilities as we go along

We could generate all possibilities this way and find the most probable sequence

◮ Want a more efficient way of finding the most probable sequence
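A sketch of the generation loop described above, using the toy A, B, and π (redefined here so the snippet stands alone). Since this toy model has no end state, generation simply stops after a fixed number of words:

```python
import numpy as np

states = ["DET", "N", "VB"]
vocab  = ["dogs", "bit", "the", "chased", "a", "these", "cats"]
A  = np.array([[0.01, 0.89, 0.10], [0.30, 0.20, 0.50], [0.67, 0.23, 0.10]])
B  = np.array([[0.0, 0.0, 0.33, 0.0, 0.33, 0.33, 0.0],
               [0.2, 0.1, 0.0,  0.0, 0.0,  0.0,  0.15],
               [0.1, 0.6, 0.0,  0.3, 0.0,  0.0,  0.0]])
pi = np.array([0.7, 0.2, 0.1])

rng = np.random.default_rng(0)

def generate(length=5):
    words, prob = [], 1.0
    s = rng.choice(len(states), p=pi)                    # 1. choose initial state from pi
    prob *= pi[s]
    for _ in range(length):
        w = rng.choice(len(vocab), p=B[s] / B[s].sum())  # 2. emit a word from row s of B
        prob *= B[s, w]                                  #    (row renormalized: "..." columns dropped)
        words.append(vocab[w])
        s_next = rng.choice(len(states), p=A[s])         # 3. choose a transition from row s of A
        prob *= A[s, s_next]
        s = s_next                                       # 4. repeat, tracking the running probability
    return words, prob

print(generate())
```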
