Sequence Labeling Prof. Sameer Singh CS 295: STATISTICAL NLP

Sequence Labeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 31, 2017 Based on slides from Nathan Schneider, Noah Smith, Yejin Choi, and everyone else they copied from.

Outline
• Sequence Labeling and POS Tagging
• Generative Modeling: HMMs
• Inference in HMMs: Viterbi and Forward-Backward (F/B)
• Unsupervised Tagging using EM
CS 295: STATISTICAL NLP (WINTER 2017)

Classification
• Sentiment Analysis
• Identify Topic
• Language Model

Sequence Labeling

Parts of Speech
This  is  a    simple  sentence  .
DET   VB  DET  ADJ     NOUN      .
Applications:
• Text to speech: record, lead, …
• Machine translation: run, walk, …
• Noun phrases: `grep {JJ | NN}* {NN | NNS}`
• and many others…
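The noun-phrase pattern above can be run directly over a tag sequence. The following is a minimal sketch, assuming Penn Treebank-style tags (JJ, NN, NNS) as in the pattern; the function name `noun_phrases`, the one-character tag encoding, and the example sentence are illustrative assumptions, not taken from the slides.

```python
import re

# One character per tag so that regex offsets line up with token positions.
TAG_CODES = {"JJ": "J", "NN": "N", "NNS": "S"}
NP_PATTERN = re.compile(r"[JN]*[NS]")  # {JJ | NN}* {NN | NNS}

def noun_phrases(tagged):
    """Return the word spans whose tags match {JJ | NN}* {NN | NNS}."""
    # Any tag outside the pattern becomes "x" and can never be matched.
    codes = "".join(TAG_CODES.get(tag, "x") for _, tag in tagged)
    return [
        " ".join(word for word, _ in tagged[m.start():m.end()])
        for m in NP_PATTERN.finditer(codes)
    ]

# Hypothetical tagged sentence, used only to exercise the pattern.
sentence = [("the", "DT"), ("quick", "JJ"), ("brown", "JJ"), ("fox", "NN"),
            ("jumps", "VBZ"), ("over", "IN"), ("lazy", "JJ"), ("dogs", "NNS")]
print(noun_phrases(sentence))
```

Encoding each tag as a single character keeps the regular expression close to the slide's notation while letting match offsets double as token indices.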

Parts of Speech: Tags
“Open classes”
• Nouns, verbs, adjectives, adverbs, numbers
“Closed classes”
• Modal verbs
• Prepositions (on, to)
• Particles (off, up)
• Determiners (the, some)
• Pronouns (she, they)
• Conjunctions (and, or)

Named Entity Recognition
Barack  Obama  spoke  from  the  White  House  today  .
PER     PER    O      O     O    LOC    LOC    O      O

Field Segmentation: Ads
3BR   flat  in  Bruntsfield  ,  near  main  roads  .  Bright  ,  well  maintained  ...
SIZE  TYPE  O   LOC          O  LOC   LOC   LOC    O  FEAT    O  FEAT  FEAT        ...

Field Segmentation: Citations Authors Title Publication Venue

Naïve Bayes Classifier

“Transitions” matter
“Impossible” Transitions
• Two determiners never follow each other
• Two base form verbs never follow each other
• Determiner is followed by adjective or noun
Based on semantics
• Fruit flies like a bird.
• Fruit flies like bananas.
How do we select a “consistent” set of POS tags?

“Transitions” matter

“Transitions” matter
Transition on Words versus Tags
• Too many words, learn the same thing again
• Support for unseen words: “I like tenguizino!”

Hidden Markov Models
[Figure: HMM state diagram from start state S to end state E]

Example Sentence
   This  is  a    simple  sentence
S  DET   VB  DET  ADJ     NOUN      E
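The probability an HMM assigns to this tagged sentence can be sketched in code. This assumes the standard first-order factorization, P(words, tags) = product over positions of P(tag_i | tag_{i-1}) · P(word_i | tag_i), times a final transition into E. The probability values and the names `TRANSITION`, `EMISSION`, and `log_joint` are invented for illustration and do not come from the slides.

```python
import math

# Toy parameters, invented for illustration only.
TRANSITION = {("S", "DET"): 0.5, ("DET", "VB"): 0.2, ("VB", "DET"): 0.4,
              ("DET", "ADJ"): 0.3, ("ADJ", "NOUN"): 0.6, ("NOUN", "E"): 0.5}
EMISSION = {("DET", "this"): 0.1, ("VB", "is"): 0.2, ("DET", "a"): 0.3,
            ("ADJ", "simple"): 0.05, ("NOUN", "sentence"): 0.01}

def log_joint(words, tags):
    """log P(words, tags) under a first-order HMM with start S and end E."""
    total, prev = 0.0, "S"
    for word, tag in zip(words, tags):
        # One transition factor and one emission factor per position.
        total += math.log(TRANSITION[prev, tag]) + math.log(EMISSION[tag, word])
        prev = tag
    # Close the sequence with the transition into the end state.
    return total + math.log(TRANSITION[prev, "E"])

words = ["this", "is", "a", "simple", "sentence"]
tags = ["DET", "VB", "DET", "ADJ", "NOUN"]
print(math.exp(log_joint(words, tags)))
```

Working in log space is a common choice because products of many small probabilities underflow quickly on longer sentences.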

Estimating Emissions
[Figure: HMM state diagram from start state S to end state E]
Smoothing
• Unknown/rare words get inaccurate probabilities
• Reminder: Laplace Smoothing (Add-k)
• Next lecture: we will look at “features”
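Add-k smoothing of emission probabilities can be sketched as follows, assuming the usual count-based estimate P(word | tag) = (count(tag, word) + k) / (count(tag) + k · |V|), where V is the training vocabulary. The function name `estimate_emissions` and the three-sentence corpus are hypothetical.

```python
from collections import Counter

def estimate_emissions(tagged_sentences, k=1.0):
    """Return prob(word, tag) = (count(tag, word) + k) / (count(tag) + k * |V|)."""
    pair_counts, tag_counts, vocab = Counter(), Counter(), set()
    for sentence in tagged_sentences:
        for word, tag in sentence:
            pair_counts[tag, word] += 1
            tag_counts[tag] += 1
            vocab.add(word)

    def prob(word, tag):
        # Counter returns 0 for unseen pairs, so every word gets at least k.
        return (pair_counts[tag, word] + k) / (tag_counts[tag] + k * len(vocab))

    return prob

# Hypothetical toy corpus of (word, tag) pairs.
corpus = [[("the", "DET"), ("dog", "NOUN")],
          [("the", "DET"), ("cat", "NOUN")],
          [("a", "DET"), ("dog", "NOUN")]]
emit = estimate_emissions(corpus, k=1.0)
print(emit("dog", "NOUN"), emit("the", "NOUN"))
```

The estimate sums to one only over the training vocabulary; a fuller version would typically map rare and unseen words to a shared unknown-word token before counting.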

Estimating Transitions
[Figure: HMM state diagram from start state S to end state E]
Interpolation
• If there are too many tags, or too little data, some combinations are too rare
• Same as N-gram language models, “backoff” to simpler models
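One way to realize the interpolation idea is to mix the bigram tag estimate with a unigram tag estimate. This is a sketch under that assumption; the mixing weight `lam`, the function name `train_transitions`, and the two toy tag sequences are illustrative choices, not taken from the slides.

```python
from collections import Counter

def train_transitions(tag_sequences, lam=0.5):
    """Return prob(cur, prev) = lam * P_ML(cur | prev) + (1 - lam) * P_ML(cur)."""
    bigram, context, unigram = Counter(), Counter(), Counter()
    for tags in tag_sequences:
        padded = ["S"] + list(tags) + ["E"]  # explicit start and end states
        for prev, cur in zip(padded, padded[1:]):
            bigram[prev, cur] += 1
            context[prev] += 1
            unigram[cur] += 1
    total = sum(unigram.values())

    def prob(cur, prev):
        # Fall back to the unigram estimate alone for an unseen context.
        p_bigram = bigram[prev, cur] / context[prev] if context[prev] else 0.0
        p_unigram = unigram[cur] / total
        return lam * p_bigram + (1 - lam) * p_unigram

    return prob

# Hypothetical toy tag sequences.
transition = train_transitions([["DET", "NOUN", "VB"], ["DET", "ADJ", "NOUN"]])
print(transition("NOUN", "DET"), transition("VB", "DET"))
```

The transition DET to VB never occurs in the toy data, yet it still receives a small probability from the unigram term, which is the point of backing off to a simpler model.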

Predicting from HMMs

Brute Force Inference
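Brute-force inference scores every possible tag sequence and keeps the best one. A minimal sketch, assuming a three-tag set (N, V, IN) and invented probability tables; `TRANS`, `EMIT`, `joint`, and `brute_force` are hypothetical names.

```python
from itertools import product

TAGS = ["N", "V", "IN"]
# Toy parameters, invented for illustration only.
TRANS = {"S":  {"N": 0.8, "V": 0.1,  "IN": 0.1},
         "N":  {"N": 0.3, "V": 0.4,  "IN": 0.1,  "E": 0.2},
         "V":  {"N": 0.4, "V": 0.05, "IN": 0.35, "E": 0.2},
         "IN": {"N": 0.8, "V": 0.05, "IN": 0.05, "E": 0.1}}
EMIT = {"N":  {"fruit": 0.4,  "flies": 0.25, "like": 0.05, "bananas": 0.3},
        "V":  {"fruit": 0.05, "flies": 0.45, "like": 0.45, "bananas": 0.05},
        "IN": {"fruit": 0.01, "flies": 0.01, "like": 0.97, "bananas": 0.01}}

def joint(words, tags):
    """P(words, tags) for one complete tag sequence."""
    p, prev = 1.0, "S"
    for word, tag in zip(words, tags):
        p *= TRANS[prev][tag] * EMIT[tag][word]
        prev = tag
    return p * TRANS[prev]["E"]

def brute_force(words):
    """Enumerate all len(TAGS) ** len(words) sequences and return the best."""
    return max(product(TAGS, repeat=len(words)),
               key=lambda tags: joint(words, tags))

words = ["fruit", "flies", "like", "bananas"]
print(brute_force(words))
```

With 3 tags and 4 words this enumerates 81 sequences; the count grows exponentially with sentence length, which is what motivates dynamic programming.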

Conditional Independence
[Figure: HMM state diagram from start state S to end state E]

Dynamic Programming

State Lattice
   Fruit    flies    like     bananas
   R(1,N)   R(2,N)   R(3,N)   R(4,N)
S  R(1,V)   R(2,V)   R(3,V)   R(4,V)   E
   R(1,IN)  R(2,IN)  R(3,IN)  R(4,IN)

Viterbi Decoding Algorithm
• Initialization
• Iterative Computation (forward)
• Follow pointers (backward)
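The three steps can be sketched as below. This is a minimal illustration, not the lecture's own code: it assumes a first-order HMM with start state S and end state E, uses invented probability tables, and takes R[i][t] to mean the score of the best tag prefix ending in tag t at position i (positions are 0-based here). A practical version would usually work with log-probabilities.

```python
TAGS = ["N", "V", "IN"]
# Toy parameters, invented for illustration only.
TRANS = {"S":  {"N": 0.8, "V": 0.1,  "IN": 0.1},
         "N":  {"N": 0.3, "V": 0.4,  "IN": 0.1,  "E": 0.2},
         "V":  {"N": 0.4, "V": 0.05, "IN": 0.35, "E": 0.2},
         "IN": {"N": 0.8, "V": 0.05, "IN": 0.05, "E": 0.1}}
EMIT = {"N":  {"fruit": 0.4,  "flies": 0.25, "like": 0.05, "bananas": 0.3},
        "V":  {"fruit": 0.05, "flies": 0.45, "like": 0.45, "bananas": 0.05},
        "IN": {"fruit": 0.01, "flies": 0.01, "like": 0.97, "bananas": 0.01}}

def score(words, tags):
    """P(words, tags) for one complete tag sequence (used for checking)."""
    p, prev = 1.0, "S"
    for word, tag in zip(words, tags):
        p *= TRANS[prev][tag] * EMIT[tag][word]
        prev = tag
    return p * TRANS[prev]["E"]

def viterbi(words):
    # Initialization: leave the start state and emit the first word.
    R = [{t: TRANS["S"][t] * EMIT[t][words[0]] for t in TAGS}]
    back = [{}]
    # Iterative computation (forward): extend the best prefix into each tag.
    for i in range(1, len(words)):
        R.append({})
        back.append({})
        for t in TAGS:
            prev = max(TAGS, key=lambda p: R[i - 1][p] * TRANS[p][t])
            R[i][t] = R[i - 1][prev] * TRANS[prev][t] * EMIT[t][words[i]]
            back[i][t] = prev
    # Follow pointers (backward), starting from the best transition into E.
    tags = [max(TAGS, key=lambda t: R[-1][t] * TRANS[t]["E"])]
    for i in range(len(words) - 1, 0, -1):
        tags.append(back[i][tags[-1]])
    return tags[::-1]

words = ["fruit", "flies", "like", "bananas"]
print(viterbi(words))
```

On this toy model the result agrees with exhaustively scoring all 81 tag sequences, while visiting each lattice cell only once.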

Computational Complexity
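The standard comparison is written out below, under the assumption of a first-order HMM with T tags and a sentence of N words. The symbols T and N are notation chosen here, and the block assumes the `amsmath` package for `\text`.

```latex
\underbrace{O\!\left(T^{N}\right)}_{\text{brute force: one score per tag sequence}}
\qquad\text{versus}\qquad
\underbrace{O\!\left(N \cdot T^{2}\right)}_{\text{Viterbi: } N \cdot T \text{ lattice cells, } T \text{ predecessors each}}
```

For the four-word lattice with three tags, that is 3^4 = 81 full sequences against 4 · 3^2 = 36 cell-predecessor updates.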

Unsupervised Tagging
Supervision is not always appropriate
• Linguist has to read and understand each sentence
• Time-consuming and expensive
• Contains domain-specific signal in the labels
• WSJ doesn’t generalize to Twitter, for example
• Difficult to agree on the universal part-of-speech tags (C5 tags: 61, Brown: 87)
• Want to apply it to low-resource/unknown languages
Generalize the notion of “clustering” to sequence labeling.

Expectation Maximization
K-Means
• Initialization: Pick K random centroids
• Compute Expectations: Cluster all the points
• Update Parameters: Update centroids
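The K-Means loop can be sketched in a few lines. This is an illustration on one-dimensional points, with the initial centroids passed in explicitly (rather than picked at random) so the run is reproducible; the function name `kmeans_1d` and the data are hypothetical.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Alternate between assigning points to clusters and updating centroids."""
    for _ in range(iterations):
        # Compute expectations: assign each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for x in points:
            nearest = min(range(len(centroids)),
                          key=lambda k: abs(x - centroids[k]))
            clusters[nearest].append(x)
        # Update parameters: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [sum(c) / len(c) if c else centroids[k]
                     for k, c in enumerate(clusters)]
    return centroids

points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
print(kmeans_1d(points, centroids=[0.0, 5.0]))
```

K-Means makes hard assignments, whereas EM for an HMM would use soft (expected) counts in the corresponding step; the alternation between the two steps is the shared structure.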

Upcoming…
Homework
• Homework 2 is due (~10 days): February 9, 2017
• Write-up, data, and code for Homework 2 are up
• Ask questions early!
Project
• Proposal is due in a week: February 7, 2017
• Only 2 pages
Summaries
• Paper summaries: February 17, February 28, March 14
• Only 1 page each
