Introduction to Hidden Markov Models
CMSC 473/673, UMBC
October 9th, 2017
673 Announcement: Graduate Paper
Due this Wednesday 10/11 at 11:59 AM (< 2 days) Use project id paper1 for the submit utility
Course Announcement: Assignment 2
Due next Wednesday, 10/18 (~9 days) Any questions?
Course Announcement: Midterm
Monday, 10/30 (3 weeks) Format: In-class (75 minutes) You may bring any notes you created yourself; they must be turned in with the exam (photocopies are OK) Some practice questions will be out next Wednesday (10/18)
Recap from last time…
Expectation Maximization (EM)
Two-step, iterative algorithm:
0. Assume some value for your parameters
1. E-step: count under uncertainty, assuming these parameters
2. M-step: maximize log-likelihood, assuming these uncertain (estimated) counts
Counting Requires Marginalizing
E-step: count under uncertainty, assuming these parameters
p(w) = p(z1 & w) + p(z2 & w) + p(z3 & w) + p(z4 & w)
break into 4 disjoint pieces
EM Example 1: Three Coins / Class-based Unigrams
Imagine three coins. Flip the 1st coin (penny). If heads: flip the 2nd coin (dollar coin). If tails: flip the 3rd coin (dime).
Observed: a, b, e, etc.; "We run the code" vs. "The run failed"
Unobserved: vowel or consonant? part of speech?
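A minimal sketch of EM for a three-coin model like this one, assuming made-up data and starting values (all names and numbers below are illustrative, not from the slides): the penny picks a latent class, the chosen coin emits a letter, and only the letters are observed.

```python
from collections import defaultdict

# Hypothetical observed letters; which coin produced each one is unobserved.
data = list("aabbebea")
vocab = set(data)

# 0. Assume some value for your parameters
lam = 0.6                                     # p(heads) for the penny
emit = {"H": {"a": 0.5, "b": 0.3, "e": 0.2},  # dollar coin's letter distribution
        "T": {"a": 0.2, "b": 0.4, "e": 0.4}}  # dime's letter distribution

for _ in range(20):  # a few EM iterations
    # 1. E-step: count under uncertainty, assuming these parameters
    counts = {"H": defaultdict(float), "T": defaultdict(float)}
    total = {"H": 0.0, "T": 0.0}
    for w in data:
        pH = lam * emit["H"][w]
        pT = (1 - lam) * emit["T"][w]
        post_H = pH / (pH + pT)   # p(class = H | letter), by marginalizing over classes
        counts["H"][w] += post_H
        counts["T"][w] += 1 - post_H
        total["H"] += post_H
        total["T"] += 1 - post_H
    # 2. M-step: maximize log-likelihood using the uncertain (estimated) counts
    lam = total["H"] / len(data)
    for z in ("H", "T"):
        emit[z] = {w: counts[z][w] / total[z] for w in vocab}
```

The re-estimated distributions stay properly normalized because the fractional counts for each class sum to that class's total.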
EM Example 2: Machine Translation Alignment
Want: P(f | e), but we don't know how to train this directly…
Solution: use P(a, f | e), where a is an alignment
Remember: P(f | e) = Σa P(a, f | e), marginalizing across all possible alignments
Le chat est sur la chaise verte
The cat is on the green chair
IBM Model 1 (1993)
f: vector of French words
e: vector of English words
a: vector of alignment indices
t(fj | ei): translation probability of the word fj given the word ei
Visualization of an alignment:
Le chat est sur la chaise verte
The cat is on the green chair
a = (0, 1, 2, 3, 4, 6, 5)
Learning the Alignments through EM
Two-step, iterative algorithm:
0. Assume some value for the translation parameters and compute other parameter values
1. E-step: count alignments and translations under uncertainty, assuming these parameters
2. M-step: maximize log-likelihood (update parameters), using these uncertain (estimated) counts
P(alignment | "the cat") for each candidate alignment of "le chat"
Follow up: IBM Model 1 Parameters
For IBM Model 1, we can compute all parameters given the translation parameters t(fj | ei). How many of these are there? |French vocabulary| × |English vocabulary|. From Rebecca: see Sec. 31 of the Knight tutorial for more about space considerations.
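The E-step/M-step loop for IBM Model 1 can be sketched on a toy parallel corpus (the corpus and all variable names below are invented for illustration; a NULL token lets French words align to nothing):

```python
from collections import defaultdict

# Hypothetical toy parallel corpus: (French sentence, English sentence) pairs.
corpus = [("le chat".split(), "NULL the cat".split()),
          ("le chien".split(), "NULL the dog".split())]

# 0. Initialize t(f | e) uniformly over the French vocabulary
f_vocab = {f for fs, _ in corpus for f in fs}
t = defaultdict(lambda: 1.0 / len(f_vocab))

for _ in range(10):
    # E-step: expected alignment/translation counts under the current t
    count = defaultdict(float)   # count[(f, e)]
    total = defaultdict(float)   # total[e]
    for fs, es in corpus:
        for f in fs:
            norm = sum(t[(f, e)] for e in es)   # marginalize over all alignments of f
            for e in es:
                c = t[(f, e)] / norm            # posterior p(f aligns to e)
                count[(f, e)] += c
                total[e] += c
    # M-step: re-estimate t(f | e) from the expected (uncertain) counts
    for (f, e), c in count.items():
        t[(f, e)] = c / total[e]
```

After a few iterations "chat" should come to prefer "cat" over "the", because "le" co-occurs with "the" in both sentence pairs and claims that probability mass.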
Alignment: Output and Complexities
Component of machine translation systems
Produce a translation lexicon automatically
Cross-lingual projection/extraction of information
Supervision for training other models (for example, neural MT systems)
http://www.cis.upenn.edu/~ccb/figures/research-statement/pivoting.jpg
Any Questions on What We’ve Seen of EM So Far?
Hidden Markov Models
…
Agenda
HMM Motivation (Part of Speech) and Brief Definition What is Part of Speech? HMM Detailed Definition HMM Tasks
Hidden Markov Models
Class-based Model: use different distributions to explain groupings of observations
Sequence Model: bigram model of the classes, not the observations
Implicitly model all possible class sequences
Algorithms for finding the best sequence, and for the marginal likelihood
Hidden Markov Models: Part of Speech
p(British Left Waffles on Falkland Islands)
(i): Adjective Noun Verb Prep Noun Noun
(ii): Noun Verb Noun Prep Noun Noun
1. Explain this sentence as a sequence of (likely?) latent (unseen) tags (labels)
2. Produce a tag sequence for this sentence
Class-based model
Bigram model of the classes
Model all class sequences
Agenda
HMM Motivation (Part of Speech) and Brief Definition What is Part of Speech? HMM Detailed Definition HMM Tasks
Brief Aside: Parts of Speech
Classes of words that behave like one another in similar syntactic contexts
Pronunciation (stress) can differ: object (noun: OB-ject) vs. object (verb: ob-JECT)
It can help improve the inputs to other systems (text-to-speech, syntactic parsing)
XKCD, #1771: https://imgs.xkcd.com/comics/it_was_i.png
Parts of Speech
Adapted from Luke Zettlemoyer

Open class words:
Nouns: milk, cat, cats, UMBC, Baltimore, bread
Verbs: speak, give, run (intransitive / transitive / ditransitive); modals and auxiliaries: can, do, may ("I can eat.")
Adjectives: would-be, wettest, large, happy, red, fake (subsective vs. non-subsective: Kamp & Partee (1995))
Adverbs: recently, happily, then, there (location) ("Today, we eat there.")

Closed class words:
Pronouns: I, you, there ("I ate." "There is a cat.")
Determiners: a, the, every, what
Conjunctions: and, or, if, because
Prepositions: in, under, top
Numbers: one, 1,324
Particles: (set) up, so (far), not, (call) off

Language evolves! "I'm reading this because I want to procrastinate." vs. "I'm reading this because procrastination." (because as a preposition)
https://www.theatlantic.com/technology/archive/2013/11/english-has-a-new-preposition-because-internet/281601/
Agenda
HMM Motivation (Part of Speech) and Brief Definition What is Part of Speech? HMM Detailed Definition HMM Tasks
Hidden Markov Models: Part of Speech
p(British Left Waffles on Falkland Islands)
(i): Adjective Noun Verb Prep Noun Noun
(ii): Noun Verb Noun Prep Noun Noun
Class-based model: p(wj | zj)
Bigram model of the classes: p(zj | zj-1)
Model all class sequences z1, …, zS:
p(z1, w1, z2, w2, …, zS, wS)
Hidden Markov Model
Goal: maximize the (log-)likelihood
In practice: we don't actually observe these z values; we just see the words w
p(z1, w1, z2, w2, …, zS, wS) = p(z1|z0) p(w1|z1) ⋯ p(zS|zS-1) p(wS|zS) = ∏j p(wj|zj) p(zj|zj-1)
if we knew the probability parameters, then we could estimate z and evaluate the likelihood… but we don't! :(
if we did observe z, estimating the probability parameters would be easy… but we don't! :(
Hidden Markov Model Terminology
Each zi can take the value of one of K latent states
Transition and emission distributions do not change
p(z1, w1, z2, w2, …, zS, wS) = p(z1|z0) p(w1|z1) ⋯ p(zS|zS-1) p(wS|zS) = ∏j p(wj|zj) p(zj|zj-1)
transition probabilities/parameters: p(zj|zj-1)
emission probabilities/parameters: p(wj|zj)
Q: How many different probability values are there with K states and V vocab items?
A: V·K emission values and K² transition values
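The joint factorization and the V·K emission / K² transition parameter count can be checked with a small sketch (the 2-state tables below are hypothetical values, not from the slides):

```python
# Hypothetical 2-state HMM (states N, V) over a 3-word vocabulary.
trans = {("BOS", "N"): 0.7, ("BOS", "V"): 0.3,   # initial distribution p(z1 | z0)
         ("N", "N"): 0.4, ("N", "V"): 0.6,       # transition parameters p(zj | zj-1)
         ("V", "N"): 0.8, ("V", "V"): 0.2}
emit = {("N", "cat"): 0.5, ("N", "runs"): 0.1, ("N", "the"): 0.4,  # emissions p(wj | zj)
        ("V", "cat"): 0.1, ("V", "runs"): 0.8, ("V", "the"): 0.1}

def joint(zs, ws):
    """p(z1, w1, ..., zS, wS) = prod_j p(wj | zj) p(zj | zj-1)."""
    p, prev = 1.0, "BOS"
    for z, w in zip(zs, ws):
        p *= trans[(prev, z)] * emit[(z, w)]
        prev = z
    return p

# With K states and V vocab items: V*K emission values, K^2 transition values
K, V = 2, 3
assert len(emit) == V * K
assert sum(1 for (prev, _) in trans if prev != "BOS") == K * K
```

For the tag sequence (N, V) over ("cat", "runs") this multiplies out to 0.7 · 0.5 · 0.6 · 0.8.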
Hidden Markov Model Representation
p(z1, w1, z2, w2, …, zS, wS) = p(z1|z0) p(w1|z1) ⋯ p(zS|zS-1) p(wS|zS) = ∏j p(wj|zj) p(zj|zj-1)
emission probabilities/parameters: p(wj|zj)
transition probabilities/parameters: p(zj|zj-1)
Represent the probabilities and independence assumptions in a graph: Graphical Models (see 478/678)
z1 → z2 → z3 → z4 → … (hidden states), each zi emitting its word wi
Transition arcs: p(z1|z0) from the initial starting distribution ("BOS"), then p(z2|z1), p(z3|z2), p(z4|z3), …
Emission arcs: p(w1|z1), p(w2|z2), p(w3|z3), p(w4|z4), …
Each zi can take the value of one of K latent states
Transition and emission distributions do not change
Example: 2-state Hidden Markov Model as a Lattice
States z1 = N, z2 = N, z3 = N, z4 = N, … and z1 = V, z2 = V, z3 = V, z4 = V, … over words w1, w2, w3, w4, …
Emission arcs: p(w1|N), p(w2|N), p(w3|N), p(w4|N) and p(w1|V), p(w2|V), p(w3|V), p(w4|V)
Transition arcs: p(N|start) and p(V|start) into position 1; then p(N|N), p(V|V), p(V|N), p(N|V) between each pair of adjacent positions
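Brute-force enumeration over the lattice makes "model all class sequences" concrete: with K = 2 states and 3 words there are only 2³ = 8 paths, so we can sum them directly for the marginal likelihood and take the max for the best tag sequence. (Tables below are hypothetical; real systems use dynamic programming such as the forward and Viterbi algorithms rather than enumeration.)

```python
from itertools import product

# Hypothetical 2-state tables (illustrative values only)
trans = {("BOS", "N"): 0.7, ("BOS", "V"): 0.3,
         ("N", "N"): 0.4, ("N", "V"): 0.6,
         ("V", "N"): 0.8, ("V", "V"): 0.2}
emit = {("N", "cat"): 0.5, ("N", "runs"): 0.1, ("N", "the"): 0.4,
        ("V", "cat"): 0.1, ("V", "runs"): 0.8, ("V", "the"): 0.1}

def joint(zs, ws):
    """p(z, w) for one lattice path: prod_j p(wj | zj) p(zj | zj-1)."""
    p, prev = 1.0, "BOS"
    for z, w in zip(zs, ws):
        p *= trans[(prev, z)] * emit[(z, w)]
        prev = z
    return p

ws = ["the", "cat", "runs"]
paths = list(product("NV", repeat=len(ws)))              # all 2^3 = 8 lattice paths
marginal = sum(joint(zs, ws) for zs in paths)            # p(w) = sum_z p(z, w)
best = max(paths, key=lambda zs: joint(zs, ws))          # most likely tag sequence
```

With these toy numbers the best path tags "the cat" as N N and "runs" as V, and the marginal likelihood is strictly larger than any single path's probability.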
Comparison of Joint Probabilities
Unigram Language Model:
p(w1, w2, …, wS) = p(w1) p(w2) ⋯ p(wS) = ∏j p(wj)
Unigram Class-based Language Model ("K" coins):
p(z1, w1, z2, w2, …, zS, wS) = p(z1) p(w1|z1) ⋯ p(zS) p(wS|zS) = ∏j p(wj|zj) p(zj)
Hidden Markov Model:
p(z1, w1, z2, w2, …, zS, wS) = p(z1|z0) p(w1|z1) ⋯ p(zS|zS-1) p(wS|zS) = ∏j p(wj|zj) p(zj|zj-1)
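The three factorizations differ only in how the class term is conditioned; computing each for one toy tagged sentence makes that visible (all probabilities below are made up for illustration):

```python
import math

ws = ["the", "cat", "runs"]
zs = ["N", "N", "V"]   # one class sequence, for comparison

# Hypothetical toy distributions
p_w  = {"the": 0.5, "cat": 0.3, "runs": 0.2}                       # unigram LM p(w)
p_z  = {"N": 0.6, "V": 0.4}                                        # class prior p(z)
emit = {("N", "the"): 0.4, ("N", "cat"): 0.5, ("V", "runs"): 0.8}  # p(w | z)
trans = {("BOS", "N"): 0.7, ("N", "N"): 0.4, ("N", "V"): 0.6}      # p(z | z_prev)

# Unigram LM: prod_j p(wj)
unigram = math.prod(p_w[w] for w in ws)

# Unigram class-based LM: prod_j p(zj) p(wj | zj)
class_based = math.prod(p_z[z] * emit[(z, w)] for z, w in zip(zs, ws))

# HMM: prod_j p(zj | zj-1) p(wj | zj)
hmm, prev = 1.0, "BOS"
for z, w in zip(zs, ws):
    hmm *= trans[(prev, z)] * emit[(z, w)]
    prev = z
```

Only the class term changes across the three lines: absent, p(zj), or p(zj | zj-1); the emission term p(wj | zj) is shared by the two class-based models.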