N-gram Language Models
CMSC 470 Marine Carpuat
Slides credit: Jurasky & Martin
N-gram Language Models CMSC 470 Marine Carpuat Slides credit: - - PowerPoint PPT Presentation
N-gram Language Models CMSC 470 Marine Carpuat Slides credit: Jurasky & Martin Roadmap Language Models Our first example of modeling sequences n-gram language models How to estimate them? How to evaluate them? Neural
Slides credit: Jurasky & Martin
P(W) = P(w1,w2,w3,w4,w5…wn)
P(w5|w1,w2,w3,w4)
P(W) or P(wn|w1,w2…wn-1)
between frequency and rank
150th most common word
f = frequency r = rank c = constant
Graph illustrating Zipf’s Law for the Brown corpus
from Manning and Shütze
p(B|A) = P(A,B)/P(A) Rewriting: P(A,B) = P(A)P(B|A)
P(A,B,C,D) = P(A)P(B|A)P(C|A,B)P(D|A,B,C)
P(x1,x2,x3,…,xn) = P(x1)P(x2|x1)P(x3|x1,x2)…P(xn|x1,…,xn-1)
P(“its water is so transparent”) = P(its) × P(water|its) × P(is|its water) × P(so|its water is) × P(transparent|its water is so)
i
… …
P(the |its water is so transparent that) = Count(its water is so transparent that the) Count(its water is so transparent that)
P(the |its water is so transparent that) » P(the |that)
P(the |its water is so transparent that) » P(the |transparent that)
Andrei Markov
i
… … … …
fifth, an, of, futures, the, an, incorporated, a, a, the, inflation, most, dollars, quarter, in, is, mass thrift, did, eighty, said, hard, 'm, july, bullish that, or, limited, the Some automatically generated sentences from a unigram model
i
…
Condition on the previous word:
texaco, rose, one, in, this, issue, is, pursuing, growth, in, a, boiler, house, said, mr., gurria, mexico, 's, motion, control, proposal, without, permission, from, five, hundred, fifty, five, yen
this, would, be, a, record, november
…
“The computer which I had just put into the machine room on the ground floor crashed.”
<s> I am Sam </s> <s> Sam I am </s> <s> I do not like green eggs and ham </s>
P(wi | wi-1) = c(wi-1,wi) c(wi-1)
9222 sentences in total Examples
P(<s> I want english food </s>) = P(I|<s>) × P(want|I) × P(english|want) × P(food|english) × P(</s>|food) = .000031
…
http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html
P(w | denied the) 3 allegations 2 reports 1 claims 1 request 7 total P(w | denied the) 2.5 allegations 1.5 reports 0.5 claims 0.5 request 2 other 7 total
allegations reports claims
attack
request
man
allegations
attack man
allegations reports
claims
request
From Dan Klein
P
MLE(wi | wi-1) = c(wi-1,wi)
c(wi-1) P
Add-1(wi | wi-1) = c(wi-1,wi)+1
c(wi-1)+V
S(wi | wi-k+1
i-1 ) =
count(wi-k+1
i
) count(wi-k+1
i-1 )
if count(wi-k+1
i
) > 0 0.4S(wi | wi-k+2
i-1 ) otherwise
ì í ï ï î ï ï
S(wi) = count(wi) N