Foundations of Language Science and Technology: Statistical Language Models
Dietrich Klakow
Foundations of Language Science and Technology: Statistical - - PowerPoint PPT Presentation
Foundations of Language Science and Technology: Statistical Language Models Dietrich Klakow Using Language Models 2 How Speech Recognition works Speech Signal Feature Extraction Feature Extraction Acoustic Model Stream of feature P(A|W)
Dietrich Klakow
2
3 Speech Signal Feature Extraction Feature Extraction
Acoustic Model
P(A|W)
Language Model
P(W) Stream of feature vectors A Search W=argmax [P(A|W) P(W)] ^ [W]
Recognized word sequence W ^ ^ Language Model
P(W)
4
5
6
7
8
9
10
D6 D1 D2 D3 D5 D4 D7 Query Q P(Q|D2) Ranking according to P(Q|Di)
11
12
− h w N N
, / 1 1
P(w|h): language model N(w,h): frequency of sequence w,h in some test corpus N: size of test corpus
13
14
Perplexity and error rate are correlate within error bars
15
16
h w Train Train
,
17
h w Train Train
,
Minimizing perplexity
How to take normalization constraint into account?
18
19
Train Train
What´s the problem?
20
21
22
23
24
Train Train Train Train
2 1 2 1 1 1 1
− − −
V: size of vocabulary
25
26
27