Language and Stats 11-(7/6)61 Introduction Objectives Logistics Statistical Language Modeling (SLM); Computational Linguistics (CL)
Bhiksha Raj
1 11-761
Language and Stats 11-(7/6)61 Introduction Objectives Logistics - - PowerPoint PPT Presentation
Language and Stats 11-(7/6)61 Introduction Objectives Logistics Statistical Language Modeling (SLM); Computational Linguistics (CL) Bhiksha Raj 11-761 1 11-761 2 Language and Statistics Iozmne pqmnzg habfbngyeydh shahmw Language or
1 11-761
11-761 2
3 11-761
4 11-761
5 11-761
6 11-761
7 11-761
8 11-761
9 11-761
10 11-761
George into the entrance of the trail and up toward the highway. Curley and Carlson looked after them. And Carlson said, "Now what the hell ya suppose is eatin' them two guys?“
11 11-761
George into the entrance of the trail and up toward the highway. Curley and Carlson looked after them. And Carlson said, "Now what the hell ya suppose is eatin' them two guys?“
12 11-761
George into the entrance of the trail and up toward the highway. Curley and Carlson looked after them. And Carlson said, "Now what the hell ya suppose is eatin' them two guys?“
13 11-761
14 11-761
likely to be expressed in a valid sentence than other sequences we have never seen
11-761 15
char O, o[]; main(l) {for(;~l;O||puts(o)) O=(O[o]=~(l=getchar())?4<(4^l>>5)?l:46:0)?-~O & printf("%02x ",l)*5:!O;}
11-761 16
patterned sequences of symbolic units
communication
11-761 17
language
previously unseen patterns
11-761 18
11-761 19
above..
11-761 20
21
probability, statistics and information theory.
11-761
22
11-761
23
11-761
24
No single book exists which covers the course material.
11-761
25
11-761
26
11-761
27
11-761
28 11-761
29 11-761
30 11-761
11-761 31
11-761 32
11-761 33
Mary Jane
11-761 34
11-761 35
11-761 36
11-761 37
11-761 38
11-761 39
11-761 40
11-761 41
11-761 42
11-761 43
11-761 44
11-761 45
11-761 46
11-761 47
11-761 48
11-761 49
linguists
11-761 50
51 11-761
*
*
11-761 52
)
11-761 53
11-761 54
acoustic signal !
into the channel
statement
11-761 55
source " !
11-761 56
source ! "
*
*
57
source " !
*
*
58
source " ! Language Model Quantifies the plausibility of word sequences
*
*
59
source " ! Language Model Quantifies the plausibility of word sequences Acoustic Model Quantifies the degree of match between candidate word sequence and observed acoustics
*
*
source " ! Language Model Quantifies the plausibility of word sequences Acoustic Model Quantifies the degree of match between candidate word sequence and observed acoustics
*
*
61
source " ! Estimated Language Model Estimated Acoustic Model
11-761 62
four score and seven years ago Hace cuatro y siete años
)
)
11-761 63
four score and seven years ago Hace cuatro y siete años
)
)
)
11-761 64
four score and seven years ago Hace cuatro y siete años
!∗ = argmax
)
* + ! * +(-|!)
is exponentially large
11-761 65
four score and seven years ago Hace cuatro y siete años
1 ! 2
11-761 66 There was ease in Casey’s manner as he stepped into his place; There was pride in Casey’s bearing and a smile lit Casey’s face. And when, responding to the cheers, he lightly doffed his hat, No stranger in the crowd could doubt ‘twas Casey at the bat.
SPORTS
produces a document
*
*
*
67 There was ease in Casey’s manner as he stepped into his place; There was pride in Casey’s bearing and a smile lit Casey’s face. And when, responding to the cheers, he lightly doffed his hat, No stranger in the crowd could doubt ‘twas Casey at the bat.
SPORTS
produces a document
*
*
*
68 There was ease in Casey’s manner as he stepped into his place; There was pride in Casey’s bearing and a smile lit Casey’s face. And when, responding to the cheers, he lightly doffed his hat, No stranger in the crowd could doubt ‘twas Casey at the bat.
SPORTS
!∗ = argmax
)
*(!|-) = argmax
)
* ! *(-|!)
*(!|-) (or alternately of *(!) and *(-|!))
11-761 69
!∗ = argmax
)
*(!|-) = argmax
)
* ! *(-|!)
*(!|-) (or alternately of *(!) and *(-|!))
11-761 70
The a priori probability distribution of X (also called the prior) Tells us about the natural biases in the data – which Xs are produced more preferentially by the source
!∗ = argmax
)
*(!|-) = argmax
)
* ! *(-|!)
*(!|-) (or alternately of *(!) and *(-|!))
11-761 71
The a priori probability distribution of X (also called the prior) Tells us about the natural biases in the data – which Xs are produced more preferentially by the source The conditional probability of Y given X Gives us a measure of the “fit” of Y to a given X
!∗ = argmax
)
*(!|-) = argmax
)
* ! *(-|!)
*(!|-) (or alternately of *(!) and *(-|!))
11-761 72
The a priori probability distribution of X (also called the prior) Tells us about the natural biases in the data – which Xs are produced more preferentially by the source The conditional probability of Y given X Gives us a measure of the “fit” of Y to a given X The decomposition allows us to learn these two from two entirely different datasets
11-761 73
)
11-761 74
)
11-761 75
)
11-761 76
)
77
)
11-761 78 There was ease in Casey’s manner as he stepped into his place; There was pride in Casey’s bearing and a smile lit Casey’s face. And when, responding to the cheers, he lightly doffed his hat, No stranger in the crowd could doubt ‘twas Casey at the bat.
SPORTS
)
79
four score and seven years ago Hace cuatro y siete años
1 ! 2
11-761 80
11-761 81