SLIDE 1
http://demo.clab.cs.cmu.edu/algo4nlp19/ - - PowerPoint PPT Presentation
http://demo.clab.cs.cmu.edu/algo4nlp19/ - - PowerPoint PPT Presentation
http://demo.clab.cs.cmu.edu/algo4nlp19/ https://piazza.com/class/jy617kmo6ub134 Chan Young Park Jong Hyuk Park Slide credit: Noah Smith Slide credit: Noah Smith Slide credit: Noah Smith Slide credit: Noah
SLIDE 2
SLIDE 3
Slide credit: Noah Smith
SLIDE 4
Slide credit: Noah Smith
SLIDE 5
Slide credit: Noah Smith
SLIDE 6
Slide credit: Noah Smith
SLIDE 7
Slide credit: Noah Smith
SLIDE 8
Slide credit: Noah Smith
SLIDE 9
SLIDE 10
My legal name is Alexander Perchov.
SLIDE 11
My legal name is Alexander Perchov. But all of my many friends dub me Alex, because that is a more flaccid-to-utter version of my legal name.
SLIDE 12
My legal name is Alexander Perchov. But all of my many friends dub me Alex, because that is a more flaccid-to-utter version of my legal name. Mother dubs me Alexi-stop-spleening-me!, because I am always spleening her.
SLIDE 13
My legal name is Alexander Perchov. But all of my many friends dub me Alex, because that is a more flaccid-to-utter version of my legal name. Mother dubs me Alexi-stop-spleening-me!, because I am always spleening her. If you want to know why I am always spleening her, it is because I am always elsewhere with friends, and disseminating so much currency, and performing so many things that can spleen a mother.
SLIDE 14
My legal name is Alexander Perchov. But all of my many friends dub me Alex, because that is a more flaccid-to-utter version of my legal name. Mother dubs me Alexi-stop-spleening-me!, because I am always spleening her. If you want to know why I am always spleening her, it is because I am always elsewhere with friends, and disseminating so much currency, and performing so many things that can spleen a mother. Father used to dub me Shapka, for the fur hat I would don even in the summer month.
SLIDE 15
My legal name is Alexander Perchov. But all of my many friends dub me Alex, because that is a more flaccid-to-utter version of my legal name. Mother dubs me Alexi-stop-spleening-me!, because I am always spleening her. If you want to know why I am always spleening her, it is because I am always elsewhere with friends, and disseminating so much currency, and performing so many things that can spleen a mother. Father used to dub me Shapka, for the fur hat I would don even in the summer
- month. He ceased dubbing me that because I
- rdered him to cease dubbing me that. It sounded
boyish to me, and I have always thought of myself as very potent and generative.
SLIDE 16
▪ ▪ ▪ ▪
SLIDE 17
▪
▪ ▪
▪
▪ ▪ ▪ ▪ ▪
SLIDE 18
▪
▪ ▪
SLIDE 19
disseminating so much currency spending a lot of money
SLIDE 20
▪
▪ ▪
SLIDE 21
▪
▪
▪
▪ ▪
▪
▪
▪
SLIDE 22
▪
s p ee ch l a b
SLIDE 23
▪
SLIDE 24
SLIDE 25
SLIDE 26
SLIDE 27
▪
SLIDE 28
▪ ▪
SLIDE 29
▪
SLIDE 30
▪
SLIDE 31
SLIDE 32
SLIDE 33
sent transmission: English recovered transmission: French recovered message: English’
SLIDE 34
SLIDE 35
▪ ▪ ▪ ▪ ▪ handwriting recognition ▪ document summarization ▪ dialog generation ▪ linguistic decipherment ▪
SLIDE 36
▪ ▪ ▪ ▪ ▪ ▪
SLIDE 37
▪
▪ ▪
SLIDE 38
▪ ▪ ▪
SLIDE 39
▪ ▪ ▪ ▪
SLIDE 40
▪
▪
Given a sequence of n random variables: ▪ We want a sequence probability model
SLIDE 41
▪
▪
Given a sequence of n random variables: ▪ We want a sequence probability model ▪ There are |V|n possible sequences
SLIDE 42
SLIDE 43
SLIDE 44
▪ Relax independence assumption:
SLIDE 45
▪ Relax independence assumption: ▪ Simplify notation:
SLIDE 46
▪ We want probability distribution over sequences of any length
SLIDE 47
▪ Probability distribution over sequences of any length ▪ Define always Xn=STOP, where STOP is a special symbol
SLIDE 48
▪ Probability distribution over sequences of any length ▪ Define always Xn=STOP, where STOP is a special symbol ▪ Then use a Markov process as before: ▪ We now have probability distribution over all sequences
▪ Intuition: at every step you have probability 𝛽h to stop (conditioned
- n history) and (1-𝛽h) to keep going
SLIDE 49
SLIDE 50
▪
▪ ▪ ▪
SLIDE 51
SLIDE 52
SLIDE 53
SLIDE 54
SLIDE 55
▪ ▪
SLIDE 56
▪ ▪ ▪ ▪ ▪ ▪
SLIDE 57
▪
▪ ▪
198015222 the first 194623024 the same 168504105 the following 158562063 the world … 14112454 the door
- 23135851162 the *
Training Counts
SLIDE 58
▪
198015222 the first 194623024 the same 168504105 the following 158562063 the world … 14112454 the door
- 23135851162 the *
197302 close the window 191125 close the door 152500 close the gap 116451 close the thread 87298 close the deal …
- 3785230 close the *
Bigram Model Trigram Model P(door | the) = 0.0006 P(door | close the) = 0.05
SLIDE 59
▪ ▪ ▪ ▪ ▪ ▪
SLIDE 60
▪
SLIDE 61
SLIDE 62
▪ ▪ ▪
SLIDE 63
▪
▪ ▪ ⇒
SLIDE 64
▪
▪ ▪ ⇒ ▪ ▪ ⇒
SLIDE 65
▪ ▪ ▪ ▪ ▪ ▪
SLIDE 66
▪ ▪
SLIDE 67
▪ ▪ ▪ ▪
Training Data Held-Out Data Test Data
Counts / parameters from here Hyperparameters from here Evaluate here
SLIDE 68
▪
▪ ▪
SLIDE 69
▪
▪ ▪ ▪ ▪
SLIDE 70
▪
▪ ⇒ ▪ ⇒ ▪ ⇒
▪ ▪
▪
SLIDE 71
▪ ▪ ▪ ▪
SLIDE 72
▪ ▪ ▪ ▪ ▪ ▪
▪
SLIDE 73
▪ unk unk
▪
unk ▪
SLIDE 74
▪
SLIDE 75
SLIDE 76
▪ ▪
▪
▪ ▪
▪ ▪
SLIDE 77
▪
SLIDE 78
▪
SLIDE 79
Training Data Held-Out Data Test Data
Counts / parameters from here Hyperparameters from here Evaluate here
SLIDE 80