Natural Language Processing
Spring 2017
Liang Huang
Unit 1: Sequence Models
Lecture 4a: Probabilities and Estimations
Lecture 4b: Weighted Finite-State Machines
required
- ptional
Natural Language Processing Spring 2017 Unit 1: Sequence Models - - PowerPoint PPT Presentation
Natural Language Processing Spring 2017 Unit 1: Sequence Models Lecture 4a: Probabilities and Estimations Lecture 4b: Weighted Finite-State Machines required optional Liang Huang Probabilities experiment (e.g., toss a coin 3
Liang Huang
CS 562 - Lec 5-6: Probs & WFSTs
2
CS 562 - Lec 5-6: Probs & WFSTs
3
CS 562 - Lec 5-6: Probs & WFSTs
4
CS 562 - Lec 5-6: Probs & WFSTs
P(A|C) = P (A|B, C)
5
CS 562 - Lec 5-6: Probs & WFSTs
6
CS 562 - Lec 5-6: Probs & WFSTs
7
CS 562 - Lec 5-6: Probs & WFSTs
8
CS 562 - Lec 5-6: Probs & WFSTs
9
CS 562 - Lec 5-6: Probs & WFSTs
10
CS 562 - Lec 5-6: Probs & WFSTs
11
CS 562 - Lec 5-6: Probs & WFSTs
12
CS 562 - Lec 5-6: Probs & WFSTs
13
CS 562 - Lec 5-6: Probs & WFSTs
14
CS 562 - Lec 5-6: Probs & WFSTs
15
CS 562 - Lec 5-6: Probs & WFSTs
16
CS 562 - Lec 5-6: Probs & WFSTs
17
CS 562 - Lec 5-6: Probs & WFSTs
18
costs: replacement: 1 insertion: 2 deletion: 2
a:a/0 a:b/1 a:*e*/2 b:b/0 *e*:a/2 b:a/1
...
WFST: real edit distance
CS 562 - Lec 5-6: Probs & WFSTs
19
CS 562 - Lec 5-6: Probs & WFSTs
20
CS 562 - Lec 5-6: Probs & WFSTs
21
CS 562 - Lec 5-6: Probs & WFSTs
22
CS 562 - Lec 5-6: Probs & WFSTs
23
from M. Mohri and J. Eisner
CS 562 - Lec 5-6: Probs & WFSTs
24
CS 562 - Lec 5-6: Probs & WFSTs
25
CS 562 - Lec 5-6: Probs & WFSTs
26
CS 562 - Lec 5-6: Probs & WFSTs
27
CS 562 - Lec 5-6: Probs & WFSTs
28
CS 562 - Lec 5-6: Probs & WFSTs
29
CS 562 - Lec 5-6: Probs & WFSTs
30
CS 562 - Lec 5-6: Probs & WFSTs
31
FST C: POS bigram LM
CS 562 - Lec 5-6: Probs & WFSTs
32
CS 562 - Lec 5-6: Probs & WFSTs
33
CS 562 - Lec 5-6: Probs & WFSTs
34
CS 562 - Lec 5-6: Probs & WFSTs
35
a:ε" ε" ε:a b:ε" ε" ε:b a:b b:a a:a b:b O(k) deletion arcs O(k) insertion arcs O(k) identity arcs
from J. Eisner
CS 562 - Lec 5-6: Probs & WFSTs
36
clara
a : ε " ε " ε ε : a b : ε " ε " ε ε : b a : b b : a a : a b : b
.o. =
caca
.o.
c:ε" ε" l:ε" ε" a:ε" ε" r:ε" ε" a:ε" ε" ε:c c:c ε:c l:c ε:c a:c ε:c r:c ε:c a:c ε:c c:ε" ε" l:ε" ε" a:ε" ε" r:ε" ε" a:ε" ε" ε:a c:a ε:a l:a ε:a a:a ε:a r:a ε:a a:a ε:a c:ε" ε" l:ε" ε" a:ε" ε" r:ε" ε" a:ε" ε" ε:c c:c ε:c l:c ε:c a:c ε:c r:c ε:c a:c ε:c c:ε" ε" l:ε" ε" a:ε" ε" r:ε" ε" a:ε" ε" ε:a c:a ε:a l:a ε:a a:a ε:a r:a ε:a a:a ε:a c:ε" ε" l:ε" ε" a:ε" ε" r:ε" ε" a:ε" ε"
Best path (by Dijkstra’s algorithm)
CS 562 - Lec 5-6: Probs & WFSTs
Viterbi if the FSA is acyclic
37
(b. 1932) Viterbi Alg. (1967) CMDA, Qualcomm Edsger Dijkstra (1930-2002) “GOTO considered harmful”
CS 562 - Lec 5-6: Probs & WFSTs
38
that’s min. spanning tree!
Jarnik (1930) - Prim (1957) - Dijkstra (1959)
Edsger Dijkstra (1930-2002) “GOTO considered harmful”
CS 562 - Lec 5-6: Probs & WFSTs
39
that’s shortest-path
Moore (1957) - Dijkstra (1959)
CS 562 - Lec 5-6: Probs & WFSTs
40
special case of dynamic programming (Bellman, 1957)
CS 562 - Lec 5-6: Probs & WFSTs
Viterbi
Viterbi globally
41