Introduction + Information Theory
LING 572 January 7, 2020 Shane Steinert-Threlkeld
1
Introduction + Information Theory LING 572 January 7, 2020 Shane - - PowerPoint PPT Presentation
Introduction + Information Theory LING 572 January 7, 2020 Shane Steinert-Threlkeld Adapted from F. Xia, 17 1 Outline Background General course information Course contents Information Theory 2 Early NLP Early
LING 572 January 7, 2020 Shane Steinert-Threlkeld
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
“Notify me right away”, “send daily summary”.
might also have the answer.
25
26
▪ Email: yhtian@uw.edu ▪ Office hours:
27
28
29
(s)he is not familiar with.
30
cd hwX/ # suppose hwX is your dir that includes all the files tar -czvf hw.tar.gz *
/dropbox/19-20/572/hw2/check_hw2.sh hw.tar.gz
572/hwX/submit-file-list and 572/languages
31
32
after the grade is posted.
33
34
35
Assignments (hw) Reading assignments Num 9 or 10 4 or 5 Distribution Web and patas Web Discussion Allowed Submission Canvas Due date 11pm every Thurs 11am on Tues or Thurs Late penalty 1%, 10%, 20% No late submission accepted Estimate of hours 10-15 hours 2-4 hours Grading Graded according to the rubrics Checked
➔ 15-25 hours per week; about 20 hrs/week
amount of time on 572, you should take 572 later when you can.
discuss what can be done to reduce time.
36
37
score”.
38
39
40
41
CL, and Speech Recognition
42
43
44
45
46
47
48
49
50
51
52
53
Here, X is a random variable, x is a possible outcome of X.
54
x
55
56
57
58
8
i=1
59
8
i=1
60
8
i=1
61
8
i=1
62
H(X) = −
8
∑
i=1
1/8 log 1/8 = − log 1/8 = 3 bits H(X) = −
8
∑
i=1
p(i)log p(i)
63
H(X) = −
8
∑
i=1
p(i)log p(i) = 1/2 log 1/2 + 1/4 log 1/4 + 1/8 log 1/8 + 1/16 log 1/16 + 4/64 log 1/64 H(X) = −
8
∑
i=1
1/8 log 1/8 = − log 1/8 = 3 bits
64
H(X) = −
8
∑
i=1
1/8 log 1/8 = − log 1/8 = 3 bits H(X) = −
8
∑
i=1
p(i)log p(i) = 2 bits
65
66
x
x
67
x
68
x ∑ y
69
n→∞
n→∞
70
i
i
71
N
72
x y
73 Dulek and Schaffner 2017 See also Cover+Thomas Fig 2.2; MS Fig 2.6
74
75
76
) ( ) , ( ) ( log ) ( ) , ( log ) , ( ) ( log ) , ( ) , ( log ) , ( ) ) ( log ) , ( )(log , ( ) ( / ) , ( log ) , ( ) | ( log ) , ( ) | ( log ) | ( ) ( ) | ( ) ( ) | ( X H Y X H x p x p y x p y x p x p y x p y x p y x p x p y x p y x p x p y x p y x p x y p y x p x y p x y p x p x X Y H x p X Y H
x y x x y x y x y x y x y x y x
− = + = + − = − − = − = − = − = = =
77
) ; ( ) , ( ) ( ) ( ) ( )) ( (log ) ( )) ( (log ) , ( ) , ( ) ( log ) , ( ) ( log ) , ( ) ( log ) , ( ) ( log ) , ( ) , ( log ) , ( ) ( ) ( ) , ( log ) , ( ) ; ( X Y I Y X H Y H X H y p y p x p x p Y X H y x p y p y x p x p Y X H y p y x p x p y x p y x p y x p y p x p y x p y x p Y X I
y x x y y x y x x y x y x y
= − + = − − = − − = − − = =