Distributional Semantics
LING 571 — Deep Processing Methods in NLP November 4, 2019 Shane Steinert-Threlkeld
1
Distributional Semantics LING 571 Deep Processing Methods in NLP - - PowerPoint PPT Presentation
Distributional Semantics LING 571 Deep Processing Methods in NLP November 4, 2019 Shane Steinert-Threlkeld 1 Walking the Walk Ski Chomp = Chomsky! 2 Punny Department 3 Recap: What is a word? Acoustically or orthographically
LING 571 — Deep Processing Methods in NLP November 4, 2019 Shane Steinert-Threlkeld
1
2
3
4
5
6
7
8
9
10
1 2 3 4 5 6
1 2 3 4 5 6
y-axis x-axis
b a
11
1 2 3 4 5 6
1 2 3 4 5 6
y-axis
b a
12
1 2 3 4 5 6
1 2 3 4 5 6
y-axis
Skyscraper Highway Bridge
13 xkcd.com/388
14 xkcd.com/388
WTF, Grapefruit?
15
As You Like It Twelfth Night Julius Caesar Henry V battle
1 1 8 15
soldier
2 2 12 36
fool
37 58 1 5
clown
5 117
16
As You Like It Twelfth Night Julius Caesar Henry V battle
1 1 8 15
soldier
2 2 12 36
fool
37 58 1 5
clown
5 117
Comedic Dramatic
17
5 10 15 20 25 30 5 10 Henry V [5,15] As You Like It [37,1] Julius Caesar [1,8]
Twelfth Night [58,1] 15 40 35 40 45 50 55 60
J&M 3rd ed, 6.3.1 [link]
18
19
20
21
22
23
24
25
26
27
28
29
30
aardvark … computer data pinch result sugar apricot
… 1 1
pineapple
… 1 1
digital
… 2 1 1
information
… 1 6 4
31
32
33
34
1 3 3 16 1 6 cell 2 1 3 1 30 2 11 2 8
s u b j
, a b s
b s u b j
, a d a p t s u b j
, b e h a v e p
j
, i n s i d e p
j
, i n t
m
, a b n
m a l i t y n m
, a r c h i t e c t u r e n m
, a n e m i a
j
, a t t a c k
j
, c a l l
j
, c
e f r
j
, d e c
a t e n m
, b a c t e r i a n m
, b
y n m
, b
e m a r r
… … … …
35
36
37
38
39
40
41
42
aardvark computer data pinch result sugar apricot
1 1
pineapple
1 1
digital
2 1 1
information
1 6 4
43
aardvark computer data pinch result sugar apricot
1 1
pineapple
1 1
digital
2 1 1
information
1 6 4
44
aardvark computer data pinch result sugar apricot
1 1
pineapple
1 1
digital
2 1 1
information
1 6 4
45
aardvark computer data pinch result sugar apricot
1 1
pineapple
1 1
digital
2 1 1
information
1 6 4
46
aardvark computer data pinch result sugar apricot
1 1
pineapple
1 1
digital
2 1 1
information
1 6 4
47
48
49
50
51
euclidean
52
53
prolonged, week-long
54
55
1 2 3 4 tezgüino 1 1 1 1 tequila 1 1 1 1 apricots 1 pizza 1 1
56
Context matrix for tezgüino with w=3
similarities
57
arts boil data function large sugar summarized water Apricot
1 1 1 1
Pineapple
1 1 1 1
Digital
1 1 1 1
Information
1 1 1 1
58
59
c1 c2 c3 c4 c5 m1 m2 m3 m4 human 1 1 interface 1 1 computer 1 1 user 1 1 1 system 1 1 2 response 1 1 time 1 1 EPS 1 1 survey 1 1 trees 1 1 1 graph 1 1 1 minors 1 1