Predicting Semantic Relations using Global Graph Properties
Yuval Pinter and Jacob Eisenstein @yuvalpi
@jacobeisenstein
code: github.com/yuvalpinter/m3gm contact: uvp@gatech.edu
Predicting Semantic Relations using Global Graph Properties Yuval - - PowerPoint PPT Presentation
Predicting Semantic Relations using Global Graph Properties Yuval Pinter and Jacob Eisenstein @yuvalpi @jacobeisenstein code: github.com/yuvalpinter/m3gm contact: uvp@gatech.edu Semantic Graphs WordNet -like resources are curated to
Yuval Pinter and Jacob Eisenstein @yuvalpi
@jacobeisenstein
code: github.com/yuvalpinter/m3gm contact: uvp@gatech.edu
describe relations between word senses
○ Edges have form <S, r, T>: <zebra, is-a, equine> ○ Still, some relations are symmetric
○ Hypernym (is-a) <zebra, r, equine> ○ Meronym (is-part-of) <tree, r, forest> ○ Is-instance-of <rome, r, capital> ○ Derivational Relatedness <nice, r, nicely>
3 mammal canine equine horse zebra wolf fenec
scoring edges
4
equine zebra hypernym
scoring edges
5
equine zebra hypernym Translational Embeddings (transE) [Bordes et al. 2013]
scoring edges
6
equine zebra hypernym Full-Bilinear (Bilin) [Nickel et al. 2011]
scoring edges
graphs
7 mammal equine horse zebra canine equine zebra
○ Only takes care of impossible graphs ○ Requires domain knowledge
8
○ Only takes care of impossible graphs ○ Requires domain knowledge
9
graphs that have a fixed number n of nodes
Weights vector Graph features
10
graphs that have a fixed number n of nodes
Weights vector Graph features
11
12
1 4 2 5 6 3
13
1 4 2 5 6 3
14
1 4 2 5 6 3
15
1 4 2 5 6 3
16
1 4 2 5 6 3
17
1 4 2 5 6 3 (some) joint blue/orange motifs:
18
1 4 2 5 6 3 (some) joint blue/orange motifs:
19
(some) joint blue/orange motifs:
1 4 2 5 6 3
20
1 4 2 5 6 3 (some) joint blue/orange motifs:
is implausible
○ Number of possible directed graphs with n nodes: O(exp(n2)) ○ n nodes, R relations: O(exp(R*n2)) ○ Estimation begins to be hard at ~n=100 for R=1. In WordNet: n = 40K, R = 11.
21
is implausible
○ Number of possible directed graphs with n nodes: O(exp(n2)) ○ n nodes, R relations: O(exp(R*n2)) ○ Estimation begins to be hard at ~n=100 for R=1. In WordNet: n = 40K, R = 11.
algorithm either
22
is implausible
○ Number of possible directed graphs with n nodes: O(exp(n2)) ○ n nodes, R relations: O(exp(R*n2)) ○ Estimation begins to be hard at ~n=100 for R=1. In WordNet: n = 40K, R = 11.
algorithm either What can we do?
23
neighborhood” of the true WN
24
neighborhood” of the true WN
25
neighborhood” of the true WN
26
neighborhood” of the true WN
27
neighborhood” of the true WN
28
proposal distribution (source of the negative samples)
s v t v v v
29
proposal distribution (source of the negative samples)
s v t v v v
30
○ No reciprocal relations (hypernym ⇔ hyponym) ○ Still includes symmetric relations
○ Used in all models as default for symmetric relations
○ Synset embeddings - averaged from FastText
○ ~ 3000 motifs, ~900 non-zero
31
○ No reciprocal relations (hypernym ⇔ hyponym) ○ Still includes symmetric relations
○ Used in all models as default for symmetric relations
○ Synset embeddings - averaged from FastText
○ ~ 3000 motifs, ~900 non-zero
32
transE DistMult Bilin
○ No reciprocal relations (hypernym ⇔ hyponym) ○ Still includes symmetric relations
○ Used in all models as default for symmetric relations
○ Synset embeddings - averaged from FastText
○ ~ 3000 motifs, ~900 non-zero
33
transE
34 [Dettmers et al. 2018] [Nguyen et al. 2018] [Bordes et al. 2013] [Trouillon et al. 2016]
○ Targets of has_part ○ Two-paths hypernym → derivationally_related_form
○ Targets of hypernym ○ Two-cycles of hypernym ○ Target of both has_part and verb_group
35
○ Targets of has_part ○ Two-paths hypernym → derivationally_related_form
○ Targets of hypernym ○ Two-cycles of hypernym ○ Target of both has_part and verb_group
vienna france austria european union germany ...
Seen in training data Local-only prediction M3GM prediction Unseen in data
36
○ Targets of has_part ○ Two-paths hypernym → derivationally_related_form
○ Targets of hypernym ○ Two-cycles of hypernym ○ Target of both has_part and verb_group
indian lettuce lettuce herb garden lettuce ... ... ... ...
Seen in training data Local-only prediction M3GM prediction
37
○ Targets of has_part ○ Two-paths hypernym → derivationally_related_form
○ Targets of hypernym ○ Two-cycles of hypernym ○ Target of both has_part and verb_group
38 mammal equine
○ Targets of has_part ○ Two-paths hypernym → derivationally_related_form
○ Targets of hypernym ○ Two-cycles of hypernym ○ Target of both has_part and verb_group
Hypernym
39
“Derivations occur in the abstract parts of the graph”
(bodega / canteen vs. shop)
○ Targets of has_part ○ Two-paths hypernym → derivationally_related_form
○ Targets of hypernym ○ Two-cycles of hypernym ○ Target of both has_part and verb_group Nouns Verbs
40
קנוי יבלכיסוס סוסהרבזבאזקנפ
41 mammal canine equine horse zebra wolf fenec
קנוי יבלכיסוס סוסהרבזבאזקנפ
42 mammal canine equine horse zebra wolf fenec
קנוי יבלכיסוס סוסהרבזבאזקנפ
43 mammal canine equine horse zebra wolf fenec
44
@Georgia Tech
45
code + bonus WordNet analysis tools: github.com/yuvalpinter/m3gm contact: uvp@gatech.edu
@Georgia Tech
Fellowship Program
46
code + bonus WordNet analysis tools: github.com/yuvalpinter/m3gm contact: uvp@gatech.edu
@Georgia Tech
Fellowship Program
47
code + bonus WordNet analysis tools: github.com/yuvalpinter/m3gm contact: uvp@gatech.edu