Align, Disambiguate, and Walk: A Unified Approach for Measuring Semantic Similarity (PowerPoint presentation)
Semantic Similarity
How similar are a pair of lexical items?
Sentence level
- (Tsatsaronis et al., 2010)
- (Kauchak and Barzilay, 2006)
- (Surdeanu et al., 2011)
- (Dagan et al., 2006)
Semantic Similarity
Word level
- (Biran et al., 2011)
Loquacious → Talkative
- Lexical substitution
(McCarthy and Navigli, 2009)
Semantic Similarity
Sense level
- (Snow et al., 2007)
- (Neely et al., 1989)
Existing Similarity Measures
- String-based: Allison and Dix (1986), Gusfield (1997), Wise (1996), Keselj et al. (2003)
- Corpus-based / distributional: Salton and McGill (1983), Landauer et al. (1998), Turney (2007), Gabrilovich and Markovitch (2007), Ramage et al. (2009), Yeh et al. (2009), Radinsky et al. (2011)
- Knowledge-based: Sussna (1993, 1997), Wu and Palmer (1994), Resnik (1995), Jiang and Conrath (1997), Lin (1998), Hirst and St-Onge (1998), Leacock and Chodorow (1998), Patwardhan (2003), Banerjee and Pedersen (2003)
Contribution
Advantage 1: Unified representation
Advantage 2: Cross-level semantic similarity
Advantage 3: Sense-level operation
Outline
How does it work?
Semantic Signature
- Defined over all synsets in WordNet
Personalized PageRank
[Figure, built up over several slides: Personalized PageRank is seeded on the input senses woman(n,1), food(n,1), and fry(v,2); the resulting semantic signature assigns high weight to related synsets such as cooking(n,1), cook(v,3), fat(n,1), french_fries(n,1), dish(n,2), nutriment(n,1), food(n,2), and beverage(n,1).]
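The signature construction can be sketched with a toy random walk. A minimal sketch, assuming a hand-built miniature of the WordNet graph; the sense names, damping factor, and iteration count here are illustrative, not the settings used in the paper:

```python
# Personalized PageRank by power iteration: the walker restarts at the
# seed senses with probability (1 - alpha), so probability mass
# concentrates around the seeds and their semantic neighborhood.
def personalized_pagerank(graph, seeds, alpha=0.85, iters=50):
    """graph: {node: [neighbors]}; seeds: nodes the walk restarts from."""
    nodes = list(graph)
    restart = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    rank = dict(restart)
    for _ in range(iters):
        new = {n: (1 - alpha) * restart[n] for n in nodes}
        for n in nodes:
            out = graph[n]
            if not out:
                continue
            share = alpha * rank[n] / len(out)
            for m in out:
                new[m] += share
        rank = new
    return rank

# Toy WordNet-like graph, seeded on the senses of "fry" and "food":
graph = {
    "fry.v.2": ["cook.v.3", "french_fries.n.1"],
    "cook.v.3": ["fry.v.2", "cooking.n.1"],
    "cooking.n.1": ["cook.v.3", "food.n.1"],
    "food.n.1": ["cooking.n.1", "nutriment.n.1"],
    "french_fries.n.1": ["fry.v.2", "food.n.1"],
    "nutriment.n.1": ["food.n.1"],
}
signature = personalized_pagerank(graph, seeds={"fry.v.2", "food.n.1"})
```

The returned distribution over synsets is the semantic signature; in ADW the same procedure runs over the full WordNet graph.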
Comparing Semantic Signatures
- Cosine
- Weighted Overlap
- Top-k Jaccard
Weighted Overlap
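Weighted Overlap compares the dimensions (synsets) present in both signatures by rank rather than raw probability, so agreement on the top-ranked senses dominates. A minimal sketch, assuming truncated signatures stored as dicts (the toy signatures are hypothetical):

```python
# Weighted Overlap: sum of inverse rank-sums over the shared dimensions,
# normalized by the maximum attainable value (all shared dimensions having
# identical ranks 1, 2, ..., |overlap| in both signatures).
def weighted_overlap(sig1, sig2):
    overlap = set(sig1) & set(sig2)
    if not overlap:
        return 0.0
    # Rank dimensions within each signature by descending weight
    # (ties broken arbitrarily; fine for a sketch).
    rank1 = {s: r for r, s in enumerate(
        sorted(sig1, key=sig1.get, reverse=True), start=1)}
    rank2 = {s: r for r, s in enumerate(
        sorted(sig2, key=sig2.get, reverse=True), start=1)}
    num = sum(1.0 / (rank1[s] + rank2[s]) for s in overlap)
    den = sum(1.0 / (2 * i) for i in range(1, len(overlap) + 1))
    return num / den

sig_a = {"food.n.1": 0.4, "cooking.n.1": 0.3, "fry.v.2": 0.2, "fat.n.1": 0.1}
sig_b = {"food.n.1": 0.5, "dish.n.2": 0.3, "cooking.n.1": 0.2}
score = weighted_overlap(sig_a, sig_b)
```

The measure is symmetric and reaches 1.0 only when the shared dimensions are ranked identically in both signatures.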
Top-k Jaccard
- Example: k = 4
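The Top-k Jaccard comparison can be sketched as follows; k = 4 mirrors the slide's example, and the truncated signatures are hypothetical:

```python
# Top-k Jaccard: keep only the k highest-ranked dimensions of each
# signature and compare the two top-k sets with the Jaccard coefficient.
def topk_jaccard(sig1, sig2, k=4):
    top1 = set(sorted(sig1, key=sig1.get, reverse=True)[:k])
    top2 = set(sorted(sig2, key=sig2.get, reverse=True)[:k])
    return len(top1 & top2) / len(top1 | top2)

# Hypothetical truncated signatures:
sig_a = {"food.n.1": 0.4, "cooking.n.1": 0.3, "fry.v.2": 0.2, "fat.n.1": 0.1}
sig_b = {"food.n.1": 0.5, "dish.n.2": 0.3, "cooking.n.1": 0.2}
score = topk_jaccard(sig_a, sig_b, k=4)
```

Unlike Weighted Overlap, this ignores the ordering inside the top-k sets and only measures their overlap.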
Alignment-based disambiguation
Why is disambiguation needed?
[Figure, animated over several slides: alignment-based disambiguation of a sentence pair. Each content word (manager, fire, worker, ... in one sentence; boss, terminate, employee, work, ... in the other) has several candidate WordNet senses. A word is assigned the sense whose semantic signature best aligns with a sense of some word in the other sentence; competing alignments are scored (e.g., 0.5 vs. 0.3) and the highest-scoring sense is selected, e.g., fire(v,4) through its alignment to terminate. Cf. Tversky (1977), Markman and Gentner (1993).]
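The alignment step can be written down compactly. A minimal sketch: `senses`, `signatures`, and the Jaccard-style `sim` below are hypothetical stand-ins for the WordNet sense inventory, precomputed semantic signatures, and any of the signature comparison measures (e.g., Weighted Overlap):

```python
# Alignment-based disambiguation: each word in sentence 1 is assigned the
# sense whose signature is most similar to the best-matching sense of any
# word in sentence 2.
def disambiguate(words1, words2, senses, signatures, sim):
    chosen = {}
    for w in words1:
        best_sense, best_score = None, -1.0
        for s in senses[w]:
            # Align s to the closest sense of any word in the other sentence.
            score = max(sim(signatures[s], signatures[t])
                        for v in words2 for t in senses[v])
            if score > best_score:
                best_sense, best_score = s, score
        chosen[w] = best_sense
    return chosen

# Hypothetical senses with heavily truncated signatures:
senses = {
    "fire": ["fire.v.1", "fire.v.4"],
    "terminate": ["terminate.v.1"],
}
signatures = {
    "fire.v.1": {"burn.v.1": 0.6, "flame.n.1": 0.4},        # "set on fire"
    "fire.v.4": {"dismiss.v.4": 0.7, "employee.n.1": 0.3},  # "end employment"
    "terminate.v.1": {"dismiss.v.4": 0.5, "end.v.1": 0.5},
}
def sim(a, b):  # Jaccard over shared dimensions, for illustration only
    return len(set(a) & set(b)) / len(set(a) | set(b))

result = disambiguate(["fire"], ["terminate"], senses, signatures, sim)
```

Here "fire" is resolved to its "end employment" sense because that signature shares a dimension with terminate(v,1), while the "set on fire" sense shares none.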
Outline
Experiments
- Semantic Textual Similarity (SemEval-2012)
- Synonymy recognition (TOEFL dataset)
- Correlation-based (RG-65 dataset)
- Coarsening WordNet sense inventory
Experiment 1
Similarity at Sentence Level
- 5 datasets
- Three evaluation measures: ALL, ALLnrm, and Mean
- Comparison systems: UKP2 (Bär et al., 2012), TLsim and TLsyn (Šarić et al., 2012)
Experiment 1
Similarity at Sentence Level
Features:
- Main features: Cosine, Weighted Overlap, Top-k Jaccard
- String-based features: longest common substring, longest common subsequence, greedy string tiling, character/word n-grams
STS Results
Experiment 1: Similarity at Sentence Level
[Figure: bar chart comparing ADW, UKP2, TLsyn, and TLsim across the STS datasets.]
Experiment 2
Similarity at Word Level
Synonym Recognition
- TOEFL dataset (Landauer and Dumais, 1997)
- 80 multiple-choice questions
- Human test takers: only 64.5% accuracy
Accuracy on TOEFL dataset
[Figure: bar chart of system accuracies; axis from 75% to 100%.]
Judgment Correlation
- Dataset: RG-65 (Rubenstein and Goodenough, 1965)
- 65 word pairs, judged by 51 human subjects on a scale of 0 to 4
Spearman correlation on the RG-65 dataset
[Figure: bar chart of system correlations.]
Experiment 3
Similarity at Sense Level
- Coarse-graining WordNet: Navigli (2006), Snow et al. (2007)
Experiment 3
Similarity at Sense Level
- Binary classification: merged or not-merged
- Sense pairs: about 3,500 noun and about 5,000 verb pairs (Kilgarriff, 2001); about 16,000 noun and about 31,000 verb pairs (Hovy et al., 2006)
- Merge a sense pair if its similarity ≥ a tuned threshold
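The merge decision and its F-score evaluation can be sketched as follows; the threshold value and the toy scored pairs below are illustrative, since in the experiments the threshold is tuned on held-out data:

```python
# Each sense pair is predicted "merged" when its signature similarity
# reaches the threshold; predictions are scored against gold labels
# with the F1 measure.
def evaluate_merging(scored_pairs, threshold):
    """scored_pairs: iterable of (similarity, gold_is_merged) tuples."""
    tp = sum(1 for s, g in scored_pairs if s >= threshold and g)
    fp = sum(1 for s, g in scored_pairs if s >= threshold and not g)
    fn = sum(1 for s, g in scored_pairs if s < threshold and g)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Illustrative scored sense pairs (similarity, gold label):
pairs = [(0.9, True), (0.8, True), (0.7, False), (0.2, True), (0.1, False)]
f1 = evaluate_merging(pairs, threshold=0.5)
```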
Experiment 3
Similarity at Sense Level
F-score on OntoNotes dataset
[Chart. Noun: 0.41, 0.42, 0.42, 0.37, 0.22. Verb: 0.52, 0.54, 0.53, 0.45, 0.37.]
Experiment 3
Similarity at Sense Level
F-score on Senseval-2 dataset
[Chart. Noun: 0.44, 0.47, 0.47, 0.42, 0.37. Verb: 0.49, 0.50, 0.49, 0.43, 0.29.]
Conclusions
- A unified approach for computing semantic similarity for any pair of lexical items
- Experiments with state-of-the-art performance:
  - Sense level (sense coarsening)
  - Word level (synonymy recognition and judgment correlation)
  - Sentence level (Semantic Textual Similarity)
Future Directions
- Larger sense inventories (e.g., BabelNet)
- Cross-level semantic similarity
- Create datasets for cross-level similarity (a future SemEval task?)
[Closing slide: the semantic signature of "Thanks for listening", including senses such as thank_you(n,1), thanks(n,1), gratitude(n,1), expression(n,3), listening(n,1), listen(v,1), listen(v,2), heed(v,1), hearing(n,6), ear(n,2), auscultation(n,1), sensing(n,2), and audio-lingual(j,1).]
STS-13

System          HDL     OnWN    FNWN    SMT     Mean    Rank
DKPro           0.735   0.735   0.341   0.323   0.565   6
TakeLab         0.486   0.633   0.269   0.279   0.434   58
ADW (STS-13)    0.621   0.511   0.446   0.384   0.502   34
ADW (All) GP    0.717   0.697   0.411   0.272   0.538   20
ADW (All) LR    0.667   0.735   0.409   0.374   0.565   6