A graphical view of distance between rankings: the Point and Area - - PowerPoint PPT Presentation

a graphical view of distance between rankings the point
SMART_READER_LITE
LIVE PREVIEW

A graphical view of distance between rankings: the Point and Area - - PowerPoint PPT Presentation

A graphical view of distance between rankings: the Point and Area measures Giorgio Maria Di Nunzio and Gianmaria Silvello Dept. of Information Engineering University of Padua IIR 2015 - 6th Italian Information Retrieval Workshop May 25 th ,


slide-1
SLIDE 1

IIR 2015 - 6th Italian Information Retrieval Workshop May 25th, 2015, Cagliari, Italy

A graphical view of distance between rankings: the Point and Area measures

Giorgio Maria Di Nunzio and Gianmaria Silvello

  • Dept. of Information Engineering

University of Padua

slide-2
SLIDE 2

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Outline

  • Classification of rank similarity measures
  • Spearman foot-rule and Kendall distance
  • Point and Area measures
  • Measures of effectiveness
  • Conclusions
slide-3
SLIDE 3

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Comparing ranked list

  • Search engines effectiveness can be measured by

analyzing their visible outcomes

  • lists of documents ranked in descending order of relevance to a

given topic

slide-4
SLIDE 4

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Comparing ranked list

  • Correlation among rankings can be used to assess

the search engines effectiveness

D1 D4 D3 D2 D2 D1 D3 D4 D1 D2 D3 D4 Ideal list List 1 List 2

slide-5
SLIDE 5

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Classification of rank similarity measures

  • Weighted / non-weighted
  • Exchanges in the ordering at the top of the ranking are

more significant than those at the bottom

  • Any perturbation has the same importance
  • Conjoint / non-conjoint
  • Two rankings have the same elements
  • Some elements in one list do not appear in the other
slide-6
SLIDE 6

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Find elements in a ranked list

  • Map index of a document from one list to the other
  • Given k-th element in list r_α, return index of that

element in list r_β

Fα,β(k) = idxβ(rα(k))

D1 D4 D3 D2 D1 D4 D3 D2 k = 2

Fα,β(2) = 4

rα rβ

slide-7
SLIDE 7

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Spearman foot-rule

  • Compute the total element-wise misplacements

between two ranked lists

  • Non-weighted, conjoint

Sα,β(i) =

i

X

k=1

|Fα,β(k) − k|

D1 D4 D3 D2 D1 D4 D3 D2

rα rβ Sα,β(4) = 0 + 2 + 0 + 2

slide-8
SLIDE 8

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Kendall distance

  • Number of adjacent swaps that are necessary to

reorder one list as the other

  • Non-weighted, conjoint

D1 D4 D3 D2 D1 D4 D3 D2

rα rβ Kα,β = X

(i,j):i<j

e Ki,j(rα, rβ) Kα,β = 0 + 2 + 1

slide-9
SLIDE 9

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Spearman and Kendall

Sα,β(i) =

i

X

k=1

|Fα,β(k) − k| Kα,β(i) =

i

X

k=1

(Fα,β(k) − k) + (rα[1 : k] ∩ rβ[(Fα,β(k) + 1) : n]) | {z }

X

slide-10
SLIDE 10

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Point-wise distance

  • Spearman without absolute value (!)
  • Non-weighted, conjoint

D1 D4 D3 D2 D1 D4 D3 D2

rα rβ Pα,β(i) =

i

X

k=1

(Fα,β(k) − k) Pα,β(4) = 0 + 2 + 0 − 2

slide-11
SLIDE 11

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Spearman, Kendall, Point-wise

Sα,β(i) =

i

X

k=1

|Fα,β(k) − k| Kα,β(i) =

i

X

k=1

(Fα,β(k) − k) + (rα[1 : k] ∩ rβ[(Fα,β(k) + 1) : n]) | {z }

X

Pα,β(i) =

i

X

k=1

(Fα,β(k) − k)

slide-12
SLIDE 12

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Visualization analysis

slide-13
SLIDE 13

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Visualization analysis

slide-14
SLIDE 14

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Area-wise distance

  • The area-wise measure considers the area formed by the

segments between two adjacent points (point distance) and the x-axis.

  • h is the height of each trapezoid. It can be tuned to weight

misplacements that occur in different part of the ranking list.

Aα,β(i) =

i

X

k=1

  • Pα,β(k − 1) + Pα,β(k)
  • 2

h

slide-15
SLIDE 15

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Area-wise normalization

  • Divide the area of a relevance list at rank i by the largest
  • btainable area given by the worst possible ranking.
  • It is in the [0,1] range, where 0 indicates the ideal case

and 1 the worst case

nAα,β = Aα,β A∗

α,β

slide-16
SLIDE 16

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Area Correlation (A-corr)

  • A-corr is an indicator of the correlation between two

ranked lists

  • It is in the range [0,1], where 0 indicates that two

lists are not correlated and 1 that they are the same

A-corrα,β = 1 − nAα,β

slide-17
SLIDE 17

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

A-corr as an effectiveness measure (quantitative)

  • We can calculate the point-wise measure by

considering the relevance of documents

PR HR PR NR HR PR PR HR

rα rβ

NR HR NR NR NR NR

slide-18
SLIDE 18

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

A-corr on TREC test collections

MAP A−corr Kendall −0.2 0.2 0.4 0.6 0.8 1

TREC 8

MAP A−corr Kendall −0.2 0.2 0.4 0.6 0.8 1

TREC 10

MAP A−corr Kendall −0.2 0.2 0.4 0.6 0.8 1

TREC 14

slide-19
SLIDE 19

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Point-wise as an effectiveness measure (qualitative)

20 40 60 80 100 120 140 160 180 200 −2500 −2000 −1500 −1000 −500

TREC2001, topic 510 − Qualitative comparison

Rank Spearman − fdut10wac01 Point−wise − fdut10wac01 Kendall − fdut10wac01 Spearman − uwmtaw1 Point−wise − uwmtaw1 Kendall − uwmtaw1

slide-20
SLIDE 20

A graphical view of distance between rankings: the Point and Area measure

  • G. M. Di Nunzio and G. Silvello

Conclusions

  • A correlation and an effectiveness measure for

qualitative and quantitative evaluation

  • We plan to:
  • compare A-corr with the Twist measure (Cumulative

Relative Position)*

  • analyze its stability, sensitivity and correlation with other

measures

  • define a weighted measure to model user behavior

* N. Ferro, G. Silvello, H. Keskustalo, A. Pirkola and K. Jӓrvelin (2015), 


The Twist Measure for IR Evaluation: Taking User’s Effort into Account
 Journal of the Association for Information Science and Technology (JASIST) in print (http://onlinelibrary.wiley.com/doi/10.1002/asi.23416).