A graphical view of distance between rankings: the Point and Area - PowerPoint PPT Presentation

A graphical view of distance between rankings: the Point and Area measures Giorgio Maria Di Nunzio and Gianmaria Silvello Dept. of Information Engineering University of Padua IIR 2015 - 6th Italian Information Retrieval Workshop May 25 th , 2015, Cagliari, Italy

Outline • Classification of rank similarity measures • Spearman foot-rule and Kendall distance • Point and Area measures • Measures of effectiveness • Conclusions G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Comparing ranked list • Search engines effectiveness can be measured by analyzing their visible outcomes • lists of documents ranked in descending order of relevance to a given topic G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Comparing ranked list • Correlation among rankings can be used to assess the search engines effectiveness Ideal list List 1 List 2 D1 D1 D2 D4 D2 D1 D3 D3 D3 D4 D2 D4 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Classification of rank similarity measures • Weighted / non-weighted • Exchanges in the ordering at the top of the ranking are more significant than those at the bottom • Any perturbation has the same importance • Conjoint / non-conjoint • Two rankings have the same elements • Some elements in one list do not appear in the other G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Find elements in a ranked list • Map index of a document from one list to the other • Given k-th element in list r_ α , return index of that element in list r_ β F α , β ( k ) = idx β ( r α ( k )) r α r β D1 D1 F α , β (2) = 4 k = 2 D2 D4 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Spearman foot-rule • Compute the total element-wise misplacements between two ranked lists • Non-weighted, conjoint i X S α , β ( i ) = |F α , β ( k ) − k | k =1 r α r β D1 D1 D2 D4 S α , β (4) = 0 + 2 + 0 + 2 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Kendall distance • Number of adjacent swaps that are necessary to reorder one list as the other • Non-weighted, conjoint X e K α , β = K i,j ( r α , r β ) ( i,j ): i<j r α r β D1 D1 D2 D4 K α , β = 0 + 2 + 1 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Spearman and Kendall i X S α , β ( i ) = |F α , β ( k ) − k | k =1 i X K α , β ( i ) = ( F α , β ( k ) − k ) + ( r α [1 : k ] ∩ r β [( F α , β ( k ) + 1) : n ]) | {z } k =1 X G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Point-wise distance • Spearman without absolute value (!) • Non-weighted, conjoint i X P α , β ( i ) = ( F α , β ( k ) − k ) k =1 r α r β D1 D1 D2 D4 P α , β (4) = 0 + 2 + 0 − 2 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Spearman, Kendall, Point-wise i X S α , β ( i ) = |F α , β ( k ) − k | k =1 i X K α , β ( i ) = ( F α , β ( k ) − k ) + ( r α [1 : k ] ∩ r β [( F α , β ( k ) + 1) : n ]) | {z } k =1 X i X P α , β ( i ) = ( F α , β ( k ) − k ) k =1 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Visualization analysis G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Area-wise distance • The area-wise measure considers the area formed by the segments between two adjacent points (point distance) and the x-axis. • h is the height of each trapezoid. It can be tuned to weight misplacements that occur in different part of the ranking list. i � � P α , β ( k − 1) + P α , β ( k ) X A α , β ( i ) = h 2 k =1 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Area-wise normalization • Divide the area of a relevance list at rank i by the largest obtainable area given by the worst possible ranking. • It is in the [0,1] range, where 0 indicates the ideal case and 1 the worst case nA α , β = A α , β A ∗ α , β G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Area Correlation (A-corr) • A-corr is an indicator of the correlation between two ranked lists • It is in the range [0,1], where 0 indicates that two lists are not correlated and 1 that they are the same A-corr α , β = 1 − nA α , β G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

A-corr as an effectiveness measure (quantitative) • We can calculate the point-wise measure by considering the relevance of documents r α r β HR PR HR HR PR PR PR NR NR NR NR HR NR NR G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

A-corr on TREC test collections TREC 8 TREC 10 TREC 14 1 1 1 0.8 0.8 0.8 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 0 0 0 − 0.2 − 0.2 − 0.2 MAP A − corr Kendall MAP A − corr Kendall MAP A − corr Kendall G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Point-wise as an effectiveness measure (qualitative) TREC2001, topic 510 − Qualitative comparison 0 − 500 − 1000 − 1500 Spearman − fdut10wac01 Point − wise − fdut10wac01 − 2000 Kendall − fdut10wac01 Spearman − uwmtaw1 Point − wise − uwmtaw1 Kendall − uwmtaw1 − 2500 0 20 40 60 80 100 120 140 160 180 200 Rank G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Conclusions • A correlation and an effectiveness measure for qualitative and quantitative evaluation • We plan to: • compare A-corr with the Twist measure (Cumulative Relative Position)* • analyze its stability, sensitivity and correlation with other measures • define a weighted measure to model user behavior * N. Ferro, G. Silvello, H. Keskustalo, A. Pirkola and K. J ӓ rvelin (2015),   The Twist Measure for IR Evaluation: Taking User’s Effort into Account   Journal of the Association for Information Science and Technology (JASIST) in print ( http://onlinelibrary.wiley.com/doi/10.1002/asi.23416) . G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

A graphical view of distance between rankings: the Point and Area - PowerPoint PPT Presentation

A graphical view of distance between rankings: the Point and Area measures Giorgio Maria Di Nunzio and Gianmaria Silvello Dept. of Information Engineering University of Padua IIR 2015 - 6th Italian Information Retrieval Workshop May 25 th ,

HOW HEALTHY IS OUR COUNTY? 2013 COUNTY HEALTH RANKINGS & ROADMAPS OUTLINE Rankings Background

ROUNDERS (1998) CASINO ROYALE (2006) HAND RANKINGS HIGH CARD HAND RANKINGS PAIR HIGH CARD

Distance Education Distance education used to be about the distance. 1700s 1800s 1900s 2000s

Wheatley and National Rankings May 2013 1 Recent National Rankings Several national High

Mark-recapture distance sampling (MRDS) in Distance 7.1 Setting up Distance for MRDS

Cumbernauld Academy Existing aerial view from west Site Plan Aerial view from South Aerial view

Generalized Distances Between Rankings Ravi Kumar Sergei Vassilvitskii Yahoo! Research

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

Graphical > Tangible? What are their limitations? 93 94 Graphical > Tangible? Graphical

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

10/4/15 Graphical Programming (1) Maze Program TOPICS Graphical Programming Using

Distance in data space Notion of distance (metrics) in data space Who is my closest neighbor?

BUILDING RESIDENT ENGAGEMENT IN THE DELTA REGION A County Health Rankings & Roadmaps Special

VIEW SYSTEM Roberto Beraldi View Users interact with an application mostly through a

Notes about correlation (for Asgn 2) Sharon Goldwater Sharon Goldwater Correlation Overview of

Computing Relevance, Similarity: The Vector Space Model Chapter 27, Part B Based on Larson and

Introduction CSCE CSCE If no label information is available, can still perform 478/878 478/878

Data Mining Techniques: Cluster Analysis Mirek Riedewald Many slides based on presentations by

A Similarity Measure for the ALN Description Logic Nicola Fanizzi, Claudia dAmato Dipartimento

Generalized similarity measures for text data. Hubert Wagner (IST Austria) Joint work with

Passage Based Retrieval (COSC 488) Nazli Goharian nazli@cs.georgetown.edu 1 Passage Based

Measuring distance/ similarity of data objects Multiple data types Records of users

Sambuz

Useful Links

Newsletter

Mail Us

A graphical view of distance between rankings: the Point and Area - PowerPoint PPT Presentation

A graphical view of distance between rankings: the Point and Area measures Giorgio Maria Di Nunzio and Gianmaria Silvello Dept. of Information Engineering University of Padua IIR 2015 - 6th Italian Information Retrieval Workshop May 25 th ,

HOW HEALTHY IS OUR COUNTY? 2013 COUNTY HEALTH RANKINGS &amp; ROADMAPS OUTLINE Rankings Background

ROUNDERS (1998) CASINO ROYALE (2006) HAND RANKINGS HIGH CARD HAND RANKINGS PAIR HIGH CARD

Distance Education Distance education used to be about the distance. 1700s 1800s 1900s 2000s

Wheatley and National Rankings May 2013 1 Recent National Rankings Several national High

Mark-recapture distance sampling (MRDS) in Distance 7.1 Setting up Distance for MRDS

Cumbernauld Academy Existing aerial view from west Site Plan Aerial view from South Aerial view

Generalized Distances Between Rankings Ravi Kumar Sergei Vassilvitskii Yahoo! Research

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

Graphical &gt; Tangible? What are their limitations? 93 94 Graphical &gt; Tangible? Graphical

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

10/4/15 Graphical Programming (1) Maze Program TOPICS Graphical Programming Using

Distance in data space Notion of distance (metrics) in data space Who is my closest neighbor?

BUILDING RESIDENT ENGAGEMENT IN THE DELTA REGION A County Health Rankings &amp; Roadmaps Special

VIEW SYSTEM Roberto Beraldi View Users interact with an application mostly through a

Notes about correlation (for Asgn 2) Sharon Goldwater Sharon Goldwater Correlation Overview of

Computing Relevance, Similarity: The Vector Space Model Chapter 27, Part B Based on Larson and

Introduction CSCE CSCE If no label information is available, can still perform 478/878 478/878

Data Mining Techniques: Cluster Analysis Mirek Riedewald Many slides based on presentations by

A Similarity Measure for the ALN Description Logic Nicola Fanizzi, Claudia dAmato Dipartimento

Generalized similarity measures for text data. Hubert Wagner (IST Austria) Joint work with

Passage Based Retrieval (COSC 488) Nazli Goharian nazli@cs.georgetown.edu 1 Passage Based

Measuring distance/ similarity of data objects Multiple data types Records of users

Sambuz

Useful Links

Newsletter

Mail Us

HOW HEALTHY IS OUR COUNTY? 2013 COUNTY HEALTH RANKINGS & ROADMAPS OUTLINE Rankings Background

Graphical > Tangible? What are their limitations? 93 94 Graphical > Tangible? Graphical

BUILDING RESIDENT ENGAGEMENT IN THE DELTA REGION A County Health Rankings & Roadmaps Special