a graphical view of distance between rankings the point
play

A graphical view of distance between rankings: the Point and Area - PowerPoint PPT Presentation

A graphical view of distance between rankings: the Point and Area measures Giorgio Maria Di Nunzio and Gianmaria Silvello Dept. of Information Engineering University of Padua IIR 2015 - 6th Italian Information Retrieval Workshop May 25 th ,


  1. A graphical view of distance between rankings: the Point and Area measures Giorgio Maria Di Nunzio and Gianmaria Silvello Dept. of Information Engineering University of Padua IIR 2015 - 6th Italian Information Retrieval Workshop May 25 th , 2015, Cagliari, Italy

  2. Outline • Classification of rank similarity measures • Spearman foot-rule and Kendall distance • Point and Area measures • Measures of effectiveness • Conclusions G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  3. Comparing ranked list • Search engines effectiveness can be measured by analyzing their visible outcomes • lists of documents ranked in descending order of relevance to a given topic G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  4. Comparing ranked list • Correlation among rankings can be used to assess the search engines effectiveness Ideal list List 1 List 2 D1 D1 D2 D4 D2 D1 D3 D3 D3 D4 D2 D4 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  5. Classification of rank similarity measures • Weighted / non-weighted • Exchanges in the ordering at the top of the ranking are more significant than those at the bottom • Any perturbation has the same importance • Conjoint / non-conjoint • Two rankings have the same elements • Some elements in one list do not appear in the other G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  6. Find elements in a ranked list • Map index of a document from one list to the other • Given k-th element in list r_ α , return index of that element in list r_ β F α , β ( k ) = idx β ( r α ( k )) r α r β D1 D1 F α , β (2) = 4 k = 2 D2 D4 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  7. Spearman foot-rule • Compute the total element-wise misplacements between two ranked lists • Non-weighted, conjoint i X S α , β ( i ) = |F α , β ( k ) − k | k =1 r α r β D1 D1 D2 D4 S α , β (4) = 0 + 2 + 0 + 2 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  8. Kendall distance • Number of adjacent swaps that are necessary to reorder one list as the other • Non-weighted, conjoint X e K α , β = K i,j ( r α , r β ) ( i,j ): i<j r α r β D1 D1 D2 D4 K α , β = 0 + 2 + 1 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  9. Spearman and Kendall i X S α , β ( i ) = |F α , β ( k ) − k | k =1 i X K α , β ( i ) = ( F α , β ( k ) − k ) + ( r α [1 : k ] ∩ r β [( F α , β ( k ) + 1) : n ]) | {z } k =1 X G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  10. Point-wise distance • Spearman without absolute value (!) • Non-weighted, conjoint i X P α , β ( i ) = ( F α , β ( k ) − k ) k =1 r α r β D1 D1 D2 D4 P α , β (4) = 0 + 2 + 0 − 2 D3 D3 D4 D2 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  11. Spearman, Kendall, Point-wise i X S α , β ( i ) = |F α , β ( k ) − k | k =1 i X K α , β ( i ) = ( F α , β ( k ) − k ) + ( r α [1 : k ] ∩ r β [( F α , β ( k ) + 1) : n ]) | {z } k =1 X i X P α , β ( i ) = ( F α , β ( k ) − k ) k =1 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  12. Visualization analysis G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  13. Visualization analysis G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  14. Area-wise distance • The area-wise measure considers the area formed by the segments between two adjacent points (point distance) and the x-axis. • h is the height of each trapezoid. It can be tuned to weight misplacements that occur in different part of the ranking list. i � � P α , β ( k − 1) + P α , β ( k ) X A α , β ( i ) = h 2 k =1 G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  15. Area-wise normalization • Divide the area of a relevance list at rank i by the largest obtainable area given by the worst possible ranking. • It is in the [0,1] range, where 0 indicates the ideal case and 1 the worst case nA α , β = A α , β A ∗ α , β G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  16. Area Correlation (A-corr) • A-corr is an indicator of the correlation between two ranked lists • It is in the range [0,1], where 0 indicates that two lists are not correlated and 1 that they are the same A-corr α , β = 1 − nA α , β G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  17. A-corr as an effectiveness measure (quantitative) • We can calculate the point-wise measure by considering the relevance of documents r α r β HR PR HR HR PR PR PR NR NR NR NR HR NR NR G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  18. A-corr on TREC test collections TREC 8 TREC 10 TREC 14 1 1 1 0.8 0.8 0.8 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 0 0 0 − 0.2 − 0.2 − 0.2 MAP A − corr Kendall MAP A − corr Kendall MAP A − corr Kendall G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  19. Point-wise as an effectiveness measure (qualitative) TREC2001, topic 510 − Qualitative comparison 0 − 500 − 1000 − 1500 Spearman − fdut10wac01 Point − wise − fdut10wac01 − 2000 Kendall − fdut10wac01 Spearman − uwmtaw1 Point − wise − uwmtaw1 Kendall − uwmtaw1 − 2500 0 20 40 60 80 100 120 140 160 180 200 Rank G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

  20. Conclusions • A correlation and an effectiveness measure for qualitative and quantitative evaluation • We plan to: • compare A-corr with the Twist measure (Cumulative Relative Position)* • analyze its stability, sensitivity and correlation with other measures • define a weighted measure to model user behavior * N. Ferro, G. Silvello, H. Keskustalo, A. Pirkola and K. J ӓ rvelin (2015), 
 The Twist Measure for IR Evaluation: Taking User’s Effort into Account 
 Journal of the Association for Information Science and Technology (JASIST) in print ( http://onlinelibrary.wiley.com/doi/10.1002/asi.23416) . G. M. Di Nunzio and G. Silvello A graphical view of distance between rankings: the Point and Area measure

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend