Explaining rankings Maartje ter Hoeve University of Amsterdam & - PowerPoint PPT Presentation

Explaining rankings Maartje ter Hoeve University of Amsterdam & Blendle Maartje ter Hoeve maartje.terhoeve@student.uva.nl

C o n t e n t Answering Research Discussion and What and why? Rankings Blendle Related work research questions Conclusion questions Maartje ter Hoeve maartje.terhoeve@student.uva.nl

C o n t e n t Answering Research Discussion and What and why? Rankings Blendle Related work research questions conclusion questions Maartje ter Hoeve maartje.terhoeve@student.uva.nl

I n t r o d u c t i o n What is explainability and why is it needed? Maartje ter Hoeve maartje.terhoeve@student.uva.nl

E x p l a i n a b i l i t y : w h a t a n d w h y ? Model Maartje ter Hoeve maartje.terhoeve@student.uva.nl

E x p l a i n a b i l i t y : w h a t a n d w h y ? ? Maartje ter Hoeve maartje.terhoeve@student.uva.nl

E x p l a i n a b i l i t y : w h a t a n d w h y ? Maartje ter Hoeve maartje.terhoeve@student.uva.nl

E x p l a i n a b i l i t y : w h a t a n d w h y ? User Developer Maartje ter Hoeve maartje.terhoeve@student.uva.nl

B u t w h a t i s a n e x p l a n a t i o n ? An explanation needs to faithfully give the underlying cause of an event Maartje ter Hoeve maartje.terhoeve@student.uva.nl

B u t w h a t i s a n e x p l a n a t i o n ? Justification Provide conceptual explanations that do not necessarily expose the underlying structure of the algorithm Description Provide conceptual explanations that do expose the underlying structure of the algorithm Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R a n k i n g s 1 2 3 4 n - 2 n - 1 n Maartje ter Hoeve maartje.terhoeve@student.uva.nl

H o w d o w e e x p l a i n a r a n k i n g ? Only looking at the score of an item is not sufficient 231 543 228 432 203 398 157 231 6 8 3 4 1 2 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

B l e n d l e Maartje ter Hoeve maartje.terhoeve@student.uva.nl

B l e n d l e Blendle already has heuristic justifications We use these as one of our baselines Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R e s e a r c h q u e s t i o n s PART 1 RQ1 Do users want to receive explanations of why particular news items are recommended to them? RQ2 What way of showing news recommendations reasons do users prefer: textual or visual reasons; a single reason or multiple reasons; apparent or less apparent reasons? Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R e s e a r c h q u e s t i o n s PART 2 RQ3 How do we provide users with easy to understand, uncluttered, listwise explanations? RQ4 How do we build an explanation system that produces faithful, model-agnostic explanations for the outcome of a ranking algorithm, yet is scalable so that it can run in real time? RQ5 Does the reading behaviour of users who are provided with model-agnostic listwise explanations for a personalized ranked selection of news articles differ from the reading behaviour of users who are provided with heuristic or pointwise explanations for a personalized ranked selection of news articles? Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R e l a t e d w o r k LIME (Ribeiro et al, 2016): find a local, faithful explanation for the decision of any classifier LIME is a baseline of this research We work with rankings, not classifiers Therefore we bin our ranking scores mLIME Maartje ter Hoeve maartje.terhoeve@student.uva.nl

P a r t 1 - U s e r s t u d y 1 2 3 Single Reason - Visible Single Reason - Invisible Multiple Reasons - Visible Multiple Reasons - Combined Bar chart 179 sent type 1 41 answered type 1 180 sent type 2 36 answered type 2 182 sent type 3 43 answered type 3 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 1 D o u s e r s w a n t e x p l a n a t i o n s ? User wants reasons Times answered Yes 65 Somewhat 24 No 26 I don't know 5 X 2 = 14.55, p < 0.001 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 2 P r e f e r e n c e s h o w e x p l a n a t i o n s a r e s h o w n ? Single Reason - Visible Single Reason - Invisible Multiple Reasons - Visible Multiple Reasons - Combined Bar chart Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 2 P r e f e r e n c e s h o w e x p l a n a t i o n s a r e s h o w n ? Transparency Sufficiency Trust Satisfaction Maartje ter Hoeve maartje.terhoeve@student.uva.nl

P a r t 2 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 3 H o w d o w e m a k e l i s t w i s e e x p l a n a t i o n s ? Explain the entire list? 231 543 228 432 Explain items in comparison to other items? 203 398 Explain which features were important for the 157 231 position in the ranking? 6 8 3 4 1 2 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 3 H o w d o w e m a k e l i s t w i s e e x p l a n a t i o n s ? Which features are most important for the item's position in the ranking? f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 4 H o w d o w e d e s i g n t h i s ? Main intuition If we change feature values and the ranking changes, then this feature was important Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 4 H o w d o w e d e s i g n t h i s ? Training phase Find how feature values change the ranking Find disruptive distributions and points of interests LISTEN - LISTwise ExplaiNer Explaining phase Use distributions to sample feature values from Find most important features Return most important features as explanations Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 4 H o w d o w e d e s i g n t h i s ? Training phase Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 4 H o w d o w e d e s i g n t h i s ? Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6] f0 f1 f2 f3 f4 200 f0 f1 f2 f3 f4 180 160 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 120 100 f0 f1 f2 f3 f4 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 4 H o w d o w e d e s i g n t h i s ? Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6] f0 f1 f2 f3 f4 200 f0 f1 f2 f3 f4 200 f0 f1 f2 f3 f4 180 f0 f1 f2 f3 f4 180 160 90 f0 f1 f2 f3 f4 1 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 120 120 100 100 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

R Q 4 H o w d o w e d e s i g n t h i s ? Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6] f0 f1 f2 f3 f4 200 f0 f1 f2 f3 f4 200 f0 f1 f2 f3 f4 180 f0 f1 f2 f3 f4 180 160 110 f0 f1 f2 f3 f4 2 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 120 120 100 100 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Explaining rankings Maartje ter Hoeve University of Amsterdam & - PowerPoint PPT Presentation

Explaining rankings Maartje ter Hoeve University of Amsterdam & Blendle Maartje ter Hoeve maartje.terhoeve@student.uva.nl C o n t e n t Answering Research Discussion and What and why? Rankings Blendle Related work research

HOW HEALTHY IS OUR COUNTY? 2013 COUNTY HEALTH RANKINGS & ROADMAPS OUTLINE Rankings Background

ROUNDERS (1998) CASINO ROYALE (2006) HAND RANKINGS HIGH CARD HAND RANKINGS PAIR HIGH CARD

Wheatley and National Rankings May 2013 1 Recent National Rankings Several national High

Explaining Deep Learning Predictions and Isaac Ahern Integrating Domain Ontologies Outline

Explaining Type Errors Brent Yorgey Richard Eisenberg Harley Eades Off the Beaten Track 13

BUILDING RESIDENT ENGAGEMENT IN THE DELTA REGION A County Health Rankings & Roadmaps Special

MAKING THE MOST OF THE 2020 COUNTY HEALTH RANKINGS Exploring new interactive features, tips to

Learning diverse rankings with multi-armed bandits Radlinski, Kleinberg & Joachims. ICML

(Bayesian) Statistics with Rankings Marina Meil a University of Washington

Generalized Distances Between Rankings Ravi Kumar Sergei Vassilvitskii Yahoo! Research

Clustering Rankings in the Fourier Domain Stphan Clmenon and Romaric Gaudel and Jrmie

NRC Sociology Rankings Andrew J. Perrin November 3, 2010 Andrew J. Perrin () NRC Sociology

Creating A Resilient Community A County Health Rankings & Roadmaps Webinar

Attachments: Slides 1-14 1. WB EODB Report Rankings, ASEAN-6, 2006-2020 (graph) 2. WB EODB Report

Quality of Similarity Rankings in Time Series T. Bernecker, in Time Series M. E. Houle, H.-P.

WE WILL BE STARTING THE WEBINAR SOON CRACKING THE CODE TO AMAZON BESTSELLER RANKINGS HOW TO

Degree Progress Reports Whats been happening? DPR and Transfer Credit project started in July

SABER: Module-LWR based KEM Round 2 J. P. DAnvers A. Karmakar S. S. Roy F. Vercauteren KU

Getting the Measure of Child Wellbeing with Rumbles Quest A Tool for Schools and Their

RPAS Swarms in Disaster Management Missions Efficient Deployment through Optimized Mission

Audit Partner Characteristics, Audit Quality and Audit Pricing in the U.S. Ally Zimmerman,

Zero-query information retrieval system no explicit query from user IR triggered by

Publier dans une revue anglophone Le research design Stylistic Guidelines Outline Key

Elliptic Curves and the State of Survaillence Aleksander Horawa Imperial College London February

Sambuz

Useful Links

Newsletter

Mail Us

Explaining rankings Maartje ter Hoeve University of Amsterdam & - PowerPoint PPT Presentation

Explaining rankings Maartje ter Hoeve University of Amsterdam & Blendle Maartje ter Hoeve maartje.terhoeve@student.uva.nl C o n t e n t Answering Research Discussion and What and why? Rankings Blendle Related work research

HOW HEALTHY IS OUR COUNTY? 2013 COUNTY HEALTH RANKINGS &amp; ROADMAPS OUTLINE Rankings Background

ROUNDERS (1998) CASINO ROYALE (2006) HAND RANKINGS HIGH CARD HAND RANKINGS PAIR HIGH CARD

Wheatley and National Rankings May 2013 1 Recent National Rankings Several national High

Explaining Deep Learning Predictions and Isaac Ahern Integrating Domain Ontologies Outline

Explaining Type Errors Brent Yorgey Richard Eisenberg Harley Eades Off the Beaten Track 13

BUILDING RESIDENT ENGAGEMENT IN THE DELTA REGION A County Health Rankings &amp; Roadmaps Special

MAKING THE MOST OF THE 2020 COUNTY HEALTH RANKINGS Exploring new interactive features, tips to

Learning diverse rankings with multi-armed bandits Radlinski, Kleinberg &amp; Joachims. ICML

(Bayesian) Statistics with Rankings Marina Meil a University of Washington

Generalized Distances Between Rankings Ravi Kumar Sergei Vassilvitskii Yahoo! Research

Clustering Rankings in the Fourier Domain Stphan Clmenon and Romaric Gaudel and Jrmie

NRC Sociology Rankings Andrew J. Perrin November 3, 2010 Andrew J. Perrin () NRC Sociology

Creating A Resilient Community A County Health Rankings &amp; Roadmaps Webinar

Attachments: Slides 1-14 1. WB EODB Report Rankings, ASEAN-6, 2006-2020 (graph) 2. WB EODB Report

Quality of Similarity Rankings in Time Series T. Bernecker, in Time Series M. E. Houle, H.-P.

WE WILL BE STARTING THE WEBINAR SOON CRACKING THE CODE TO AMAZON BESTSELLER RANKINGS HOW TO

Degree Progress Reports Whats been happening? DPR and Transfer Credit project started in July

SABER: Module-LWR based KEM Round 2 J. P. DAnvers A. Karmakar S. S. Roy F. Vercauteren KU

Getting the Measure of Child Wellbeing with Rumbles Quest A Tool for Schools and Their

RPAS Swarms in Disaster Management Missions Efficient Deployment through Optimized Mission

Audit Partner Characteristics, Audit Quality and Audit Pricing in the U.S. Ally Zimmerman,

Zero-query information retrieval system no explicit query from user IR triggered by

Publier dans une revue anglophone Le research design Stylistic Guidelines Outline Key

Elliptic Curves and the State of Survaillence Aleksander Horawa Imperial College London February

Sambuz

Useful Links

Newsletter

Mail Us

HOW HEALTHY IS OUR COUNTY? 2013 COUNTY HEALTH RANKINGS & ROADMAPS OUTLINE Rankings Background

BUILDING RESIDENT ENGAGEMENT IN THE DELTA REGION A County Health Rankings & Roadmaps Special

Learning diverse rankings with multi-armed bandits Radlinski, Kleinberg & Joachims. ICML

Creating A Resilient Community A County Health Rankings & Roadmaps Webinar