Explaining rankings Maartje ter Hoeve University of Amsterdam & - - PowerPoint PPT Presentation

explaining rankings
SMART_READER_LITE
LIVE PREVIEW

Explaining rankings Maartje ter Hoeve University of Amsterdam & - - PowerPoint PPT Presentation

Explaining rankings Maartje ter Hoeve University of Amsterdam & Blendle Maartje ter Hoeve maartje.terhoeve@student.uva.nl C o n t e n t Answering Research Discussion and What and why? Rankings Blendle Related work research


slide-1
SLIDE 1

Explaining rankings

Maartje ter Hoeve University of Amsterdam & Blendle Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-2
SLIDE 2

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and Conclusion Blendle

slide-3
SLIDE 3

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-4
SLIDE 4

I n t r o d u c t i o n

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

What is explainability and why is it needed?

slide-5
SLIDE 5

E x p l a i n a b i l i t y : w h a t a n d w h y ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl Model

slide-6
SLIDE 6

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

?

E x p l a i n a b i l i t y : w h a t a n d w h y ?

slide-7
SLIDE 7

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

E x p l a i n a b i l i t y : w h a t a n d w h y ?

slide-8
SLIDE 8

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

User Developer

E x p l a i n a b i l i t y : w h a t a n d w h y ?

slide-9
SLIDE 9

B u t w h a t i s a n e x p l a n a t i o n ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

An explanation needs to faithfully give the underlying cause of an event

slide-10
SLIDE 10

B u t w h a t i s a n e x p l a n a t i o n ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Justification Provide conceptual explanations that do not necessarily expose the underlying structure of the algorithm Description Provide conceptual explanations that do expose the underlying structure of the algorithm

slide-11
SLIDE 11

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-12
SLIDE 12

R a n k i n g s

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

1 2 3 n - 1 n 4 n - 2

slide-13
SLIDE 13

H o w d o w e e x p l a i n a r a n k i n g ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

231 203 1 157 6 228 3 543 398 2 231 8 432 4 Only looking at the score of an item is not sufficient

slide-14
SLIDE 14

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-15
SLIDE 15

B l e n d l e

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-16
SLIDE 16

B l e n d l e

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Blendle already has heuristic justifications We use these as one of our baselines

slide-17
SLIDE 17

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-18
SLIDE 18

R e s e a r c h q u e s t i o n s

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

RQ2 What way of showing news recommendations reasons do users prefer: textual or visual reasons; a single reason or multiple reasons; apparent or less apparent reasons?

PART 1

RQ1 Do users want to receive explanations of why particular news items are recommended to them?

slide-19
SLIDE 19

R e s e a r c h q u e s t i o n s

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

RQ3 How do we provide users with easy to understand, uncluttered, listwise explanations? RQ4 How do we build an explanation system that produces faithful, model-agnostic explanations for the outcome of a ranking algorithm, yet is scalable so that it can run in real time? RQ5 Does the reading behaviour of users who are provided with model-agnostic listwise explanations for a personalized ranked selection of news articles differ from the reading behaviour of users who are provided with heuristic or pointwise explanations for a personalized ranked selection of news articles?

PART 2

slide-20
SLIDE 20

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-21
SLIDE 21

R e l a t e d w o r k

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

LIME (Ribeiro et al, 2016): find a local, faithful explanation for the decision of any classifier

LIME is a baseline of this research We work with rankings, not classifiers Therefore we bin our ranking scores mLIME

slide-22
SLIDE 22

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-23
SLIDE 23

P a r t 1 - U s e r s t u d y

Single Reason - Visible Single Reason - Invisible Multiple Reasons - Visible Multiple Reasons - Combined Bar chart 179 sent type 1 180 sent type 2 182 sent type 3

1 2 3

41 answered type 1 36 answered type 2 43 answered type 3

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-24
SLIDE 24

R Q 1 D o u s e r s w a n t e x p l a n a t i o n s ?

User wants reasons Times answered Yes 65 Somewhat 24 No 26 I don't know 5 X2 = 14.55, p < 0.001

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-25
SLIDE 25

Single Reason - Visible Single Reason - Invisible Multiple Reasons - Visible Multiple Reasons - Combined Bar chart

R Q 2 P r e f e r e n c e s h o w e x p l a n a t i o n s a r e s h o w n ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-26
SLIDE 26

R Q 2 P r e f e r e n c e s h o w e x p l a n a t i o n s a r e s h o w n ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Transparency Sufficiency Trust Satisfaction

slide-27
SLIDE 27

P a r t 2

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-28
SLIDE 28

R Q 3 H o w d o w e m a k e l i s t w i s e e x p l a n a t i o n s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

231 203 1 157 6 228 3 543 398 2 231 8 432 4 Explain the entire list? Explain items in comparison to other items? Explain which features were important for the position in the ranking?

slide-29
SLIDE 29

R Q 3 H o w d o w e m a k e l i s t w i s e e x p l a n a t i o n s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

231 203 1 157 6 228 3 543 398 2 231 8 432 4 Explain the entire list? Explain items in comparison to other items? Explain which features were important for the position in the ranking?

slide-30
SLIDE 30

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

Which features are most important for the item's position in the ranking?

R Q 3 H o w d o w e m a k e l i s t w i s e e x p l a n a t i o n s ?

slide-31
SLIDE 31

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Main intuition If we change feature values and the ranking changes, then this feature was important

slide-32
SLIDE 32

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Training phase Find how feature values change the ranking Find disruptive distributions and points of interests Use distributions to sample feature values from Find most important features Return most important features as explanations Explaining phase

LISTEN - LISTwise ExplaiNer

slide-33
SLIDE 33

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Training phase

slide-34
SLIDE 34

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-35
SLIDE 35

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 1 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 200 90 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-36
SLIDE 36

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 2 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 200 110 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-37
SLIDE 37

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 3 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 200 160 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-38
SLIDE 38

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 4 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 200 170 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-39
SLIDE 39

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 5 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 200 190 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-40
SLIDE 40

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 6 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 200 195 120 100 180

Find how feature values change the ranking: f0 [1, 2, 3, 4, 5, 6]

slide-41
SLIDE 41

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Find disruptive distributions and points of interests

slide-42
SLIDE 42

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Explanation phase

slide-43
SLIDE 43

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Find most important features

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

200 160 120 100

f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

180 170 160 120 100 180

slide-44
SLIDE 44

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4 f0 f1 f2 f3 f4

Return most important features

slide-45
SLIDE 45

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Is this faithful? Construct dummy data where we know what to expect LISTEN gives the correct results Disruptive distribution approach slightly decreases faithfulness, yet increases speed

slide-46
SLIDE 46

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

This is still not fast enough to run in production

f0 f1 f2 …. fn-1 fn f0 f1 f2 …. fn-1 fn f0 f1 f2 …. fn-1 fn f0 f1 f2 …. fn-1 fn f0 f1 f2 …. fn-1 fn f0 f1 f2 …. fn-1 fn

Q-LISTEN

slide-47
SLIDE 47

R Q 4 H o w d o w e d e s i g n t h i s ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Same holds for mLIME Q-mLIME

f0 f1

. .

fn f0 f1

. .

fn

slide-48
SLIDE 48

R Q 5 H o w d o u s e r s b e h a v e ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

  • A. Heuristic

A/B/C - test

  • B. Q-mLIME
  • C. Q-LISTEN
slide-49
SLIDE 49

R Q 5 H o w d o u s e r s b e h a v e ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-50
SLIDE 50

R Q 5 H o w d o u s e r s b e h a v e ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Results

slide-51
SLIDE 51

R Q 5 H o w d o u s e r s b e h a v e ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Results

Heuristic Q-mLIME Q-LISTEN % reasons seen (of total reads) 3.55 3.54 3.51 % reasons seen (of total reasons) 34.1 32.9 32.0 Reasons per user (of users that see reasons) 1.83 1.86 1.72 Article opened within 2 minutes (% of all reasons seen) 12.1 12.9 10.6 Reasons seen after 20 minutes of opening (% of all reasons seen) 11.0 10.1 10.7

slide-52
SLIDE 52

C o n t e n t

Maartje ter Hoeve maartje.terhoeve@student.uva.nl What and why? Rankings Research questions Related work Answering research questions Discussion and conclusion Blendle

slide-53
SLIDE 53

C o n c l u s i o n a n d d i s c u s s i o n

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

RQ1 Do users want to receive explanations of why particular news items are recommended to them? RQ2 What way of showing news recommendations reasons do users prefer: textual or visual reasons; a single reason or multiple reasons; apparent or less apparent reasons?

PART 1

Yes No specific one

slide-54
SLIDE 54

C o n c l u s i o n a n d d i s c u s s i o n

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

RQ3 How do we provide users with easy to understand, uncluttered, listwise explanations? RQ4 How do we build an explanation system that produces faithful, model-agnostic explanations for the outcome of a ranking algorithm, yet is scalable so that it can run in real time?

PART 2

We look at which features were most important for an item's position in the ranking We find feature importance by changing the feature values and changing the rankings We make disruptive distributions We learn the explanation space with a neural net

slide-55
SLIDE 55

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

RQ5 Does the reading behaviour of users who are provided with model-agnostic listwise explanations for a personalized ranked selection of news articles differ from the reading behaviour of users who are provided with heuristic or pointwise explanations for a personalized ranked selection of news articles?

C o n c l u s i o n a n d d i s c u s s i o n

We do not find any significant differences

slide-56
SLIDE 56

C o n c l u s i o n a n d d i s c u s s i o n

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

Take home message Users clearly state that they want explanations. Therefore, even though their behaviour is not affected by the explanations, still provide them with faithful explanations

slide-57
SLIDE 57

R e a s o n s s e e n

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-58
SLIDE 58

L o s s a n d a c c u r a c y c u r v e s

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-59
SLIDE 59

m L I M E v s L I S T E N

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-60
SLIDE 60

A c t i v e a n d l e s s a c t i v e u s e r s

Maartje ter Hoeve maartje.terhoeve@student.uva.nl

slide-61
SLIDE 61

R Q 5 H o w d o u s e r s b e h a v e ?

Maartje ter Hoeve maartje.terhoeve@student.uva.nl