A User Study on the Effect of Aggregating Explanations for - - PowerPoint PPT Presentation

a user study on the effect of aggregating explanations
SMART_READER_LITE
LIVE PREVIEW

A User Study on the Effect of Aggregating Explanations for - - PowerPoint PPT Presentation

A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models [work in progress] Josua Krause* , Adam Perer**, Enrico Bertini* Mon, August 20th 2018 * ** Instance Explanations "Why Should I Trust


slide-1
SLIDE 1

A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models

Josua Krause*, Adam Perer**, Enrico Bertini* * ** Mon, August 20th 2018

[work in progress]

slide-2
SLIDE 2

2

"Why Should I Trust You?" Explaining the Predictions of Any Classifier
 Marco Riberio, Sameer Singh, Carlos Guestrin
 International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD 2016)

Instance Explanations

slide-3
SLIDE 3

3

"Why Should I Trust You?" Explaining the Predictions of Any Classifier
 Marco Riberio, Sameer Singh, Carlos Guestrin
 International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD 2016)

Finding Data Biases

slide-4
SLIDE 4

4

Problem: Inspecting single instances
 does not scale well

slide-5
SLIDE 5

5

Solution: Aggregating data and explanations

slide-6
SLIDE 6

6

Solution: Aggregating data and explanations

Correct Incorrect Negative Negative Positive Positive Prediction Ground Truth

slide-7
SLIDE 7

7

Solution: Aggregating data and explanations

Living Area (numeric) Correct Incorrect Negative Negative Positive Positive Prediction Ground Truth

slide-8
SLIDE 8

8

Living Area (numeric) Correct Incorrect Negative Negative Positive Positive Prediction Ground Truth Feature Value

slide-9
SLIDE 9

9

Living Area (numeric) Correct Incorrect Negative Negative Positive Positive Prediction Ground Truth Feature Value Concentration Within Subset

slide-10
SLIDE 10

10

Living Area (numeric) Correct Incorrect Negative Negative Positive Positive Prediction Ground Truth Feature Value Concentration Within Subset Feature Importance

slide-11
SLIDE 11

Sorted by Importance

slide-12
SLIDE 12

What is the impact of aggregation? What is the impact of instance-level explanations? How do those settings affect the ability to detect biases in the data?

12

slide-13
SLIDE 13

13

Four Conditions

Table Histogram No Explanation Explanation

slide-14
SLIDE 14

14

Four Conditions

Table Histogram No Explanation Explanation

slide-15
SLIDE 15

15

Four Conditions

Table Histogram No Explanation Explanation

slide-16
SLIDE 16

16

Four Conditions

Table Histogram No Explanation Explanation

slide-17
SLIDE 17

17

Four Conditions

Table Histogram No Explanation Explanation

slide-18
SLIDE 18

18

Two Data Sets

slide-19
SLIDE 19

19

Two Data Sets

Living Area (numeric) High Price Low Price

slide-20
SLIDE 20

20

Two Data Sets

Model Accuracy: 81.959% Model Accuracy: 88.325%

slide-21
SLIDE 21

21

Individual models:

  • Do you think the predictions of the model make sense?

5 point Likert scale (Not at all – Very much)

  • How well does the model perform in terms of accuracy?

5 point Likert scale (Not much – Very well)

  • How much do you trust the model?

5 point Likert scale (Not at all – Very much)

  • Why do you trust or not trust this model?

Free text answer

Summary: Which model do you prefer?

Multiple choice and text answer

Questions

slide-22
SLIDE 22

22

100 participants 4 conditions (25 each):

  • Table without Explanations (T/N)
  • Table with Explanations (T/E)
  • Histogram without Explanations (H/N)
  • Histogram with Explanations (H/E)

Random model order Correctly identified more accurate model Evaluation metrics: Model preference (trust) Bias detection

Study

slide-23
SLIDE 23

23

Participants Who Trusted the Correct Model

40% 30% 20% 10% 00% T/E H/N H/E

T: Table H: Histogram E: Explanation N: No Explanation

slide-24
SLIDE 24

40% 30% 20% 10% 00% T/E H/N H/E

24

Participants Who Trusted the Correct Model

Significant improvement!

p-value 0.0477 < 0.05

vs.

T: Table H: Histogram E: Explanation N: No Explanation

slide-25
SLIDE 25

40% 30% 20% 10% 00% T/E H/N H/E

25

Participants Who Trusted the Correct Model

p-value 0.0982 > 0.05

vs.

T: Table H: Histogram E: Explanation N: No Explanation

slide-26
SLIDE 26

40% 30% 20% 10% 00% T/E H/N H/E

vs.

Participants Who Trusted the Correct Model

26

"It has higher accuracy so should be more trustworthy than the other one. However some of the results don’t make sense to me. Maybe this is just an atypical property market." "It is accurate, yet the predictions do not make much sense. Higher quality houses having a larger amount of low priced houses, percentage-wise? More rooms, area, or stories resulting in lower prices? The logic does not work out." "larger houses are valued lower than others which are smaller"

T: Table H: Histogram E: Explanation N: No Explanation

slide-27
SLIDE 27

40% 30% 20% 10% 00% T/E H/N H/E

Participants Who Trusted the Correct Model

vs.

27

T: Table H: Histogram E: Explanation N: No Explanation

"If the data says it’s true, then it’s true I suppose and it’s more trustworthy than my common sense." "I feel like the results of [the biased model] where strange even though they where correct according to the dataset." "I’m drawn to trusting the model which was more accurate even though it didn’t entirely make sense to me." 25% of the participants who found the bias did not change their mind!

slide-28
SLIDE 28

40% 30% 20% 10% 00% T/E H/N H/E 50%

28

Participants Who Detected the Bias

p-value 0.0359 < 0.05

vs.

Significant improvement!

T: Table H: Histogram E: Explanation N: No Explanation

slide-29
SLIDE 29

29

Participants Who Detected the Bias

T: Table H: Histogram E: Explanation N: No Explanation

40% 30% 20% 10% 00% T/E H/N H/E 50% T/N

p-value 0.0311 < 0.05

slide-30
SLIDE 30

30

Bootstrapped 95% Confidence Intervals

T: Table H: Histogram E: Explanation N: No Explanation

Number of Hovered Cells Number of Hovered Bars

H/E H/N T/E T/N 100 200 300 400 500 200 400 600 800 1000

slide-31
SLIDE 31

Number of Hovered Cells Number of Hovered Bars

H/E H/N T/E T/N 100 200 300 400 500 200 400 600 800 1000

Bootstrapped 95% Confidence Intervals

Number of Hovered Bars

31

slide-32
SLIDE 32

40% 30% 20% 10% 00% T/E H/N H/E 50% T/N

32

Participants Who Detected the Bias

Similar performance!

T: Table H: Histogram E: Explanation N: No Explanation

slide-33
SLIDE 33

33

vs.

Note that the task was chosen in a way that under all conditions it was possible to find the bias. Histograms scale better to larger data sets or more complex errors in the data. In tables you have to extrapolate...

slide-34
SLIDE 34

Lessons Learned

People trust accuracy (too much). Aggregating instance-level explanations significantly helps detecting biases compared to individual explanations. Individual instance-level explanations may hurt performance.

34

slide-35
SLIDE 35

Further Work

More targeted studies
 to confirm hypotheses Different results for expert users?

35

slide-36
SLIDE 36

A User Study on the Effect of Aggregating Explanations for Interpreting Machine Learning Models

Josua Krause*, Adam Perer**, Enrico Bertini* * **

Thank You!

[work in progress]