The European Commissions science and knowledge service Joint - - PowerPoint PPT Presentation

the european commission s science and knowledge service
SMART_READER_LITE
LIVE PREVIEW

The European Commissions science and knowledge service Joint - - PowerPoint PPT Presentation

The European Commissions science and knowledge service Joint Research Centre Why machine learning may lead to unfairness Songl Tolan 1 , Marius Miron 1 , Emilia Gomez 1,2 , Carlos Castillo 2 1 European Commissions Joint Research Centre 2


slide-1
SLIDE 1

The European Commission’s science and knowledge service

Joint Research Centre

slide-2
SLIDE 2

Why machine learning may lead to unfairness

Songül Tolan1, Marius Miron1, Emilia Gomez1,2, Carlos Castillo2

1European Commission’s Joint Research Centre 2Universitat Pompeu Fabra

slide-3
SLIDE 3

Machine learning for decision making

slide-4
SLIDE 4

The criminal justice case

Trade-off: predictive performance vs fairness

slide-5
SLIDE 5

Criminal recidivism

slide-6
SLIDE 6

Criminal recidivism prediction

Human expert Decision / Sentence Prisoner

slide-7
SLIDE 7

Criminal recidivism prediction

Human expert Decision / Sentence Outcome Prisoner

slide-8
SLIDE 8

Criminal recidivism prediction

Human expert Decision / Sentence Outcome Prisoner

slide-9
SLIDE 9

Criminal recidivism prediction

Machine learning model Prediction Outcome Features

slide-10
SLIDE 10

Criminal recidivism prediction

Age at crime Sex Nationality Previous number of crimes Sentence Year of crime Probation Examples of static features:

slide-11
SLIDE 11

Fairness

A decision is fair if it does not discriminate against people based on their membership to a protected group

slide-12
SLIDE 12

Fairness

Age at crime Sex Nationality Previous number of crimes Sentence Year of crime Probation Example of protected features:

slide-13
SLIDE 13

Measuring unfairness

Machine learning model Prediction Outcome Features Nationality Sex

slide-14
SLIDE 14

Measuring unfairness

False positive False negative Prediction Outcome

slide-15
SLIDE 15

False negative rate = Miss rate

Σ Σ

slide-16
SLIDE 16

False positive rate = False alarm rate

Σ Σ

slide-17
SLIDE 17

Group fairness - sex

Σ Σ

sex=Male sex=Male

slide-18
SLIDE 18

False negative rate disparity

How likely it is for a member of a group to be wrongfully labeled as non-recidivist. FNRdisparity= FNRfemale FNRmale

slide-19
SLIDE 19

Headache?

slide-20
SLIDE 20

Too complicated?

The fairness in machine learning literature comprises at least 21 disparity metrics.

slide-21
SLIDE 21

Juvenile recidivism

slide-22
SLIDE 22

Structured Assessment of Violence Risk in Youth (SAVRY)

Risk assessment tools

  • high degree of involvement from human experts
  • open and interpretable (in comparison with COMPAS)
  • 24 risk factors scored low, medium or high
slide-23
SLIDE 23

SAVRY

Early violence Self-harm history Home violence Poor school achievement Stress and poor coping Substance abuse Criminal parent/caregiver Examples of SAVRY features:

slide-24
SLIDE 24

Criminal recidivism prediction

Expert Final expert evaluation Outcome SAVRY features

Σ

SAVRY sum

slide-25
SLIDE 25

Static ML

Machine learning model Prediction Outcome Features

slide-26
SLIDE 26

SAVRY ML

Machine learning model Prediction Outcome SAVRY features

slide-27
SLIDE 27

Static + SAVRY ML

Machine learning model Prediction Outcome Features

slide-28
SLIDE 28

Juvenile offenders in Catalonia1

Dataset

  • 855 people
  • crimes between 2002 -2010, release in 2010
  • age at crime between 12 and 17 years old
  • status followed up on 2013 and 2015

1. Open data: http://cejfe.gencat.cat/en/recerca/opendata/jjuvenil/reincidencia-justicia-menors/index.html

slide-29
SLIDE 29

Training a set of ML methods

Experimental setup

  • logistic regression (logit), multi-layer perceptron (mlp),

support vector machines (lsvm), k-nearest neighbors (knn), random forest (rf), naive bayes (nb)

  • k-fold cross validation with k=10 (10% test, 10% validation,

80% training)

  • we run 50 different experiments with different initial conditions
  • we compute feature importance with LIME1

1. LIME https://github.com/marcotcr/lime

slide-30
SLIDE 30

Predictive performance - AUC ROC

slide-31
SLIDE 31

Results, predictive performance AUC

SAVRY Sum has 0.64 AUC Expert has 0.66 AUC

slide-32
SLIDE 32

Results: disparity, sex

False alarm rates Miss rates

slide-33
SLIDE 33

Results: disparity, sex

False alarm rates Miss rates

slide-34
SLIDE 34

Results: disparity, sex

False alarm rates Miss rates

slide-35
SLIDE 35

Results: disparity, sex

False alarm rates Miss rates

slide-36
SLIDE 36

Results: disparity, sex

False alarm rates Miss rates

slide-37
SLIDE 37

Results: disparity, sex

False alarm rates Miss rates

slide-38
SLIDE 38

Results: disparity, nationality

False alarm rates Miss rates

slide-39
SLIDE 39

Results: disparity, nationality

False alarm rates Miss rates

slide-40
SLIDE 40

Results: disparity, nationality

False alarm rates Miss rates

slide-41
SLIDE 41

Results: disparity, nationality

False alarm rates Miss rates

slide-42
SLIDE 42

Results: disparity, nationality

False alarm rates Miss rates

slide-43
SLIDE 43

Results: feature importance for logit

slide-44
SLIDE 44

Results: feature importance for mlp

slide-45
SLIDE 45

Results: difference in base rates (prevalence)

slide-46
SLIDE 46

Results: difference in base rates

slide-47
SLIDE 47

Results: difference in base rates

slide-48
SLIDE 48

Conclusions

  • ML models have better predictive performance
  • ML models tend to discriminate more
  • static features outweigh SAVRY features as importance
  • preliminary study: the cause may be in the data (base rates)
slide-49
SLIDE 49

We propose a methodology and a ML framework1

Contributions

  • to easily train ML models on tabular data (csv files)
  • to evaluate these models in terms of predictive

performance and fairness

  • to connect to interpretability frameworks
  • to reproduce with ease results and research

1. Open framework: https://gitlab.com/HUMAINT/humaint-fatml

slide-50
SLIDE 50

Thank you!

Any questions?

You can find me at @nkundiushuti & marius.miron@ec.europa.eu & mariusmiron.com