SLIDE 1 Ethical Machine Learning
Taking “Don’t be Evil” Literally Katharine Jarmul #QCONSP Kjamistan.com
SLIDE 2 I Can’t Breathe: The Killing of Eric Garner
Joe Raedle/Getty Images
SLIDE 3
“Broken Windows” Policing
SLIDE 4
Disparate Impact Pr(C = YES|X = 0) Pr(C = YES|X = 1) ≤ τ = 0.8
SLIDE 5 Predictive Policing: Runaway Feedback Loops
Ensign et al, 2017
SLIDE 6
If our models mimic current police behavior, are we creating a valid model?
SLIDE 7
If our models mimic social inequalities and prejudice, are we creating a valid model?
SLIDE 8
Are social inequalities and prejudice valid?
SLIDE 9
Breaking the Cycle: Determining if Your Data has Prejudice
SLIDE 10
FairTest: Evaluating Correlations to Sensitive Attributes
SLIDE 11 GenderShades: Creating Better Datasets
GenderShades.org
SLIDE 12
NLP: Looking at Word Vector Correlations
SLIDE 13
SLIDE 14 NLP: Google News Vectors
https://blog.kjamistan.com/embedded-isms-in-vector-based-natural-language-processing/
SLIDE 15 Debiasing Word Vectors
https://github.com/tolga-b/debiaswe (Bolukbasi, Chang, Zou, Saligrama and Kalai, 2016)
SLIDE 16
Modeling Fairness: Evaluating Models for Prejudice
SLIDE 17 Defining Fair
https://algorithmicfairness.wordpress.com/
SLIDE 18 Evaluating Fair
https://blog.godatadriven.com/fairness-in-ml/
SLIDE 19 NLP: Testing Bias
https://developers.googleblog.com/2018/04/text-embedding-models-contain-bias.html
SLIDE 20 Interpreting Our Models
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention Xu et al., 2016
SLIDE 21
Radical Transparency: Promoting Conversation & Accountability
SLIDE 22 Talking Fair
https://www.fatml.org/
SLIDE 23 Acting Fair: Building Accountable Applications
https://2017.ind.ie/ethical-design/
SLIDE 24
Ethical Machine Learning: Taking a Logical Stance against Oppression
SLIDE 25 Ethical ML Takeaways
- Doing “nothing” assumes prejudice and unfair treatment is a valid action
- We need better data
- Diverse data which better reflects the real world
- Stop using datasets which are non-representative
- We need built-in ethics-driven evaluation criteria
- Scikit-learn disparate impact?
- Scikit-learn equal odds / opportunity?
- You can contribute
- pen-source your work and datasets
- volunteer with the Algorithmic Justice League or local organization
SLIDE 26 Thanks!
Questions?
- Now?
- Later?
- @kjam
- katharine@kiprotect.com