Detecting and Reducing Social Discrimination in Machine Learning Models


  1. Detecting and Reducing Social Discrimination in Machine Learning Models New York Open Stats Meetup Niels Bantilan 12/14/2017

  2. What is bias?

  3. What is discrimination? Inmate recidivism risk model Source: https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing

  4. Bias is an amoral concept: “the preference for or against something.” Discrimination is a moral, socially contextual concept: “when an action is based on biases that lead to the systematic disenfranchisement of people based on arbitrary group membership.” Fairness is the “opposite” of discrimination (according to a specific definition of discrimination).

  5. [Diagram: Biased Data, Biased Algorithm, Biased Predictions, Biased Decisions.]

  6. Problem: machine learning models optimized only for accuracy reflect and amplify real-world social biases.

  7. Themis-ml (thee-mus em-el): an open source library that implements a fairness-aware machine learning interface (FMLI) to measure and reduce social bias in machine learning algorithms. Available on GitHub: https://github.com/cosmicBboy/themis-ml

  8. FMLI Design Principles. Model flexibility: users have varying degrees of control over the model training process. Fairness as performance: build tooling that enables users to optimize for both accuracy and fairness. Transparency of the fairness-utility tradeoff: enable the user to assess the impact of fairness-aware methods on predictive power.
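  The pipeline slides that follow refer to Scorer, Transformer, Predictor, and Estimator APIs. The sketch below is a hypothetical illustration of what a scikit-learn-style fairness-aware interface along these lines could look like; the class and method names are invented for this example and are not the themis-ml API.

```python
# Hypothetical FMLI sketch in the scikit-learn idiom; names are
# illustrative only, not the themis-ml API.
import numpy as np


class FairnessScorer:
    """Scorer API: measure social bias in labels or predictions."""

    def score(self, y, s):
        # Mean difference: p(y+ | advantaged) - p(y+ | disadvantaged).
        y, s = np.asarray(y), np.asarray(s)
        return y[s == 0].mean() - y[s == 1].mean()


class FairTransformer:
    """Transformer API: preprocess (X, y, s) to reduce bias, e.g. relabelling."""

    def fit_transform(self, X, y, s):
        raise NotImplementedError


class FairEstimator:
    """Estimator API: train a model that is aware of the protected attribute s."""

    def fit(self, X, y, s):
        raise NotImplementedError

    def predict(self, X, s=None):
        # Predictor API: optionally post-process predictions using s.
        raise NotImplementedError
```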

  9. Discrimination Discovery Objective: given a set of decision records D = {(x, y)}, a measure of social bias b, and a protected class s, identify the subset of potentially discriminated decision records.

  10. Discrimination Discovery. Disparate treatment: a hiring decision process that only accepts men’s resumes. Disparate impact: a hiring decision process that accepts all resumes but still produces systematically worse outcomes for one group.

  11. Discrimination Discovery: individual-level (individual consistency) and group-level (statistical parity).

  12. Fairness-aware Machine Learning Objective: given a set of decision records D = {(x, y)}, a measure of social bias b, a protected class s, and a measure of performance p, train a machine learning model that makes fair predictions while preserving the accuracy of decisions (the utility-fairness tradeoff).

  13. Fairness-aware Machine Learning Assumptions: (1) the prediction target y is a binary variable where y+ = 1 is a beneficial outcome (e.g. “good credit risk”) and y− = 0 is an adverse outcome (e.g. “bad credit risk”); (2) the protected class s ∈ {d, a} is a binary variable where d = 1 is the disadvantaged group (e.g. “immigrant”) and a = 0 is the advantaged group (e.g. “citizen”).
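  A minimal illustration of these encoding assumptions, with values invented for this example:

```python
import numpy as np

# Binary target: 1 = beneficial outcome ("good credit risk"), 0 = adverse outcome.
y = np.array([1, 0, 1, 1, 0, 1])

# Binary protected class: 1 = disadvantaged group, 0 = advantaged group.
s = np.array([1, 1, 0, 0, 0, 1])
```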

  14. Machine Learning Pipeline: Raw Data → Preprocessing → Model Specifications → Training → Instantiated Models → Evaluation → Deployed Model → Prediction → Predictions on New Data.

  15. Machine Learning Pipeline, Evaluation — Scorer API: Mean difference. Compute p(y+ | a) − p(y+ | d). Values range from −1 to 1: 1 is the maximum discrimination case, 0 is the statistical parity case, and −1 is the maximum reverse-discrimination case.

  16. Mean Difference example (loans granted vs. denied to men and women): in one scenario both groups are granted loans at a rate of 0.5, so mean difference = 0.5 − 0.5 = 0; in another, men are granted loans at a rate of 0.67 and women at 0.25, so mean difference = 0.67 − 0.25 = 0.42.
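  A minimal sketch of the mean difference computation in plain NumPy (not the themis-ml scorer); the toy data is invented to roughly reproduce the 0.67 − 0.25 scenario above.

```python
import numpy as np


def mean_difference(y, s):
    """p(y+ | advantaged) - p(y+ | disadvantaged).

    y: binary outcomes (1 = beneficial); s: binary protected class
    (1 = disadvantaged, 0 = advantaged). Ranges from -1 to 1, where 0
    is statistical parity.
    """
    y, s = np.asarray(y), np.asarray(s)
    return y[s == 0].mean() - y[s == 1].mean()


# Toy example: 3 men (advantaged), 2 of 3 loans granted;
# 4 women (disadvantaged), 1 of 4 loans granted.
y = np.array([1, 1, 0, 1, 0, 0, 0])
s = np.array([0, 0, 0, 1, 1, 1, 1])
print(mean_difference(y, s))  # 0.67 - 0.25 ≈ 0.42
```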

  17. Machine Learning Pipeline, Evaluation — Scorer API: Consistency. For each observation x_i, find its k nearest neighbors x_j ∈ knn(x_i); sum the absolute differences between the label y_i and the labels y_j of its neighbors; normalize this sum by N × k (number of observations × number of neighbors); subtract the normalized value from 1. Values range from 0 to 1: 1 indicates perfect consistency (no individual-level discrimination), 0 indicates maximum inconsistency.

  18. Consistency example (k = 5 neighbors, N = 2 observations; loans granted vs. denied, with sex and home ownership as features): when each observation has the same loan outcome as all of its neighbors, consistency = 1 − [(0 + 0) / (2 × 5)] = 1; when each observation disagrees with all 5 of its neighbors, consistency = 1 − [(5 + 5) / (2 × 5)] = 0.
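  A rough sketch of the consistency score using scikit-learn's nearest-neighbors search (not the themis-ml scorer); the toy features and labels are invented for illustration.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors


def consistency(X, y, k=5):
    """1 - (1 / (N * k)) * sum over each observation of the absolute
    differences between its label and its k nearest neighbors' labels.
    1 = fully consistent (similar people get similar outcomes),
    0 = maximally inconsistent."""
    X, y = np.asarray(X, dtype=float), np.asarray(y, dtype=float)
    # Ask for k + 1 neighbors because each point is its own nearest neighbor.
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X)
    _, idx = nn.kneighbors(X)
    neighbor_labels = y[idx[:, 1:]]  # drop the point itself
    diffs = np.abs(y[:, None] - neighbor_labels)
    return 1.0 - diffs.sum() / (len(y) * k)


# Toy usage: features are (income, is_homeowner); labels are loan decisions.
X = [[30, 0], [32, 0], [31, 1], [65, 1], [70, 1], [68, 0]]
y = [0, 0, 0, 1, 1, 1]
print(consistency(X, y, k=3))
```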

  19. Machine Learning Pipeline, Preprocessing — Transformer API: Relabelling. Train a ranker model R on the dataset D; modify D to create D_new such that statistical parity is achieved: top-ranked (X_d, y−) records are “promoted” to (X_d, y+), and bottom-ranked (X_a, y+) records are “demoted” to (X_a, y−). Training a model on D_new should yield fairer predictions compared to a model trained on D.

  20. Relabeling example (original data vs. relabeled data; good vs. bad credit risk plotted against income and home ownership, by sex). Assumption: the labels are incorrect, and we should directly change them in favor of the disadvantaged group.
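  A rough sketch of the relabelling (“massaging”) step under the assumptions above, using a logistic regression ranker from scikit-learn; this is a simplified illustration, not the themis-ml Transformer implementation.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression


def relabel(X, y, s):
    """Promote top-ranked disadvantaged negatives and demote the same
    number of bottom-ranked advantaged positives so that both groups end
    up with (approximately) the same positive rate."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y).copy()
    s = np.asarray(s)

    # Ranker model R: rank observations by P(y+ | x).
    ranker = LogisticRegression(max_iter=1000).fit(X, y)
    scores = ranker.predict_proba(X)[:, 1]

    # Number of labels to flip in each group to reach statistical parity.
    n_flip = int(round((y.mean() - y[s == 1].mean()) * (s == 1).sum()))
    n_flip = max(n_flip, 0)

    # Promote the highest-scoring disadvantaged negatives...
    d_neg = np.where((s == 1) & (y == 0))[0]
    y[d_neg[np.argsort(-scores[d_neg])][:n_flip]] = 1

    # ...and demote the lowest-scoring advantaged positives.
    a_pos = np.where((s == 0) & (y == 1))[0]
    y[a_pos[np.argsort(scores[a_pos])][:n_flip]] = 0
    return y
```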

  21. Machine Learning Pipeline, Prediction — Predictor API: Reject Option Classification (ROC). Train an initial classifier K; generate predicted probabilities ŷ on the test set; compute the proximity of each prediction ŷ to the decision boundary learned by the classifier; within the critical region threshold θ around the decision boundary (where 0.5 < θ < 1), predicted labels of X_d are assigned y+ and predicted labels of X_a are assigned y−.

  22. Reject Option Classification example (original predictions vs. relabeled predictions; good vs. bad credit risk plotted against income and home ownership, by sex). Assumption: observations close to the decision boundary were labelled incorrectly based on their sex, so we offset this by flipping the predictions for those observations.
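  A minimal sketch of reject option classification as described above; clf is assumed to be any fitted classifier exposing predict_proba, and theta is a hypothetical critical-region threshold. This is not the themis-ml Predictor implementation.

```python
import numpy as np


def reject_option_predict(clf, X, s, theta=0.7):
    """Flip low-confidence predictions in favor of the disadvantaged group.

    Predictions whose confidence max(p, 1 - p) falls inside the critical
    region (below theta, with 0.5 < theta < 1) are overridden: y+ for the
    disadvantaged group (s == 1), y- for the advantaged group (s == 0)."""
    s = np.asarray(s)
    proba = clf.predict_proba(X)[:, 1]
    y_pred = (proba >= 0.5).astype(int)
    in_critical_region = np.maximum(proba, 1 - proba) < theta
    y_pred[in_critical_region & (s == 1)] = 1
    y_pred[in_critical_region & (s == 0)] = 0
    return y_pred
```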

  23. Machine Learning Pipeline, Training — Estimator API: Additive Counterfactually Fair Model. Train linear models to predict each feature in X using the protected attribute(s) s as input; compute the residuals ε_ij between the predicted and true feature values for each observation i and feature j; train the final model on the residuals ε_ij as features to predict y.

  24. [Diagram: the protected classes s are used to model the features X, producing predicted features X̂; the residual features ε = X − X̂ are then used to model the labels y, producing predictions ŷ.]
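  A rough sketch of the additive counterfactually fair approach using scikit-learn linear models; the class name and details are illustrative, not the themis-ml implementation.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression


class AdditiveCounterfactuallyFairClassifier:
    """Regress each feature on the protected attribute(s) s and train the
    final classifier on the residuals X - X_hat, so the features fed to
    the model carry no linear information about s."""

    def fit(self, X, y, s):
        X = np.asarray(X, dtype=float)
        s = np.asarray(s, dtype=float).reshape(-1, 1)
        # One linear model per feature column, predicting it from s.
        self.feature_models_ = [
            LinearRegression().fit(s, X[:, j]) for j in range(X.shape[1])
        ]
        self.clf_ = LogisticRegression(max_iter=1000).fit(self._residuals(X, s), y)
        return self

    def _residuals(self, X, s):
        X_hat = np.column_stack([m.predict(s) for m in self.feature_models_])
        return X - X_hat

    def predict(self, X, s):
        X = np.asarray(X, dtype=float)
        s = np.asarray(s, dtype=float).reshape(-1, 1)
        return self.clf_.predict(self._residuals(X, s))
```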

  25. Machine Learning Pipeline, Training — Estimator API: Prejudice Remover Regularization. Define the “prejudice index” (PI) as the degree to which a prediction ŷ depends on a sensitive attribute s (i.e. their mutual information); add PI as a term to the objective function; minimize (or maximize, depending on the formulation) with gradient descent or another optimization method of your choice.

  26. Prejudice Remover Regularization: minimize an objective function with three terms. Logistic regression cost: “don’t make mistakes on predictions with respect to the true labels.” L2 regularization: “don’t weight any particular feature too much.” Prejudice index regularization: “don’t depend on sensitive features too much to make predictions.”
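  A rough sketch of a prejudice-remover objective along these lines: logistic loss plus an L2 penalty plus a mutual-information (prejudice index) penalty, minimized numerically with SciPy. The lam and eta hyperparameters are hypothetical, and this is not the themis-ml implementation.

```python
import numpy as np
from scipy.optimize import minimize


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def prejudice_index(p, s):
    """Estimated mutual information between the predicted label and the
    sensitive attribute s, normalized by the number of observations."""
    s = np.asarray(s)
    eps = 1e-12
    pr_y1 = p.mean()  # marginal Pr(y = 1)
    pi = 0.0
    for s_val in (0, 1):
        group = p[s == s_val]
        pr_y1_s = group.mean()  # conditional Pr(y = 1 | s)
        pi += np.sum(group * np.log((pr_y1_s + eps) / (pr_y1 + eps)))
        pi += np.sum((1 - group) * np.log((1 - pr_y1_s + eps) / (1 - pr_y1 + eps)))
    return pi / len(p)


def objective(w, X, y, s, lam=0.01, eta=1.0):
    """Logistic regression cost + L2 regularization + prejudice index."""
    p = sigmoid(X @ w)
    log_loss = -np.mean(y * np.log(p + 1e-12) + (1 - y) * np.log(1 - p + 1e-12))
    return log_loss + lam * np.dot(w, w) + eta * prejudice_index(p, s)


def fit_prejudice_remover(X, y, s, lam=0.01, eta=1.0):
    # Numerical optimization with finite-difference gradients, for brevity.
    X = np.asarray(X, dtype=float)
    w0 = np.zeros(X.shape[1])
    return minimize(objective, w0, args=(X, y, s, lam, eta)).x
```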

  27. Prejudice Remover Regularization. [Plot: cost vs. value of weight Θ for the fairness-unaware objective and the fairness-aware objective, illustrating the fairness-utility tradeoff.]

  28. Case Study: German Credit Data. 1,000 loan application records. One binary target variable y: 700 “good” credit_risk, 300 “bad” credit_risk. ~20 input variables X, e.g. housing, credit_history, purpose, foreign_worker, personal_status_and_sex, age_in_years. Three binary protected classes s: is_foreign, is_female, age_below_25. Source: https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data)

  29. Case Study: German Credit Data. Measure potential discrimination in the raw data (see the sketch below).
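  A hedged sketch of this measurement step using pandas; the file path, preprocessing, and binary encodings are assumptions made for illustration, using the column names listed on slide 28.

```python
import pandas as pd

# Assumes a preprocessed German Credit DataFrame with a binary credit_risk
# column (1 = good risk) and binary protected-class columns; the file path
# and encodings here are illustrative assumptions.
df = pd.read_csv("german_credit_preprocessed.csv")


def mean_difference(y, s):
    # p(good credit | advantaged) - p(good credit | disadvantaged)
    return y[s == 0].mean() - y[s == 1].mean()


for protected in ["is_foreign", "is_female", "age_below_25"]:
    md = mean_difference(df["credit_risk"], df[protected])
    print(f"{protected}: mean difference = {md:.3f}")
```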

  30. Case Study: German Credit Data. Does the algorithm make socially biased predictions? [Chart: mean difference in the raw data vs. mean difference scores of logistic regression test predictions, by protected attribute.]

  31. Case Study: German Credit Data. Assess the fairness-utility trade-off. [Chart: mean difference and AUC scores generated by a logistic regression model for each experimental condition, by protected attribute. Conditions: baseline, remove protected attributes, relabel target variable, counterfactually fair model, reject-option classification.]

  32. Live Demo: Fatal Encounters Dataset www.fatalencounters.org

  33. So now what?

  34. MEASURE AND MITIGATE ALL THE THINGS

  35. FML-as-a-service (“FMLAAS”) Use Case: organizations (private, nonprofit, government) send Data to a Third-party Predictive Service, which trains Models and returns Predictions. Example: “As a law-enforcement agency, I need to identify individuals who are most likely to be connected to gang X.”

  36. FML-as-a-service (“FMLAAS”) with FMLI, Use Case 1: “As a law-enforcement agency, I want to measure the degree to which my data contains discriminatory patterns.” [Diagram: the organization’s Data passes through an FMLI web service exposing measure, fit, and predict operations, which returns fairness Metrics alongside the Models and Predictions from the Third-party Predictive Service.]
