Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing - PowerPoint PPT Presentation



SLIDE 1

Is There a Trade-Off Between Fairness and Accuracy?
A Perspective Using Mismatched Hypothesis Testing

Sanghamitra Dutta (sanghamd@andrew.cmu.edu), Dennis Wei (dwei@us.ibm.com), Hazar Yueksel (hazar.yueksel@ibm.com), Pin-Yu Chen (pin-yu.chen@ibm.com), Sijia Liu (sijia.liu@ibm.com), Kush Varshney (krvarshn@us.ibm.com)

SLIDE 2

Motivational Example

Construct Space (Y^c, Z^c) β†’ Noisy Mapping β†’ Observed Space (Y, Z)

Y: exam score; Z: data label (0 or 1); a: protected attribute (gender, race, etc.); Y^c: true ability; Z^c: true label

In the construct space there is no trade-off between accuracy and fairness: the Bayes optimal classifier achieves fairness (equal opportunity). The accuracy-fairness trade-off in the observed space arises because the mapping is noisier for one group, making the 0 and 1 labels "less separable."

Setup inspired by [Friedler et al. '16] and [Yeom et al. '18]; definition of equal opportunity from [Hardt et al. '16].

SLIDE 3

Main Contributions

Concept of Separability
Chernoff information: an approximation to the best error exponent in binary hypothesis testing.
  • Explains the trade-off (Theorem 1)
  • Computes fundamental limits
[Plot: accuracy vs. discrimination, showing the trade-off on existing data and the trade-off after data collection.] Accuracy with respect to the observed dataset is a problematic measure of performance.

Ideal Distributions, where accuracy and fairness are in accord
  • Proof of existence, with analytical forms (Theorem 2)
  • Interpretation: plausible distributions in the observed space, or distributions in the construct space

Alleviating the Trade-off in the Real World
Gather knowledge from active data collection, often improving separability.
  • Criterion to alleviate the trade-off (Theorem 3)
  • Computation of the alleviated trade-off
These results also explain why active fairness works.

SLIDE 4

Related Works

  • Characterizing the Accuracy-Fairness Trade-Off: [Menon & Williamson '18] [Garg et al. '19] [Chen et al. '18] [Zhao & Gordon '19]
  • Empirical Datasets for Accuracy Evaluation: [Wick et al. '19] [Sharma et al. '19]
  • Pre-processing Datasets for Fairness: [Calmon et al. '18] [Feldman et al. '15] [Zemel et al. '13]
  • Explainability / Active Fairness: [Varshney et al. '18] [Noriega-Campero et al. '19]

Exponent Analysis with Geometric Interpretability

SLIDE 5

Preliminaries

Construct Space (Y^c, Z^c) β†’ Noisy Mapping β†’ Observed Space (Y, Z), where Z = Z^c and Y = g_{Z,a}(Y^c).

For group a = 0: Y|_{Z=0, a=0} ~ Q_0(y) and Y|_{Z=1, a=0} ~ Q_1(y)
For group a = 1: Y|_{Z=0, a=1} ~ R_0(y) and Y|_{Z=1, a=1} ~ R_1(y)

Likelihood-ratio classifiers: U_0(y) = log(Q_1(y)/Q_0(y)) β‰₯ Ξ½_0 and U_1(y) = log(R_1(y)/R_0(y)) β‰₯ Ξ½_1.

  • Probability of False Negative (FN): Q_{FN,a}(ν_a) = Pr(U_a(y) < ν_a | Z = 1, A = a), the wrongful rejection of a true positive (true Z = 1)
  • Probability of False Positive (FP): Q_{FP,a}(ν_a) = Pr(U_a(y) ≥ ν_a | Z = 0, A = a), the wrongful acceptance of a true negative (true Z = 0)
  • Probability of error: Q_{e,a}(ν) = ρ_0 Q_{FP,a}(ν) + ρ_1 Q_{FN,a}(ν), with prior probabilities ρ_0 = ρ_1 = 1/2

EQUAL OPPORTUNITY β†’ EQUAL probability of FN
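As a concrete sketch of these definitions, the snippet below evaluates Q_FN, Q_FP, and Q_e in closed form for the Gaussian example used later in the deck (Q_0 = N(1,1), Q_1 = N(4,1)); the helper names are mine, not from the talk.

```python
from math import erf, sqrt

def phi(x):
    """Standard normal CDF."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))

# Running example for group a = 0: Q_0 = N(1,1) under Z = 0, Q_1 = N(4,1) under Z = 1.
mu0, mu1, sigma = 1.0, 4.0, 1.0

def error_probs(nu):
    """FN, FP, and overall error probabilities of the test U_0(y) >= nu, where
    U_0(y) = log Q_1(y)/Q_0(y) = (mu1 - mu0)/sigma^2 * (y - (mu0 + mu1)/2),
    so U_0(y) >= nu is equivalent to y >= y_thresh."""
    y_thresh = (mu0 + mu1) / 2 + nu * sigma**2 / (mu1 - mu0)
    q_fn = phi((y_thresh - mu1) / sigma)        # Pr(U_0 < nu | Z = 1): wrongful reject
    q_fp = 1.0 - phi((y_thresh - mu0) / sigma)  # Pr(U_0 >= nu | Z = 0): wrongful accept
    q_e = 0.5 * q_fp + 0.5 * q_fn               # equal priors rho_0 = rho_1 = 1/2
    return q_fn, q_fp, q_e

# Bayes optimal threshold under equal priors: nu = 0, i.e. accept when y >= 2.5.
q_fn, q_fp, q_e = error_probs(0.0)
```

At nu = 0 the threshold sits midway between the means, so the FN and FP probabilities coincide, consistent with the symmetric Bayes optimal operating point.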

SLIDE 6

Quick Background on Chernoff Error Exponents

Q_{FN,a}(Ξ½_a) ≲ e^{βˆ’F_{FN,a}(Ξ½_a)} and Q_{FP,a}(Ξ½_a) ≲ e^{βˆ’F_{FP,a}(Ξ½_a)}

are the Chernoff exponents of the probabilities of FN and FP (larger exponent β†’ lower error). Since Q_{e,a}(Ξ½) = (1/2) Q_{FN,a}(Ξ½) + (1/2) Q_{FP,a}(Ξ½), we define the Chernoff exponent of the overall error probability as

F_{e,a}(Ξ½_a) = min{F_{FP,a}(Ξ½_a), F_{FN,a}(Ξ½_a)}

Lemma: The Chernoff exponent of the error probability of the Bayes optimal classifier between distributions Q_0(y) under Z = 0 and Q_1(y) under Z = 1 is the Chernoff information

D(Q_0, Q_1) = βˆ’log min_{λ∈[0,1]} Ξ£_y Q_0(y)^Ξ» Q_1(y)^{1βˆ’Ξ»}

(Larger exponent β†’ lower error β†’ higher accuracy.)

[Cover & Thomas]
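The Chernoff information in the lemma can be checked numerically; the sketch below (grid search over Ξ», Riemann sum over y, function names assumed) uses the deck's unit-variance Gaussian example, where the closed form is (ΞΌ_1 βˆ’ ΞΌ_0)Β²/8 = 9/8.

```python
import numpy as np

def chernoff_information(p0, p1, ys):
    """D(Q_0, Q_1) = -log min over lambda in [0,1] of the integral of
    p0(y)^lambda * p1(y)^(1-lambda) dy (Riemann sum in y, grid search in lambda)."""
    dy = ys[1] - ys[0]
    lams = np.linspace(0.0, 1.0, 1001)
    vals = [float(np.sum(p0**lam * p1**(1.0 - lam)) * dy) for lam in lams]
    return -np.log(min(vals))

ys = np.linspace(-10.0, 15.0, 20001)
gauss = lambda mu: np.exp(-(ys - mu) ** 2 / 2.0) / np.sqrt(2.0 * np.pi)

# Deck example: Q_0 = N(1,1), Q_1 = N(4,1); closed form D = (4 - 1)^2 / 8 = 9/8.
D = chernoff_information(gauss(1.0), gauss(4.0), ys)
```

For equal-variance Gaussians the minimizing Ξ» is 1/2, so the grid search lands at the Bhattacharyya point.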

SLIDE 7

Our Proposition: Concept of Separability

  • Definition of Separability: For a group of people with data distributions Q_0(y) and Q_1(y) under hypotheses Z = 0 and Z = 1, we define their separability as the Chernoff information D(Q_0, Q_1).

Geometric interpretability makes these exponents tractable.

SLIDE 8

Geometric understanding of the results

For group a = 0: Q_0(y) ~ N(1,1) and Q_1(y) ~ N(4,1), with classifier U_0(y) β‰₯ Ξ½_0.

Log-generating functions of U_0 under the two hypotheses:
Ξ›_0(v) = log E[e^{v U_0} | Z = 0, a = 0] = (9/2) v(v βˆ’ 1)
Ξ›_1(v) = log E[e^{v U_0} | Z = 1, a = 0] = (9/2) v(v + 1)

Error exponents as Legendre transforms (a tangent with slope Ξ½_0):
F_{FP,a=0}(Ξ½_0) = sup_{v β‰₯ 0} (v Ξ½_0 βˆ’ Ξ›_0(v))   (EFP)
F_{FN,a=0}(Ξ½_0) = sup_{v ≀ 0} (v Ξ½_0 βˆ’ Ξ›_1(v))   (EFN)
F_{e,a=0}(Ξ½_0) = min{F_{FP,a=0}(Ξ½_0), F_{FN,a=0}(Ξ½_0)}

[Plots: the densities Q_0(y) and Q_1(y) with means ΞΌ = 1 and ΞΌ = 4, and the log-generating functions Ξ›_0(u) and Ξ›_1(u) with the tangent of slope Ξ½_0; axis ticks omitted.]
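The Legendre-transform picture can be reproduced directly; a sketch assuming the Ξ›'s of the example (Q_0 = N(1,1), Q_1 = N(4,1)), computing each sup on a grid:

```python
import numpy as np

# Log-generating functions of U_0 for Q_0 = N(1,1), Q_1 = N(4,1):
# under Z = 0, U_0 ~ N(-9/2, 9); under Z = 1, U_0 ~ N(9/2, 9).
Lam0 = lambda v: 4.5 * v * (v - 1.0)
Lam1 = lambda v: 4.5 * v * (v + 1.0)

_pos = np.linspace(0.0, 5.0, 100001)   # grid for v >= 0
_neg = np.linspace(-5.0, 0.0, 100001)  # grid for v <= 0

def F_FP(nu):
    """Legendre transform sup_{v >= 0} (v*nu - Lam0(v)): tangent of slope nu to Lam0."""
    return float(np.max(_pos * nu - Lam0(_pos)))

def F_FN(nu):
    """Legendre transform sup_{v <= 0} (v*nu - Lam1(v)): tangent of slope nu to Lam1."""
    return float(np.max(_neg * nu - Lam1(_neg)))

def F_e(nu):
    return min(F_FP(nu), F_FN(nu))

# Closed forms here: F_FP(nu) = (nu + 4.5)^2/18 and F_FN(nu) = (nu - 4.5)^2/18,
# which meet at the Bayes optimal threshold nu = 0 with value D(Q_0, Q_1) = 9/8.
```

At nu = 0 the two tangent intercepts coincide, which is exactly the crossing point shown on the Ξ› plot in the following slides.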

SLIDE 9

Geometric understanding of the results

(This build slide repeats the content of Slide 8.)

SLIDE 10

Geometric understanding of the results

(Same setup as Slide 8: for group a = 0, Q_0(y) ~ N(1,1) and Q_1(y) ~ N(4,1), classifier U_0(y) β‰₯ Ξ½_0, with log-generating functions Ξ›_0(v) = (9/2)v(v βˆ’ 1) and Ξ›_1(v) = (9/2)v(v + 1).)

At the Bayes optimal threshold the two error exponents coincide with the Chernoff information, F_FN = F_FP = D(Q_0, Q_1), marked as C(P_0, P_1) on the plot of Ξ›_0(u) and Ξ›_1(u).

SLIDE 11

Geometric understanding of the results

(This build slide repeats the content of Slide 10.)

SLIDE 12

The accuracy-fairness trade-off is due to a difference in separability between the two groups.

Theorem 1 (informal): One of the following holds in the observed space:

  • Unbiased mappings, D(Q_0, Q_1) = D(R_0, R_1): The Bayes optimal classifiers for the two groups also satisfy equal opportunity, i.e., F_{FN,a=0}(ν_0) = F_{FN,a=1}(ν_1).
  • Biased mappings, D(Q_0, Q_1) < D(R_0, R_1): Given two classifiers (one for each group) that satisfy equal opportunity, for at least one of the groups the classifier is not Bayes optimal, i.e., either F_{e,a=0}(ν_0) < D(Q_0, Q_1) or F_{e,a=1}(ν_1) < D(R_0, R_1), or both.
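A quick numeric illustration of the biased-mappings case, using closed-form exponents that follow from the deck's running example (my derivation, not shown on the slide): R_0 = N(0,1) and R_1 = N(4,1) give D(R_0, R_1) = 2 > D(Q_0, Q_1) = 9/8, and equalizing the FN exponents while keeping group a = 1 Bayes optimal drives group a = 0's overall exponent below its separability.

```python
from math import sqrt

# Closed-form exponents for the deck's Gaussian example (unit variance), derived
# from the Legendre transforms of the log-generating functions:
#   group a = 0: Q_0 = N(1,1), Q_1 = N(4,1)  ->  D(Q_0, Q_1) = 3**2 / 8 = 1.125
#   group a = 1: R_0 = N(0,1), R_1 = N(4,1)  ->  D(R_0, R_1) = 4**2 / 8 = 2.0
F_FN0 = lambda nu: (nu - 4.5) ** 2 / 18.0  # FN exponent, group a = 0
F_FP0 = lambda nu: (nu + 4.5) ** 2 / 18.0  # FP exponent, group a = 0
F_FN1 = lambda nu: (nu - 8.0) ** 2 / 32.0  # FN exponent, group a = 1
D_Q, D_R = 9.0 / 8.0, 2.0

# Keep group a = 1 at its Bayes optimal threshold nu_1 = 0: FN exponent = D(R_0, R_1).
target = F_FN1(0.0)

# Equal opportunity: choose nu_0 with F_FN0(nu_0) = target (branch nu_0 <= 4.5).
nu0 = 4.5 - sqrt(18.0 * target)

# Overall exponent for group a = 0 now falls below its separability D(Q_0, Q_1).
F_e0 = min(F_FN0(nu0), F_FP0(nu0))
```

Here nu_0 = -1.5 and F_e0 = 0.5 < 9/8: matching the less separable group's FN exponent to the privileged group's costs overall accuracy, as Theorem 1 predicts.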

SLIDE 13

Geometric understanding of the results

For group a = 0: Q_0(y) ~ N(1,1) and Q_1(y) ~ N(4,1), with classifier U_0(y) β‰₯ Ξ½_0.
For group a = 1: R_0(y) ~ N(0,1) and R_1(y) ~ N(4,1), with classifier U_1(y) β‰₯ Ξ½_1.

[Plots: densities with means ΞΌ = 1, 4 (group a = 0) and ΞΌ = 0, 4 (group a = 1); log-generating functions with the Chernoff informations C(Q_0, Q_1) and C(P_0, P_1) marked.]

For group a = 0, we have F_FP = F_FN = D(Q_0, Q_1).
For group a = 1, we have F_FP = F_FN = D(R_0, R_1).

The Bayes optimal classifiers do not satisfy equal opportunity (unequal F_FN).

SLIDE 14

Geometric understanding of the results

Avoid active harm to the privileged group?

For group a = 0: Q_0(y) ~ N(1,1) and Q_1(y) ~ N(4,1), with classifier U_0(y) β‰₯ Ξ½_0.
For group a = 1: R_0(y) ~ N(0,1) and R_1(y) ~ N(4,1), with classifier U_1(y) β‰₯ Ξ½_1.

[Plots: densities for the two groups and log-generating functions with the tangents shifted to equalize the FN exponents; axis ticks omitted.]

F_{FN,a=0}(Ξ½_0) = F_{FN,a=1}(Ξ½_1): equal opportunity (equal F_FN) is satisfied, but the classifier is sub-optimal for the privileged group a = 1.

SLIDE 15

Geometric understanding of the results

For group a = 0: Q_0(y) ~ N(1,1) and Q_1(y) ~ N(4,1), with classifier U_0(y) β‰₯ Ξ½_0.
For group a = 1: R_0(y) ~ N(0,1) and R_1(y) ~ N(4,1), with classifier U_1(y) β‰₯ Ξ½_1.

[Plots: densities for the two groups and log-generating functions with the tangents shifted to equalize the FN exponents; axis ticks omitted.]

F_{FN,a=0}(Ξ½_0) = F_{FN,a=1}(Ξ½_1): equal opportunity (equal F_FN) is satisfied, but the classifier is sub-optimal for the unprivileged group a = 0. For at least one of the groups, accuracy on the given data is compromised for fairness.

SLIDE 16

Ideal distributions where accuracy and fairness are in accord

Theorem 2 (informal): Fix the Bayes optimal classifier for the privileged group a = 1. Then, for group a = 0, there exist ideal distributions QΜƒ_0(y) and QΜƒ_1(y) such that:

  • Fairness on given data: The Bayes optimal classifier for the new distributions is fair on the given data (in fact, it is the same classifier U_0^*(y) ≥ ν_0^* that was sub-optimal but fair on the given data).
  • Fairness and optimal accuracy on ideal data: On the ideal data, this Bayes optimal classifier also has F_e = D(Q̃_0, Q̃_1) = D(R_0, R_1).

Proof of existence of ideal distributions (with analytical forms).

SLIDE 17

How to go about finding such ideal distributions?

[The analytical forms of the ideal distributions QΜƒ_0(y) and QΜƒ_1(y) are given on the slide,] where UΜƒ_0(y) = log(QΜƒ_1(y)/QΜƒ_0(y)) β‰₯ 0 is the Bayes optimal classifier for the ideal distributions.

SLIDE 18

How to interpret these ideal distributions?

Construct Space (Y^c, Z^c) β†’ Biased Noisy Mapping (Z = Z^c, Y = g_{Z,a}(Y^c)) β†’ Observed Space (Y, Z)
Construct Space (Y^c, Z^c) β†’ Unbiased Mapping (ZΜƒ = Z^c, YΜƒ = gΜƒ_{Z,a}(Y^c)) β†’ (YΜƒ, ZΜƒ)

Plausible distributions in the observed space under unbiased mappings, or candidate distributions in the construct space under identity mappings.

For group a = 0: YΜƒ|_{Z=0, a=0} ~ QΜƒ_0(y) and YΜƒ|_{Z=1, a=0} ~ QΜƒ_1(y)
For group a = 1: YΜƒ|_{Z=0, a=1} ~ R_0(y) and YΜƒ|_{Z=1, a=1} ~ R_1(y)

SLIDE 19

When does active data collection alleviate the accuracy-fairness trade-off in the real world?

Y': a new feature collected for group a = 0, with
(Y, Y')|_{Z=0, a=0} ~ X_0(y, y') and (Y, Y')|_{Z=1, a=0} ~ X_1(y, y')

Theorem 3: The separability D(X_0, X_1) is strictly greater than D(Q_0, Q_1) if and only if the conditional mutual information I(Y'; Z | Y, a = 0) > 0.

Improving separability alleviates the accuracy-fairness trade-off in the real world.

SLIDE 20

Numerical example: exact computation of the trade-off

For group a = 0 (existing data): Q_0(y) ~ N(1,1) and Q_1(y) ~ N(4,1)
For group a = 0 (after data collection): X_0(y, y') ~ N((1,1), I_{2Γ—2}) and X_1(y, y') ~ N((4,2), I_{2Γ—2})

[Plot: accuracy F_{e,a}(Ξ½_a) vs. decrease in fairness |F_{FN,a=0}(Ξ½_0) βˆ’ F_{FN,a=1}(Ξ½_1)|, showing the trade-off curve on existing data, ending at C(P_0, P_1), and the trade-off curve after data collection, ending at C(W_0, W_1). Marked operating points: FAIR but SUBOPTIMAL on existing data; FAIR but SUBOPTIMAL on existing data after data collection; BAYES OPTIMAL but UNFAIR on existing data; BAYES OPTIMAL but UNFAIR on existing data after data collection.]
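The separability gain behind this numerical example can be reproduced with the equal-covariance Gaussian formula D = (1/8) Δμᡀ Ξ£β»ΒΉ Δμ (the optimal Ξ» is 1/2 in this case); the helper below is a sketch with assumed names.

```python
import numpy as np

def chernoff_gaussian(mu0, mu1, cov):
    """Chernoff information between N(mu0, cov) and N(mu1, cov). With equal
    covariances the optimal lambda is 1/2, giving (1/8)(mu1-mu0)' cov^{-1} (mu1-mu0)."""
    d = np.asarray(mu1, float) - np.asarray(mu0, float)
    return float(d @ np.linalg.solve(np.asarray(cov, float), d)) / 8.0

# Existing data for group a = 0: Q_0 = N(1,1), Q_1 = N(4,1).
D_before = chernoff_gaussian([1.0], [4.0], [[1.0]])

# After collecting Y': X_0 = N((1,1), I_2), X_1 = N((4,2), I_2).
D_after = chernoff_gaussian([1.0, 1.0], [4.0, 2.0], np.eye(2))

# Separability rises from 9/8 to 10/8, consistent with Theorem 3: the means of Y'
# differ across Z, so Y' is informative about Z given Y.
```

The strict increase (1.25 > 1.125) is exactly the alleviated trade-off plotted on this slide.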

SLIDE 21

Summary

  • Provides new tools that go beyond explaining the accuracy-fairness trade-off
  • Geometric interpretability enables exact quantification of this trade-off
  • Separability and ideal distributions, and their connection to the construct space
  • The criterion for alleviating the trade-off explains the success of active fairness

Thank You!