Bounding the fairness and accuracy of classifiers from population statistics ICML 2020
Sivan Sabato and Elad Yom-Tov
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 1 / 15
Bounding the fairness and accuracy of classifiers from population - - PowerPoint PPT Presentation
Bounding the fairness and accuracy of classifiers from population statistics ICML 2020 Sivan Sabato and Elad Yom-Tov Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 1 / 15 The 1-slide summary We show how to study a
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 1 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 2 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 2 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 2 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 2 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 2 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 2 / 15
◮ Unavailability of representative individual-level validation data ◮ Company of government secret: not even black-box access
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 3 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 4 / 15
◮ Accuracy ◮ Fairness Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 4 / 15
◮ Accuracy ◮ Fairness
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 4 / 15
◮ E.g., race, age, gender, state of residence
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 5 / 15
◮ E.g., race, age, gender, state of residence
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 5 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 6 / 15
State Population Fraction Have condition Classified as positive California 12.2% 0.3% 0.4% Texas 8.6% 1.2% 5% ... ... ... ...
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 6 / 15
State Population Fraction Have condition Classified as positive California 12.2% 0.3% 0.4% Texas 8.6% 1.2% 5% ... ... ... ...
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 6 / 15
State Population Fraction Have condition Classified as positive California 12.2% 0.3% 0.4% Texas 8.6% 1.2% 5% ... ... ... ...
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 6 / 15
State Population Fraction Have condition Classified as positive California 12.2% 0.3% 0.4% Texas 8.6% 1.2% 5% ... ... ... ...
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 6 / 15
State Population Fraction Have condition Classified as positive California 12.2% 0.3% 0.4% Texas 8.6% 1.2% 5% ... ... ... ...
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 6 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 7 / 15
Population Fraction Have condition Classified as positive State A 1/2 1/3 1/2 State B 1/2 2/3 2/3 Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 7 / 15
Population Fraction Have condition Classified as positive State A 1/2 1/3 1/2 State B 1/2 2/3 2/3 ◮ True positives:
.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 7 / 15
Population Fraction Have condition Classified as positive State A 1/2 1/3 1/2 State B 1/2 2/3 2/3 ◮ True positives:
.
◮ Which are the predicted positives? Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 7 / 15
Population Fraction Have condition Classified as positive State A 1/2 1/3 1/2 State B 1/2 2/3 2/3 ◮ True positives:
.
◮ Which are the predicted positives? ◮ Smallest error:
. Error of 12.5%, unfair.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 7 / 15
Population Fraction Have condition Classified as positive State A 1/2 1/3 1/2 State B 1/2 2/3 2/3 ◮ True positives:
.
◮ Which are the predicted positives? ◮ Smallest error:
. Error of 12.5%, unfair.
◮ Fair solution:
. 25% error.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 7 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
◮ What is the minimal unfairness that the classifier must have,
given an upper bound on its error?
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
◮ What is the minimal unfairness that the classifier must have,
given an upper bound on its error?
◮ What is the minimal error that the classifier must have,
given an upper bound on its unfairness?
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ error: Fraction of the population classified with the wrong label. ◮ unfairness: Fraction of the population treated differently than a
common baseline. We expand on this next.
◮ What is the minimal unfairness that the classifier must have,
given an upper bound on its error?
◮ What is the minimal error that the classifier must have,
given an upper bound on its unfairness?
◮ What is the minimal combined cost of this classifier? Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 8 / 15
◮ A baseline distribution which is common to all sub-populations;
FPR = α1 and FNR = α0,
◮ A nuisance distribution for each sub-population s;
FPR = α1
s and FNR = α0 s,
◮ The distribution for sub-population s is a mixture:
ηs · Nuisances + (1 − ηs) · Baseline.
η
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 9 / 15
◮ A baseline distribution which is common to all sub-populations;
FPR = α1 and FNR = α0,
◮ A nuisance distribution for each sub-population s;
FPR = α1
s and FNR = α0 s,
◮ The distribution for sub-population s is a mixture:
ηs · Nuisances + (1 − ηs) · Baseline.
◮ Define unfairness as the fraction of the population that is treated
differently from the baseline treatment =
s ηs.
η
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 9 / 15
◮ A baseline distribution which is common to all sub-populations;
FPR = α1 and FNR = α0,
◮ A nuisance distribution for each sub-population s;
FPR = α1
s and FNR = α0 s,
◮ The distribution for sub-population s is a mixture:
ηs · Nuisances + (1 − ηs) · Baseline.
◮ Define unfairness as the fraction of the population that is treated
differently from the baseline treatment =
s ηs.
◮ The decomposition to baseline and nuisance is unobserved.
η
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 9 / 15
◮ A baseline distribution which is common to all sub-populations;
FPR = α1 and FNR = α0,
◮ A nuisance distribution for each sub-population s;
FPR = α1
s and FNR = α0 s,
◮ The distribution for sub-population s is a mixture:
ηs · Nuisances + (1 − ηs) · Baseline.
◮ Define unfairness as the fraction of the population that is treated
differently from the baseline treatment =
s ηs.
◮ The decomposition to baseline and nuisance is unobserved. ◮ Set ηs to the minimum consistent with the input statistics.
η
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 9 / 15
◮ A baseline distribution which is common to all sub-populations;
FPR = α1 and FNR = α0,
◮ A nuisance distribution for each sub-population s;
FPR = α1
s and FNR = α0 s,
◮ The distribution for sub-population s is a mixture:
ηs · Nuisances + (1 − ηs) · Baseline.
◮ Define unfairness as the fraction of the population that is treated
differently from the baseline treatment =
s ηs.
◮ The decomposition to baseline and nuisance is unobserved. ◮ Set ηs to the minimum consistent with the input statistics.
η(αy, αy
s ) =
1 − αy
s /αy
αy
s < αy
1 − (1 − αy
s )/(1 − αy)
αy
s > αy
αy
s = αy.
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.2 0.4 0.6 0.8 1 η(a,b)
b
b = 0.01 b = 0.5 b = 0.5 b = 0.75 b = 0.99
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 9 / 15
s } in each sub-population,
discrepancyβ({αy
s }) =
β · min
(α0,α1)∈[0,1]2
ws
πy
s η(αy, αy s ) + (1 − β) ·
ws
πy
s αy s .
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 10 / 15
s } in each sub-population,
discrepancyβ({αy
s }) =
β · min
(α0,α1)∈[0,1]2
ws
πy
s η(αy, αy s ) + (1 − β) ·
ws
πy
s αy s .
s } discrepancyβ({αy
s }) subject to
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 10 / 15
s } in each sub-population,
discrepancyβ({αy
s }) =
β · min
(α0,α1)∈[0,1]2
ws
πy
s η(αy, αy s ) + (1 − β) ·
ws
πy
s αy s .
s } discrepancyβ({αy
s }) subject to
s }) subject to the constraints imposed
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 10 / 15
β
β
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 11 / 15
β
β
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 11 / 15
β
β
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 11 / 15
0% 20% 40% 60% 80% 100% >99%>90%>80%>70%>60%>50%>40%>30%>20%>10%
Percent with ratio above threshold
0.2 0.4 0.6 0.8 1
0.2 0.4 0.6 0.8 1 lower bound on discβ/true value β
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 11 / 15
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 12 / 15
0.2 0.4 0.6 0.8 1
0.2 0.4 0.6 0.8 1 Error/true positives Unfairness/true positives
Breast Cervical Colon Liver Lung Skin Stomach testicular Thyroid Bladder Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 12 / 15
0.05 0.1 0.15 0.2
0.02 0.04 0.06 0.08 0.1 0.12 0.14 Error/true positives Unfairness/true positives
Sep-01 (a) Sep-01 (b) Sep-13 Oct-13 Oct-26 Oct-27 Nov-06 (a) Nov-06 (b) Nov-07 (a) Nov-07 (b)
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 13 / 15
0.5 1 1.5 2
0.2 0.4 0.6 0.8 1 Error/true positives Unfairness/true positives
Brain Breast Colon Kidney Liver Lung Oral Pancreatic Stomach Thyroid
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 14 / 15
◮ Estimating the quality of a classifier during development stages ◮ Studying classifiers of public importance ◮ Analysis of statistical phenomena by defining an appropriate classifier
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 15 / 15
◮ Estimating the quality of a classifier during development stages ◮ Studying classifiers of public importance ◮ Analysis of statistical phenomena by defining an appropriate classifier
Sabato & Yom-Tov (Microsoft & BGU) Bounding fairness and accuracy 15 / 15