fair regression
play

Fair Regression: Quantitative Definitions and Reduction- Based - PowerPoint PPT Presentation

Fair Regression: Quantitative Definitions and Reduction- Based Algorithms Steven Wu (University of Minnesota) Joint work with: Alekh Agarwal and Miro Dudk (Microsoft Research) Problem setting Distribution ! over examples: ( ", $,


  1. Fair Regression: Quantitative Definitions and Reduction- Based Algorithms Steven Wu (University of Minnesota) Joint work with: Alekh Agarwal and Miro Dudík (Microsoft Research)

  2. Problem setting • Distribution ! over examples: ( ", $, %) • ": feature vector • $: discrete protected attribute (e.g. racial groups, gender) • % ∈ [0, 1]: real-valued label (e.g. risk score, recidivism rate) • Prediction task: given loss function ℓ (e.g. square loss, logistic loss) find a predictor . ∈ / to minimize 0 1 [ ℓ(%, .(")] • ℓ is 1-Lipschitz: ≤ 3 − 3 6 + |4 − 4 6 | ℓ 3, 4 − ℓ 3 6 , 4 6

  3. Fairness notion: Statistical Parity • Statistical parity (SP): !(#) is independent of protected attribute % & ! # ≥ ( % = * ] = & ! # ≥ ( for all groups * and ( ∈ [0, 1] • Implies any thresholding of !(#) is fair! • Motivated by practice of affirmative action as well as four-fifths rule

  4. Fairness notion: Bounded Group Loss • Bounded group loss (BGL): bounded group loss at level ! " # [ℓ &, ( ) |+ = -] ≤ ! for all groups -. • Enforces minimum prediction quality for each group • Diagnostic to detect groups requiring further data collection, better features, etc. • Similar to minmax fairness

  5. Main results • Reduction-based algorithm: a provably efficient algorithms that iteratively solves a sequence of supervised learning problems (without fairness constraints): • Risk minimization under ℓ • Square loss minimization • Cost-sensitive classification (or weighted classification problem) • Finite sample guarantees on: • Accuracy • Fairness violations

  6. Empirical Evaluation • Fairness constraint: statistical parity • Data sets: Adult, Law School, Communities & Crime • Losses: square loss, logistic loss • Reductions: • Cost-sensitive classification (CS) • Square loss minimization (LS) • Logistic loss minimization (LR) • Predictor classes: linear and tree ensemble

  7. Statistical Parity Disparity (CDF distance)

  8. Statistical Parity Disparity (CDF distance)

  9. Fair Regression: Quantitative Definitions and Reduction- Based Algorithms Poster: Thurs @ Pacific Ballroom #132

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend