Biases in NLP Models and What It Takes to Control them Kai-Wei - PowerPoint PPT Presentation

Biases in NLP Models and What It Takes to Control them Kai-Wei Chang 1

A carton of ML (NLP) pipeline Prediction Evaluation (Structured) Inference Auxiliary Corpus/Models Representation (e.g, word embedding) Data Kai-Wei Chang (kw@kwchang.net) 2

Motivate Example: Coreference Resolution • Coreference resolution is biased 1,2 • Model fails for female when given same context Semantics Only w/ Syntactic Cues his ⇒ her 1 Zhao et al. Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods. NAACL 2018. 2 Rudinger et al. Gender Bias in Coreference Resolution. NAACL 2018 Kai-Wei Chang (kw@kwchang.net) 3

Wino-bias data v Stereotypical dataset v Anti-stereotypical dataset Kai-Wei Chang (kw@kwchang.net) 4

Gender bias in Coref System 78 73 68 63 58 53 48 Neural Coref Model E2E E2E (Debiased WE) E2E (Full model) Steoetype Anti-Steoretype Avg Kai-Wei Chang (kw@kwchang.net) 5

Gender bias in Coref System 78 73 68 63 58 53 48 Neural Coref Model E2E E2E (Debiased WE) E2E (Full model) Steoetype Anti-Steoretype Avg Kai-Wei Chang (kw@kwchang.net) 6

Gender bias in Coref System 78 73 68 63 58 53 48 Neural Coref Model Mitigate WE Bias Mitigate Data Bias E2E E2E (Debiased WE) E2E (Full model) Steoetype Anti-Steoretype Avg Kai-Wei Chang (kw@kwchang.net) 7

Misrepresentation and Bias Kai-Wei Chang (kw@kwchang.net) 8

Stereotypes Which word is more likely to be used by a female ? Giggle – Laugh (Preotiuc-Pietro et al. ‘16) Credit: Yulia Tsvetkov Kai-Wei Chang (kw@kwchang.net) 9

Stereotypes Which word is more likely to be used by a female ? Giggle – Laugh (Preotiuc-Pietro et al. ‘16) Credit: Yulia Tsvetkov Kai-Wei Chang (kw@kwchang.net) 10

Stereotypes Which word is more likely to be used by a older person ? Impressive – Amazing (Preotiuc-Pietro et al. ‘16) Credit: Yulia Tsvetkov Kai-Wei Chang (kw@kwchang.net) 11

Stereotypes Which word is more likely to be used by a older person ? Impressive – Amazing (Preotiuc-Pietro et al. ‘16) Credit: Yulia Tsvetkov Kai-Wei Chang (kw@kwchang.net) 12

Why do we intuitively recognize a default social group? Credit: Yulia Tsvetkov 13

Why do we intuitively recognize a default social group? Implicit Bias Credit: Yulia Tsvetkov 14

BIASED AI Data is riddled with Implicit Bias Modified from Yulia Tsvetkov’s slide 15

Bias in Wikipedia v Only small portion of editors are female v Have less extensive articles about women v Have fewer topics important to women. (Ruediger et al., 2010) Kai-Wei Chang (kw@kwchang.net) 16

BIASED AI Consequence: models are biased Credit: Yulia Tsvetkov 17

Bias in Language Generation The Woman Worked as a Babysitter: On Biases in Language Generation (Sheng EMNLP 2019) • Language generation is biased (GPT-2) Kai-Wei Chang (kw@kwchang.net) 18

Where’s Biases? Kai-Wei Chang (kw@kwchang.net) 19

A carton of ML (NLP) pipeline Prediction Evaluation (Structured) Inference Auxiliary Corpus/Models Representation (e.g, word embedding) Data Kai-Wei Chang (kw@kwchang.net) 20

Representational Harm in NLP: Word Embeddings can be Sexist Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings [ Bolukbasi et al. NeurIPS16] Given gender direction ( 𝑤 #$ − 𝑤 &#$ ) , find word pairs with parallel direction by cos(𝑤 , − 𝑤 - , 𝑤 #$ − 𝑤 &#$ ) he: _______ she:_______ brother sister she beer cocktail he physician registered_nurse professor associate professor Google w2v embedding trained from the news Kai-Wei Chang (kw@kwchang.net) 21

Implicit association test (IAT) v Greenwald et al. 1998 v Detect the strength of a person's subconscious association between mental representations of objects (concepts) Boy Math Girl Reading https://en.wikipedia.org/wiki/Implicit-association_test https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 22

Implicit association test (IAT) Boy Girl https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 23

Implicit association test (IAT) Boy Girl Emily https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 24

Implicit association test (IAT) Boy Girl Tom https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 25

Implicit association test (IAT) Math Reading https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 26

Implicit association test (IAT) Math Reading number https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 27

Implicit association test (IAT) Boy Girl Math Reading https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 28

Implicit association test (IAT) Boy Girl Math Reading Algebra https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 29

Implicit association test (IAT) Boy Girl Math Reading Julia https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 30

Implicit association test (IAT) Boy Girl Reading Math https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 31

Implicit association test (IAT) Boy Girl Reading Math Literature https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 32

Implicit association test (IAT) Boy Girl Reading Math Dan https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 33

Implicit association test (IAT) https://implicit.harvard.edu Kai-Wei Chang (kw@kwchang.net) 34

Word Embedding Association Test (WEAT) • X : “mathematics”, “science”; Y : “arts”, “design” • A : “male”, “boy”; B : “female”, “girl” “mathematics” “male”, “boy” “female”, “girl” Caliskan et al. Semantics derived automatically from language corpora contain human-like biases Science. 2017 35 Kai-Wei Chang (kw@kwchang.net)

Word Embedding Association Test (WEAT) • X : “mathematics”, “science”; Y : “arts”, “design” • A : “male”, “boy”; B : “female”, “girl” Differential association of the two sets of words with the Aggregate the target words attributes Caliskan et al. Semantics derived automatically from language corpora contain human-like biases Science. 2017 36 Kai-Wei Chang (kw@kwchang.net)

Word Embedding Association Test (WEAT) • X : “mathematics”, “science”; Y : “arts”, “design” • A : “male”, “boy”; B : “female”, “girl” The effect size of bias: Caliskan et al. Semantics derived automatically from language corpora contain human-like biases Science. 2017 37 Kai-Wei Chang (kw@kwchang.net)

Word Embedding Association Test Caliskan et al. (2017) IAT WEAT Kai-Wei Chang (kw@kwchang.net) 38

Word Embedding Association Test Caliskan et al. (2017) Kai-Wei Chang (kw@kwchang.net) 39 WEAT finds similar biases in Word Embeddings as IAT did for humans

she he father mother king queen Kai-Wei Chang (kw@kwchang.net) 40

Kai-Wei Chang (kw@kwchang.net) 41

Can we Extend the Analysis beyond Binary Gender? Kai-Wei Chang (kw@kwchang.net) 42

Beyond Gender & Race/Ethnicity Bias Manzini et al. NAACL 2019 Biases in word embeddings trained on Kai-Wei Chang (kw@kwchang.net) 43 the Reddit data from US users.

How about other Embedding? Kai-Wei Chang (kw@kwchang.net) 44

Bias Only in English? v Language with grammatical gender v Morphological agreement (Zhou et al, EMNLP 2019) Kai-Wei Chang (kw@kwchang.net) 45

v Linear Discriminative Analysis (LDA) v Identify grammatical gender direction feminine words masculine words Kai-Wei Chang (kw@kwchang.net) 46

masculine Female Male feminine Kai-Wei Chang (kw@kwchang.net) 47

How about bilingual embedding? [Zhou et al. EMNLP19] Female doctor in Spanish male doctor in Spanish Kai-Wei Chang (kw@kwchang.net) 50

How about Contextualized Representation? Gender Bias in Contextualized Word Embeddings Zhao et al. NAACL 19 v First two components explain more variance than others (Feminine) The driver stopped the car at the hospital because she was paid to do so (Masculine) The driver stopped the car at the hospital because he was paid to do so gender direction: ELMo(driver) – ELMo(driver) Kai-Wei Chang (kw@kwchang.net) 51

Biases in NLP Models and What It Takes to Control them Kai-Wei - PowerPoint PPT Presentation

Biases in NLP Models and What It Takes to Control them Kai-Wei Chang 1 A carton of ML (NLP) pipeline Prediction Evaluation (Structured) Inference Auxiliary Corpus/Models Representation (e.g, word embedding) Data Kai-Wei Chang

ASEAN Airline Takes Wing ASEAN Airline Takes Wing ASEAN Airline Takes Wing ASEAN Airline Takes

Heuristics and biases Tina Nane 2 Heuristics and biases Lotto Icon by Dapete is

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Unconscious Bias 1 Questions to Start: Are we aware of our unconscious biases? Do we accept

TEXT AND TEXT AND AUTOMATED BIASES AUTOMATED BIASES NATURAL LANGUAGES ARE THE NATURAL

Biases in Decision Making Alexander Felfernig alexander.felfernig@ist.tugraz.at Decision Biases

Investigating Potential Investigating Potential Biases in Aerosol Light Biases in Aerosol Light

Capital Budgeting: Biases (Welch, Chapter 13-5) Ivo Welch More Biases Overconfidence Are you

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Ontologies for NLP NLP for Ontologies FOIS 2014 - LogOnto Workshop on Logics and Ontologies for

Lay Them Down Chorus: Lay them down, Lay them down, Lay your branches down for Him Spread them

NLP Programming Tutorial 7 - Topic Models Graham Neubig Nara Institute of Science and Technology

Capsule Networks for NLP Will Merrill Advanced NLP 10/25/18 Capsule Networks: A Better ConvNet

Natural Language Processing with Deep Learning Footprint of Societal Biases in NLP Navid

Paycheck Protection Program EXPLAINED CHERYL PANTHER, CPA/PFS, ADFA/CDFA LILI VASILEFF, CFP,

Carol Kando-Pineda Counsel, Division of Consumer and Business Education, Federal Trade Commission

How to Bootstrap a BSD Conference Li-Wen Hsu <lwhsu@FreeBSD.org> Something about Me Li-Wen

Efficient matchmaking in assignment games with application to online platforms Peng Shi (USC)

Deep Learning Tutorial Part I Greg Shakhnarovich TTI-Chicago December 2016 Deep Learning

Algebra I Solving & Graphing Inequalities 2016-01-11 www.njctl.org Slide 3 / 182 Table of

Video Sur Video Sur rveillance, rveillance, , Video Analyti Video Analyti ics, and You.

Dynamic Federations Seamless aggregation of standard-protocol-based storage endpoints Fabrizio

Biases in NLP Models and What It Takes to Control them Kai-Wei - PowerPoint PPT Presentation

Biases in NLP Models and What It Takes to Control them Kai-Wei Chang 1 A carton of ML (NLP) pipeline Prediction Evaluation (Structured) Inference Auxiliary Corpus/Models Representation (e.g, word embedding) Data Kai-Wei Chang

ASEAN Airline Takes Wing ASEAN Airline Takes Wing ASEAN Airline Takes Wing ASEAN Airline Takes

Heuristics and biases Tina Nane 2 Heuristics and biases Lotto Icon by Dapete is

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Unconscious Bias 1 Questions to Start: Are we aware of our unconscious biases? Do we accept

TEXT AND TEXT AND AUTOMATED BIASES AUTOMATED BIASES NATURAL LANGUAGES ARE THE NATURAL

Biases in Decision Making Alexander Felfernig alexander.felfernig@ist.tugraz.at Decision Biases

Investigating Potential Investigating Potential Biases in Aerosol Light Biases in Aerosol Light

Capital Budgeting: Biases (Welch, Chapter 13-5) Ivo Welch More Biases Overconfidence Are you

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Ontologies for NLP NLP for Ontologies FOIS 2014 - LogOnto Workshop on Logics and Ontologies for

Lay Them Down Chorus: Lay them down, Lay them down, Lay your branches down for Him Spread them

NLP Programming Tutorial 7 - Topic Models Graham Neubig Nara Institute of Science and Technology

Capsule Networks for NLP Will Merrill Advanced NLP 10/25/18 Capsule Networks: A Better ConvNet

Natural Language Processing with Deep Learning Footprint of Societal Biases in NLP Navid

Paycheck Protection Program EXPLAINED CHERYL PANTHER, CPA/PFS, ADFA/CDFA LILI VASILEFF, CFP,

Carol Kando-Pineda Counsel, Division of Consumer and Business Education, Federal Trade Commission

How to Bootstrap a BSD Conference Li-Wen Hsu &lt;lwhsu@FreeBSD.org&gt; Something about Me Li-Wen

Efficient matchmaking in assignment games with application to online platforms Peng Shi (USC)

Deep Learning Tutorial Part I Greg Shakhnarovich TTI-Chicago December 2016 Deep Learning

Algebra I Solving &amp; Graphing Inequalities 2016-01-11 www.njctl.org Slide 3 / 182 Table of

Video Sur Video Sur rveillance, rveillance, , Video Analyti Video Analyti ics, and You.

Dynamic Federations Seamless aggregation of standard-protocol-based storage endpoints Fabrizio

How to Bootstrap a BSD Conference Li-Wen Hsu <lwhsu@FreeBSD.org> Something about Me Li-Wen

Algebra I Solving & Graphing Inequalities 2016-01-11 www.njctl.org Slide 3 / 182 Table of