What It Takes to Control Societal Bias in Natural Language Processing
Kai-Wei Chang UCLA
References: http://kwchang.net
1 Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Kai-Wei Chang UCLA References: http://kwchang.net Kai-Wei Chang - - PowerPoint PPT Presentation
What It Takes to Control Societal Bias in Natural Language Processing Kai-Wei Chang UCLA References: http://kwchang.net Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 1 Always working?!
1 Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 2
http://viralscape.com/travel-expectations-vs-reality/
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 3
Generating Natural Language Adversarial Examples [ASEHSC(EMNLP 18)] Retrofitting Contextualized Word Embeddings with Paraphrases [SCZC (EMNLP 19)]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 4
Semantics Only w/ Syntactic Cues
5
2Rudinger et al. Gender Bias in Coreference Resolution. NAACL 2018
1, Gender Bias in Coreference Resolution: Evaluation and Debiasing
Methods [ZWYOC NAACL 2018]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 6
[ZWYOC NAACL 2018]
48 53 58 63 68 73 78 E2E E2E (Debiased WE) E2E (Full model)
Steoetype Anti-Steoretype Avg
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 7
8
The Woman Worked as a Babysitter: On Biases in Language Generation [SCNP EMNLP 2019]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 9
[ACL 2019]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 10
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 11
Auxiliary Corpus/Models (e.g, word embedding)
Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings [BCZSK NeurIPS16]
12 We use Google w2v embedding trained from the news
he: ________ she:_______ brother sister beer cocktail physician registered_nurse professor associate professor
Concurrent work: replicated IAT findings using word embeddings
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 13
he she father mother king queen
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 14
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 15
[BCZSK; NeurIPS 16]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 16
Make Gender Information Transparent in Word Embedding
Learning Gender-Neutral Word Embeddings [ZZLWC; EMNLP18] Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 17
1
?
mother father doctor dimensions reserve for gender information π₯1 dimensions for other latent aspects π₯#
Make Gender Information Transparent in Word Embedding
Learning Gender-Neutral Word Embeddings [ZZLWC; EMNLP18] Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 18
π₯# π₯1
Make Gender Information Transparent in Word Embedding
Learning Gender-Neutral Word Embeddings [ZZLWC; EMNLP18] Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 19
π₯# π₯1
48 53 58 63 68 73 78 E2E E2E (Debiased WE) E2E (Full model)
Steoetype Anti-Steoretype Avg
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 20
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 21
[ZSZHCCC EMNLP19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 22
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 23
masculine words feminine words
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 24
Male Female masculine feminine
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 25
Male Female masculine feminine
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 26
Male Female masculine feminine
[ZSZHCCC EMNLP19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 27 Female doctor in Spanish male doctor in Spanish
Gender Bias in Contextualized Word Embeddings
[ZWYCOC; NAACL19]
28 Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
(Feminine) The driver stopped the car at the hospital because she was paid to do so (Masculine) The driver stopped the car at the hospital because he was paid to do so
gender direction: ELMo(driver) β ELMo(driver)
The driver stopped the car at the hospital because she was paid to do so
v Classifier
29
ELMo(occupation) β
<latexit sha1_base64="r2/BIv92hsxnS9nS8QghuqsRs=">AB9HicbVA9TwJBEJ3DL8Qv1NLmIphYkTstCTaWGIiYAIXsrcsGFv9ydw5ALv8PGQmNs/TF2/hsXuELBl0zy8t5MZuaFseAGPe/bya2tb2xu5bcLO7t7+wfFw6OmUYmrEGVUPohJIYJLlkDOQr2EGtGolCwVji6mfmtMdOGK3mPk5gFERlI3ueUoJWCckfzwRCJ1uqp3C2WvIo3h7tK/IyUIEO9W/zq9BRNIiaRCmJM2/diDFKikVPBpoVOYlhM6IgMWNtSJmgnR+9NQ9s0rP7StS6I7V39PpCQyZhKFtjMiODTL3kz8z2sn2L8KUi7jBJmki0X9RLio3FkCbo9rRlFMLCFUc3urS4dE4o2p4INwV9+eZU0qxX/olK9q5Zq1kceTiBUzgHy6hBrdQhwZQeIRneIU3Z+y8O/Ox6I152Qzx/AHzucPgN+R6w=</latexit>context gender
ELMo embeddings gender predictio n
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
v Classifier
Acc (%) 80 85 90 95 100 Male Context Female Context
gender information to
14% more accurately propagated than female
30
ELMo(occupation) β
<latexit sha1_base64="r2/BIv92hsxnS9nS8QghuqsRs=">AB9HicbVA9TwJBEJ3DL8Qv1NLmIphYkTstCTaWGIiYAIXsrcsGFv9ydw5ALv8PGQmNs/TF2/hsXuELBl0zy8t5MZuaFseAGPe/bya2tb2xu5bcLO7t7+wfFw6OmUYmrEGVUPohJIYJLlkDOQr2EGtGolCwVji6mfmtMdOGK3mPk5gFERlI3ueUoJWCckfzwRCJ1uqp3C2WvIo3h7tK/IyUIEO9W/zq9BRNIiaRCmJM2/diDFKikVPBpoVOYlhM6IgMWNtSJmgnR+9NQ9s0rP7StS6I7V39PpCQyZhKFtjMiODTL3kz8z2sn2L8KUi7jBJmki0X9RLio3FkCbo9rRlFMLCFUc3urS4dE4o2p4INwV9+eZU0qxX/olK9q5Zq1kceTiBUzgHy6hBrdQhwZQeIRneIU3Z+y8O/Ox6I152Qzx/AHzucPgN+R6w=</latexit>context gender The writer taught himself to play violin .
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
40 50 60 70 80 GloVe + ELMo
OntoNotes Pro. Anti.
Ξ: 29.6 Ξ: 26.6
31 Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 32
Representation (Structured) Inference Prediction Auxiliary Corpus/Models (e.g, word embedding) Data
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 33
Named Entity are anonymized Gender words are swapped
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 34
20 40 60 80 without augment with augment OntoNotes Pro. Anti.
35/9
Cooking Role Object agent woman food vegetable container bowl tool knife place kitchen
Whatβs the agent for this image?
?
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints [EMNLP 17*] Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, Kai-Wei Chang An example from a vSRL (visual Semantic Role Labeling) system
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
*Best Long Paper Award at EMNLP 17
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 36
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 37
38
COOKING ROLES NOUNS AGENT woman FOOD vegetable CONTAINER pot TOOL spatula
Yatskar et al. CVPR β16, Yang et al. NAACL β16, Gupta and Malik arXiv β16
Convolutional Neural Network Regression Conditional Random Field
(events)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
Adversarial Removal of Gender from Deep Image Representations [WZYCO19; ICCV 19] Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 39
40
41 Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
42
Dataset Model RBA
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
43
Sontag et al., 2011; Rush and Collins, 2012; Chang and Collins, 2011; Peng et al., 2015, Chang et al., 2013; Dalvi, 2015
44
Sontag et al., 2011; Rush and Collins, 2012; Chang and Collins, 2011; Peng et al., 2015, Chang et al., 2013; Dalvi, 2015
45
π3 β₯ 0
Lagrangian :
Sontag et al., 2011; Rush and Collins, 2012; Chang and Collins, 2011; Peng et al., 2015, Chang et al., 2013; Dalvi, 2015
46
Sontag et al., 2011; Rush and Collins, 2012; Chang and Collins, 2011; Peng et al., 2015, Chang et al., 2013; Dalvi, 2015
47
Sontag et al., 2011; Rush and Collins, 2012; Chang and Collins, 2011; Peng et al., 2015, Chang et al., 2013; Dalvi, 2015
48
Sontag et al., 2011; Rush and Collins, 2012; Chang and Collins, 2011; Peng et al., 2015, Chang et al., 2013; Dalvi, 2015
49
50
Male bias
51
Male bias
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 52
[ACL 2019]
53
Embeddings for the input sentence
w1 w2 w3 w4
An encoder to produce contextualized representations
s1 s2 s3 s4
A decoder that makes (structured) predictions
h1 h2 h3 h4 Credit: Nanyun Peng
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
54
Multi-Lingual Embeddings for the input sentence
w1 w2 w3 w4
An encoder to produce contextualized representations
s1 s2 s3 s4
A decoder that makes (structured) predictions
h1 h2 h3 h4 Credit: Nanyun Peng
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
55
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
56
Dozat and Manning (ICLR2017)
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
57
Flexible positional encoding (order-free)
Shaw et. al. (NAACL2018) Vaswani et. al. (NIPS 2017) [WZMCN NAACL 19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
v BiLSTMs (order-sensitive) v.s. v Multi-Head Self-Attention with Absolute Relative Positional Encoding (order-free)
v Pointer Network (order-sensitive) v.s. v BiAffine Attention (order-free)
58
Facebook MUSE
Conneau et. al. ICLR2018
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
59
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
10 20 30 40 50 60 70 80 90 100 English Swedish Spanish Croatian Hindi
Zero-shot Transfer UAS Results (Except for English)
RNN-Stack SelfAtt-Stack RNN-Graph SelfAtt-Graph
60
Distances to English increase, Transfer performances decrease.
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
61
The languages (x-axis) are sorted by this relative frequency from high to low
Kai-Wei Chang (http://kwchang.net/talks/genderbias/)
[WZMCN CoNLL 19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 62
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 63
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 64
[TPC EMNLP 19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 65
v Consider constraints in two forms: v specifies the ratio π of POS1 being on the left in all POS1- POS2 arcs v specifies the ratio π of the heads of a particular POS appears on the left of that POS v Compiling from WALS features:
v Dominant order: e.g. 85A β Binary constraint (ADP, NN)
v Prepositions: π β (0,0.25) v No dominant order: π β (0.25,0.75) v Postpositions: π β (0.75,1)
ADJ NN ADJ NN v.s. NN NN v.s. [TPC EMNLP 19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 66
v Introduce Lagrangian multipliers for each constraints. v Apply sub-gradient descent method to solve the dual form.
v Use constraints to define a feasible distribution set π . v Find the closest distribution π β π from πC, and do MAP inference on π.
πΏπ(π ||πC)
πC
Feasible Set π πβ
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 67
v LR, PR get improvements in 15,17 out of 19 target languages from variant of language families, respectively v The improvements are closely related to the ratio gap in constraints v LR has greater average improvement, while PR is a more robust inference algorithm
IE.Indic Dravidian Turkic Austronesian [TPC EMNLP 19]
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 68
Kai-Wei Chang (http://kwchang.net/talks/genderbias/) 69
Students: Jieyu Zhao, Tianlu Wang, Pei Zhou, Weijia Shi, Wasi Ahmad, Meng Tao, Moustafa Alzantot, Emily Sheng, Tony Sun, Andrew, Gaut Collaborators: Vicente Ordonez, Nanyun Peng, Muhao Chen, Mark Yatskar, Edward Hovy, Premkumar Natarajan, Wei Wang, Mani Srivastava, Tolga Bolukbasi, James Zou, Venkatesh Saligrama, Adam Kalai, William Wang