Tsvetkov – Socially Responsible NLP 1
Yulia Tsvetkov
Algorithms for NLP
IITP, Fall 2019
Lecture 25: Computational Ethics
Algorithms for NLP IITP, Fall 2019 Lecture 25: Computational Ethics - - PowerPoint PPT Presentation
Algorithms for NLP IITP, Fall 2019 Lecture 25: Computational Ethics Yulia Tsvetkov 1 Tsvetkov Socially Responsible NLP What NLP Has To Do With Ethics? Applications Machine Translation Information Retrieval Question
Tsvetkov – Socially Responsible NLP 1
Yulia Tsvetkov
IITP, Fall 2019
Lecture 25: Computational Ethics
Tsvetkov – Socially Responsible NLP
○ Machine Translation ○ Information Retrieval ○ Question Answering ○ Dialogue Systems ○ Information Extraction ○ Summarization ○ Sentiment Analysis ○ ...
Tsvetkov – Socially Responsible NLP
The common misconception is that language has to do with words and what they mean. It doesn’t. It has to do with people and what they mean.
Herbert H. Clark & Michael F. Schober, 1992
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
“Ethics is a study of what are good and bad ends to pursue in life and what it is right and wrong to do in the conduct of life. It is therefore, above all, a practical discipline. Its primary aim is to determine how one ought to live and what actions one ought to do in the conduct of one’s life.”
Tsvetkov – Socially Responsible NLP
It’s the good things It’s the right things
Tsvetkov – Socially Responsible NLP
It’s the good things It’s the right things
Tsvetkov – Socially Responsible NLP
Should you pull the lever to divert the trolley?
[From Wikipedia]
Tsvetkov – Socially Responsible NLP
hen rooster
Tsvetkov – Socially Responsible NLP
hen rooster
➔ Ethics is inner guiding, moral principles, and values of people and society ➔ There are grey areas, there are often no give binary answers. ➔ Ethics changes over time with values and beliefs of people ➔ Legal ≠ ethical
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
should we be then trying to quantify and evaluate ethics in AI? ○ It is another problem with an ill-defined answer ■ It still has some definition of good and bad ■ Not everyone agrees on all examples ■ But they do agree on some examples ■ They do have some correlation between people ○ Complex NLP problems are also hard to quantify and evaluate ■ Summarization, QA, dialog, speech synthesis
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Let’s train a classifier to predict people’s IQ from their photos.
Tsvetkov – Socially Responsible NLP
Let’s train a classifier to predict people’s IQ from their photos.
classifier?
Tsvetkov – Socially Responsible NLP
Let’s train a classifier to predict people’s IQ from their photos.
Tsvetkov – Socially Responsible NLP
Let’s train a classifier to predict people’s IQ from their photos.
○ Evaluation reveals that white females have 95% accuracy ○ People with blond hair under age of 25 have only 60% accuracy
Tsvetkov – Socially Responsible NLP
Let’s train a classifier to predict people’s IQ from their photos.
Tsvetkov – Socially Responsible NLP
Let’s train a classifier to predict people’s IQ from their photos.
○ Researcher/developer? Reviewer? University? Society?
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Applications pervasive in our daily life!
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
lives?
Tsvetkov – Socially Responsible NLP
censorship, fake news, targeted content
Tsvetkov – Socially Responsible NLP
○ Is my NLP model capturing social stereotypes? ○ Are my classifier’s predictions fair?
○ E.g., Persuasive Language generation ■ in targeted advertisement, say, in Payday loan ads?
○ Demographic factors prediction (gender, age, etc.) ○ Sexual orientation prediction
○ Hate speech detection ○ Monitoring disease outbreaks etc. ○ Psychological monitoring/counseling ○ Low resource NLP ○ +many more
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a female ? (Preotiuc-Pietro et al. ‘16)
Giggle – Laugh
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a female ? (Preotiuc-Pietro et al. ‘16)
Giggle – Laugh
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a female ? (Preotiuc-Pietro et al. ‘16)
Brutal – Fierce
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a female ? (Preotiuc-Pietro et al. ‘16)
Brutal – Fierce
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a older person ? (Preotiuc-Pietro et al. ‘16)
Impressive – Amazing
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a older person ? (Preotiuc-Pietro et al. ‘16)
Impressive – Amazing
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a person of higher occupational class ? (Preotiuc-Pietro et al. ‘16)
Suggestions – Proposals
Tsvetkov – Socially Responsible NLP
Which word is more likely to be used by a person of higher occupational class ? (Preotiuc-Pietro et al. ‘16)
Suggestions – Proposals
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Bias in ML ⬄ Cognitive bias ⬄ Human biases in ML
○ Bias of an estimator: the difference between this estimator's expected value and the true value
○ Inductive bias: assumptions made by the model to learn the target function and to generalize beyond training data
Economics
○ Our brains are evolutionarily hard-wired to store learned information for rapid retrieval and automatic judgments. Stereotypes inevitably form because of the innate tendency of the human mind to categorize the world to simplify processing
○ A mismatch between the data and assumptions used to build a model and the actual populations who would benefit from the technology.
Tsvetkov – Socially Responsible NLP
Kahneman & Tversky 1973, 1974, 2002
System 1
automatic fast parallel automatic effortless associative slow-learning
System 2
effortful slow serial controlled effort-filled rule-governed flexible
Tsvetkov – Socially Responsible NLP
~10MP
Tsvetkov – Socially Responsible NLP
Our brains are evolutionarily hard-wired to store learned information for rapid retrieval and automatic judgments. Over 95% of cognition is relegated to the System 1 “auto-pilot.” System 1
automatic
System 2
effortful
Tsvetkov – Socially Responsible NLP
Stereotypes inevitably form because of the innate tendency of the human mind to:
encounters a category member
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Stereotypes are internalized as associations through natural processes of learning and categorization
[Image credit: Geoff Kaufman]
Tsvetkov – Socially Responsible NLP
Social stereotypes are not necessarily negative, but still have negative effect
[Image credit: Geoff Kaufman]
Tsvetkov – Socially Responsible NLP
Implicit biases are distressingly pervasive, operate largely unconsciously, and can automatically influence the ways in which we see and treat others, even when we are determined to be fair and objective.
[Image credit: Geoff Kaufman]
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Category Items Good Spectacular, Appealing, Love, Triumph, Joyous, Fabulous, Excitement, Excellent Bad Angry, Disgust, Rotten, Selfish, Abuse, Dirty, Hatred, Ugly African Americans European Americans
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Terrell et al. (2016)
Tsvetkov – Socially Responsible NLP
Micro-inequities: ephemeral, covert, unintentional, frequently unrecognized events that reinforce power dynamics or perceptions of “difference”
slights, exclusions, slips of the tongue, nonverbal signals, unchecked assumptions, unequal expectations, etc.
Tsvetkov – Socially Responsible NLP
Online Disinhibition Effect (Suler’04) Benign disinhibition and Toxic disinhibition ➔ Dissociative anonymity (“You don’t know me”) ➔ Invisibility (“You can’t see me”) ➔ Asynchronicity (“See you later”) ➔ Solipsistic Introjection (“It’s all in my head”) ➔ Dissociative Imagination (“It’s just a game”) ➔ Minimization of Status and Authority (“Your rules don’t apply here”)
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Fear of confirming a negative stereotype about one’s group (Steele & Aronson, 1995)
and undermine one’s confidence and ability to succeed
sense of belonging or self-belief in a particular domain ○ e.g., women in STEM: Beasley & Fischer’12; Shapiro & Williams’12; Cimpian & Leslie’ 17;
Tsvetkov – Socially Responsible NLP
Readings About the Social Animal, 8th edition, ed. E. Aronson
Tsvetkov – Socially Responsible NLP
[Slide credit: Geoff Kaufman]
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
System 1: Stereotype Activation
cognitive processes of categorization, etc.
regardless of personal prejudice level; a “mental habit” System 2: Preventing Stereotype Application
their impact or weaken their power…
Tsvetkov – Socially Responsible NLP
Wittenbrink et al. 2001) ○ Deliberately and repeatedly negating stereotypes or associating individuals with counter-stereotypic traits or attributes
○ Cultivating a deliberative mindset, reminding oneself of egalitarian goals, reinforcing curiosity and constructive uncertainty about others
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
AI is only System 1
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Online data is riddled with SOCIAL STEREOTYPES
BIASED
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Consequence: models are biased
BIASED
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
1. Bolukbasi T., Chang K.-W., Zou J., Saligrama V., Kalai A. (2016) Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word
2. Caliskan, A., Bryson, J. J. and Narayanan, A. (2017) Semantics derived automatically from language corpora contain human-like biases. Science 3. Nikhil Garg, Londa Schiebinger, Dan Jurafsky, James Zou. (2018) Word embeddings quantify 100 years of gender and ethnic stereotypes. PNAS.
Slide from SRNLP Tutorial at NAACL 2018
Tsvetkov – Socially Responsible NLP 1. Bolukbasi et al. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. NIPS (2016) 2. Caliskan, et al. Semantics derived automatically from language corpora contain human-like biases. Science (2017) 3. Garg et al. Word embeddings quantify 100 years of gender and ethnic stereotypes. PNAS. (2018) 4. Zhao, Jieyu, et al. Men also like shopping: Reducing gender bias amplification using corpus-level constraints. arXiv (2017) 5. Zhao, Jieyu, et al. Gender bias in coreference resolution: Evaluation and debiasing methods. arXiv (2018) 6. Zhang, et al. Mitigating unwanted biases with adversarial learning. AIES, 2018 7. Webster, Kellie, et al. Mind the GAP: A Balanced Corpus of Gendered Ambiguous Pronouns. TACL (2018) 8. Svetlana and Mohammad. Examining gender and race bias in two hundred sentiment analysis systems. arXiv (2018) 9. Díaz, et al. Addressing age-related bias in sentiment analysis. CHI Conference on Human Factors in Computing Systems. (2018) 10. Dixon, et al. Measuring and mitigating unintended bias in text classification. AIES. (2018) 11. Prates, et al. Assessing gender bias in machine translation: a case study with Google Translate. Neural Computing and Applications (2018) 12. Park, et al. Reducing gender bias in abusive language detection. arXiv (2018) 13. Zhao, Jieyu, et al. Learning gender-neutral word embeddings. arXiv (2018) 14. Anne Hendricks, et al. Women also snowboard: Overcoming bias in captioning models. ECCV. (2018) 15. Elazar and Goldberg. Adversarial removal of demographic attributes from text data. arXiv (2018) 16. Hu and Strout. Exploring Stereotypes and Biased Data with the Crowd. arXiv (2018) 17. Swinger, De-Arteaga, et al. What are the biases in my word embedding? AIES (2019) 18. De-Arteaga et al. Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting. FAT* (2019) 19. Gonen, et al. Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them. NAACL (2019). 20. Manzini et al. Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings. NAACL (2019). 21. …
2018 2019
Tsvetkov – Socially Responsible NLP
Science (2017)
Word Embeddings. NAACL (2019).
Tsvetkov – Socially Responsible NLP
○ Zhao, Jieyu, et al. Gender bias in coreference resolution: Evaluation and debiasing methods. arXiv (2018) ○ Webster, Kellie, et al. Mind the GAP: A Balanced Corpus of Gendered Ambiguous Pronouns. TACL (2018)
○ Svetlana and Mohammad. Examining gender and race bias in two hundred sentiment analysis systems. arXiv (2018) ○ Díaz, et al. Addressing age-related bias in sentiment analysis. CHI Conference on Human Factors in Comp. Systems. (2018)
○ Dixon, et al. Measuring and mitigating unintended bias in text classification. AIES. (2018)
○ De-Arteaga et al. Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting. FAT* (2019)
○ Prates, et al. Assessing gender bias in machine translation: a case study with Google Translate. Neural Computing and Applications (2018)
Tsvetkov – Socially Responsible NLP
Amplification using Corpus-level Constraint. EMNLP (2017)
Bias in a High-Stakes Setting. FAT* (2019)
Tsvetkov – Socially Responsible NLP
Tsvetkov – Socially Responsible NLP
real-world stereotypes
Tsvetkov – Socially Responsible NLP
○ Preventing discrimination in AI-based technologies ■ in consumer products and services ■ in diagnostics, in medical systems ■ in parole decisions ■ in mortgage lending, credit scores, and other financial decisions ■ in educational applications ■ in search → access to information and knowledge
○ Improving performance particularly where our model’s accuracy is lower
Tsvetkov – Socially Responsible NLP
http://demo.clab.cs.cmu.edu/ethical_nlp/
Tsvetkov – Socially Responsible NLP