Gendered Language Pamela Jakiela Owen Ozier CGD, BREAD, & IZA - - PowerPoint PPT Presentation

gendered language
SMART_READER_LITE
LIVE PREVIEW

Gendered Language Pamela Jakiela Owen Ozier CGD, BREAD, & IZA - - PowerPoint PPT Presentation

Gendered Language Pamela Jakiela Owen Ozier CGD, BREAD, & IZA World Bank, BREAD, & IZA August 2019 Motivation Language structures thought: Languages differ not only in how they build their sentences but also in how they break


slide-1
SLIDE 1

Gendered Language

Pamela Jakiela Owen Ozier CGD, BREAD, & IZA World Bank, BREAD, & IZA August 2019

slide-2
SLIDE 2

Motivation

Language structures thought:

“Languages differ not only in how they build their sentences but also in how they break down nature to secure elements to put in those sentences.” – Benjamin Lee Whorf (1941)

Jakiela and Ozier (2019) Gendered Language, Slide 2

slide-3
SLIDE 3

Motivation

Language structures thought:

“Languages differ not only in how they build their sentences but also in how they break down nature to secure elements to put in those sentences.” – Benjamin Lee Whorf (1941)

Sapir-Whorf Hypothesis: linguistic determinism

  • Our native language limits the scope of our thinking
  • Example [now debunked]: the Inuit have 7 different words for snow

Jakiela and Ozier (2019) Gendered Language, Slide 2

slide-4
SLIDE 4

Motivation

Language structures thought:

“Languages differ not only in how they build their sentences but also in how they break down nature to secure elements to put in those sentences.” – Benjamin Lee Whorf (1941)

Sapir-Whorf Hypothesis: linguistic determinism

  • Our native language limits the scope of our thinking
  • Example [now debunked]: the Inuit have 7 different words for snow

Nonetheless, there is mounting evidence that the languages we speak influence our thoughts and actions in subtle, subconscious ways

Jakiela and Ozier (2019) Gendered Language, Slide 2

slide-5
SLIDE 5

Motivation: Language Structures Thought

“Languages are far from impartial ‘containers’ for the packaging of underlying thoughts, but rather are active players in the construction of those thoughts.” – Ogunnaike et al. (2010)

Language shapes our thoughts and actions in subtle ways:

  • Russian blue (Winawer et al. 2007; Maier and Rahman 2018)
  • Agentive language (Fausey et al. 2010, Fausey and Boroditsky 2011)
  • Speakers of languages that treat the future as a separate tense

save less than those treating the future like the present (Chen 2013)

Jakiela and Ozier (2019) Gendered Language, Slide 3

slide-6
SLIDE 6

Motivation: Gender Norms

Australia Qatar

.2 .4 .6 .8 1

Percent agreeing: when jobs are scarce, men should have more of a right to a job than women

Australia Qatar

.2 .4 .6 .8 1

Percent agreeing: when a woman works, the children suffer

Jakiela and Ozier (2019) Gendered Language, Slide 4

slide-7
SLIDE 7

Motivation: Gender and Language

Linguistic gender distinctions:

  • Using different words for occupations (profesor/profesora)
  • Pronominal distinctions between men and women
  • Nominal classification systems (grammatical gender)

Jakiela and Ozier (2019) Gendered Language, Slide 5

slide-8
SLIDE 8

Motivation: Gender and Language

Linguistic gender distinctions:

  • Using different words for occupations (profesor/profesora)
  • Pronominal distinctions between men and women
  • Nominal classification systems (grammatical gender)

Do linguistic gender distinctions impact gender norms?

Grammatical gender creates “a habitual consciousness of two sex classes as a standard classifacatory fact in our thought-world.”

– Benjamin Lee Whorf (1936) Builds on arguments advanced by Durkheim and Mauss (1903)

Jakiela and Ozier (2019) Gendered Language, Slide 5

slide-9
SLIDE 9

Motivation: Gender and Language

Suggestive evidence of a link between grammar and gender norms:

  • Givati and Troiano (2012) show that countries with gendered

pronouns have shorter government-mandated maternity leaves

  • Perez and Tavits (2018) show that grammatical gender impacts

gender attitudes among Estonian/Russian bilinguals

  • Santacreu-Vasut et al. (2013) and Shoham and Lee (2017) use

World Atlas of Language Structures to estimate cross-country relationship between grammatical gender, gender-related outcomes

  • Hicks et al. (2015) use WALS data to look at US immigrants

Jakiela and Ozier (2019) Gendered Language, Slide 6

slide-10
SLIDE 10

Our Contribution

  • 1. Characterize the grammatical gender structure of 4,336 languages

which together account for 99 percent of the world’s population

◮ India: 6 languages coded in WALS, we code 281 ◮ Kenya: 3 languages coded in WALS, we code (all) 51

  • 2. Estimate the proportion of each country’s population whose native

language uses a grammatical gender system to classify nouns

◮ Estimate the cross-country relationship between grammatical gender and women’s labor force participation and educational attainment, and the relationship with gender attitudes among men and women

  • 3. Use individual-level data from countries where both gender and

non-gender languages are indigenous and widely spoken

◮ Replicate cross-country results within countries

Jakiela and Ozier (2019) Gendered Language, Slide 7

slide-11
SLIDE 11

Outline of the Talk

  • 1. What is grammatical gender?
  • 2. Conceptual framework
  • 3. Identifying gender languages
  • 4. Cross-country analysis

◮ Labor force participation, educational attainment, attitudes

  • 5. Within-country analysis

◮ Labor force participation, educational attainment

  • 6. Discussion and conclusion

Jakiela and Ozier (2019) Gendered Language, Slide 8

slide-12
SLIDE 12

Preview of Main Results

  • About 38 percent of the world’s population speaks gender languages
  • Women who grow up speaking gender languages are less likely:

◮ To be in the labor force ◮ To complete primary school

  • Both men and women who grow up speaking gender languages hold

more traditional views of women’s role in society vis-a-vis men’s

Jakiela and Ozier (2019) Gendered Language, Slide 9

slide-13
SLIDE 13

Grammatical Gender

slide-14
SLIDE 14

Motivation: Gender and Language

Languages differ in their treatment of gender:

  • Pronominal distinctions between men and women
  • Nominal classification systems (grammatical gender)

Jakiela and Ozier (2019) Gendered Language, Slide 11

slide-15
SLIDE 15

Motivation: Gender and Language

Languages differ in their treatment of gender:

  • Pronominal distinctions between men and women
  • Nominal classification systems (grammatical gender)

Example: Swahili does not make pronominal gender distinctions

she goes to school he goes to school

  • [yeye] anaenda shuleni

Jakiela and Ozier (2019) Gendered Language, Slide 11

slide-16
SLIDE 16

Motivation: Gender and Language

Languages differ in their treatment of gender:

  • Pronominal distinctions between men and women
  • Nominal classification systems (grammatical gender)

Example: Swahili does not make pronominal gender distinctions

she goes to school he goes to school

  • [yeye] anaenda shuleni

There are different words for males and females (e.g. “boy” vs. “girl”), but genders are treated identically from a grammatical perspective

Jakiela and Ozier (2019) Gendered Language, Slide 11

slide-17
SLIDE 17

Motivation: Gender and Language

Example: Spanish uses different pronouns for males and females

she goes to school = ella va a la escuela he goes to school = ´ el va a la escuela

Jakiela and Ozier (2019) Gendered Language, Slide 12

slide-18
SLIDE 18

Motivation: Gender and Language

Example: Spanish uses different pronouns for males and females

she goes to school = ella va a la escuela he goes to school = ´ el va a la escuela

Spanish uses a system of grammatical gender to classify nouns

  • All Spanish nouns are either masculine or feminine
  • Grammatical gender determines agreement (e.g. with adjectives)

Jakiela and Ozier (2019) Gendered Language, Slide 12

slide-19
SLIDE 19

Nominal classification

Most languages have a system for categorizing nouns (Aikhenvald 2003)

  • Many languages partition nouns into noun classes or genders

Jakiela and Ozier (2019) Gendered Language, Slide 13

slide-20
SLIDE 20

Nominal classification

Most languages have a system for categorizing nouns (Aikhenvald 2003)

  • Many languages partition nouns into noun classes or genders

Elements of a noun class often often share morphological properties:

  • Spanish:

◮ Masculine words end in O ◮ Feminine words end in A

  • Swahili: class prefixes are used as class names

◮ small items belong in ki-/vi- class, humans in m-/wa- class

Jakiela and Ozier (2019) Gendered Language, Slide 13

slide-21
SLIDE 21

Nominal classification

Most languages have a system for categorizing nouns (Aikhenvald 2003)

  • Many languages partition nouns into noun classes or genders

Elements of a noun class often often share morphological properties:

  • Spanish:

◮ Masculine words end in O ◮ Feminine words end in A

  • Swahili: class prefixes are used as class names

◮ small items belong in ki-/vi- class, humans in m-/wa- class

Typical noun class system = semantic core + many exceptions

Jakiela and Ozier (2019) Gendered Language, Slide 13

slide-22
SLIDE 22

Nominal classification

Noun classes are defined by agreement — eg. nouns with adjectives Example: Swahili has nine distinct noun classes, each characterized by a set of prefixes for verbs, adjectives, demonstratives, possessives, etc.

[noun] new these these new chairs = viti vipya hivi these new teachers = walimu wapya hawa

Example: agreement depends on gender (masc./fem.) in Spanish

the [noun] white the white shirt = la camisa blanca the white hat = el sombrero blanco

Jakiela and Ozier (2019) Gendered Language, Slide 14

slide-23
SLIDE 23

Grammatical Gender

A grammatical gender system is a system of noun classification that:

  • Includes masculine and feminine as two of the classes
  • Characterizes (some) inanimate objects as masculine or feminine

◮ English is not a gender language∗ (though it uses gender pronouns)

Jakiela and Ozier (2019) Gendered Language, Slide 15

slide-24
SLIDE 24

Grammatical Gender

A grammatical gender system is a system of noun classification that:

  • Includes masculine and feminine as two of the classes
  • Characterizes (some) inanimate objects as masculine or feminine

◮ English is not a gender language∗ (though it uses gender pronouns)

Languages that use grammatical gender — a.k.a. gender languages — differ in grammatical gender intensity along several dimensions

  • Do the masculine and feminine classes partition the noun space?

◮ Many languages have a neuter class (eg. German, Russian)

  • How many parts of speech must change to reflect agreement?

◮ Example: verbs agree with gender in Russian, but not in Spanish

Jakiela and Ozier (2019) Gendered Language, Slide 15

slide-25
SLIDE 25

Does Grammatical Gender Matter?

Conventional wisdom is that grammatical gender is arbitrary:

“In German, a young lady has no sex, while a turnip has.” – Mark Twain

Jakiela and Ozier (2019) Gendered Language, Slide 16

slide-26
SLIDE 26

Does Grammatical Gender Matter?

Conventional wisdom is that grammatical gender is arbitrary:

“In German, a young lady has no sex, while a turnip has.” – Mark Twain

Some linguists have questioned this assumption (cf. Lakoff 1987), arguing that gender categories have a certain... cultural intelligibility

  • In Dyirbal, women are grouped with fire and “dangerous things”
  • In Ket, one linguist suggested that certain small mammals are

feminine “because they are of no importance to the Kets”

  • Assignment of inanimate objects to grammatical gender categories
  • ften reflects stereotypes about male vs. female body types

Jakiela and Ozier (2019) Gendered Language, Slide 16

slide-27
SLIDE 27

Does Grammatical Gender Matter?

Jakiela and Ozier (2019) Gendered Language, Slide 17

slide-28
SLIDE 28

Does Grammatical Gender Matter?

Native German speakers said: Native Spanish speakers said: hard golden heavy intricate jagged little metal lovely serrated shiny

Jakiela and Ozier (2019) Gendered Language, Slide 17

slide-29
SLIDE 29

Does Grammatical Gender Matter?

Native German speakers said: Native Spanish speakers said: hard golden heavy intricate jagged little metal lovely serrated shiny der Schl¨ ussel la llave (masculine) (feminine)

Source: Boroditsky et al. (2002)

Jakiela and Ozier (2019) Gendered Language, Slide 17

slide-30
SLIDE 30

Does Grammatical Gender Matter?

Native German speakers said: Native Spanish speakers said: beautiful big elegant dangerous fragile long peaceful strong pretty sturdy

Jakiela and Ozier (2019) Gendered Language, Slide 18

slide-31
SLIDE 31

Does Grammatical Gender Matter?

Native German speakers said: Native Spanish speakers said: beautiful big elegant dangerous fragile long peaceful strong pretty sturdy die Br¨ ucke el puente (feminine) (masculine)

Source: Boroditsky et al. (2002)

Jakiela and Ozier (2019) Gendered Language, Slide 18

slide-32
SLIDE 32

Does Grammatical Gender Matter?

Linguistic evidence focuses on words, not grammatical structure

  • Words reflect culture, structures (usually) do not (McWhorter 2019)

Jakiela and Ozier (2019) Gendered Language, Slide 19

slide-33
SLIDE 33

Does Grammatical Gender Matter?

Linguistic evidence focuses on words, not grammatical structure

  • Words reflect culture, structures (usually) do not (McWhorter 2019)

Evidence (from other social sciences) that grammatical gender matters:

  • Perez and Tavits (2018): Estonian/Russian bilinguals show greater

support for gender equality when (randomly) interviewed in Estonian

  • Santacreu-Vasut et al. (2013): political quotas for women are more

common in countries where the national language is non-gender

  • Hicks et al. (2015): immigrants are more likely to divide household

tasks along gender lines if they grew up speaking a gender language

Jakiela and Ozier (2019) Gendered Language, Slide 19

slide-34
SLIDE 34

Does Grammatical Gender Matter?

Linguistic evidence focuses on words, not grammatical structure

  • Words reflect culture, structures (usually) do not (McWhorter 2019)

Evidence (from other social sciences) that grammatical gender matters:

  • Perez and Tavits (2018): Estonian/Russian bilinguals show greater

support for gender equality when (randomly) interviewed in Estonian

  • Santacreu-Vasut et al. (2013): political quotas for women are more

common in countries where the national language is non-gender

  • Hicks et al. (2015): immigrants are more likely to divide household

tasks along gender lines if they grew up speaking a gender language Existing empirical work hampered by data limitations

  • Limited to country case studies, dominant national langauges, WALS

Jakiela and Ozier (2019) Gendered Language, Slide 19

slide-35
SLIDE 35

Conceptual Framework

slide-36
SLIDE 36

Conceptual Framework: Gendered Domains

Intuition: grammatical gender predisposes us to partition the world into male, female; entering a domain that doesn’t match your gender is costly 1 λ 1 − λ Masculine domains: proportion female < λ Feminine domains: proportion female > 1 − λ

Jakiela and Ozier (2019) Gendered Language, Slide 21

slide-37
SLIDE 37

Conceptual Framework: Gendered Domains

Intuition: grammatical gender predisposes us to partition the world into male, female; entering a domain that doesn’t match your gender is costly 1 λ 1 − λ Masculine domains: proportion female < λ Feminine domains: proportion female > 1 − λ Implications: ⇒ Grammatical gender makes individual decisions strategic ⇒ Decision to enter depends on gender composition of entrants

Jakiela and Ozier (2019) Gendered Language, Slide 21

slide-38
SLIDE 38

Conceptual Framework: Gendered Domains

Consider two examples:

  • Educational attainment
  • Division of labor within the household (who works)

Jakiela and Ozier (2019) Gendered Language, Slide 22

slide-39
SLIDE 39

Conceptual Framework: Gendered Domains

Consider two examples:

  • Educational attainment
  • Division of labor within the household (who works)

Continuous distribution of ability types:

  • Girl (or woman, female, mother) i’s ability γ, γ ∼ Fγ
  • Boy (or man, male, father) i’s ability β, γ ∼ Fβ
  • Distributions are well-behaved
  • Ability translates into return to education, wages through some

smooth, increasing transformation (use ability as shorthand)

Jakiela and Ozier (2019) Gendered Language, Slide 22

slide-40
SLIDE 40

Conceptual Framework: Educational Attainment

In the absence of grammatical gender, it is individually optimal to attend school when net return is positive — a decision, not a strategic game Proportion of girls who attend school: Fγ(γ∗) γ∗ γ = Female ability

Jakiela and Ozier (2019) Gendered Language, Slide 23

slide-41
SLIDE 41

Conceptual Framework: Educational Attainment

In the absence of grammatical gender, it is individually optimal to attend school when net return is positive — a decision, not a strategic game Proportion of girls who attend school: Fγ(γ∗) γ∗ γ = Female ability Assuming equal numbers, proportion of students who are female given by: P∗

girls =

1 − Fγ(γ∗) 2 − Fβ(β∗) − Fγ(γ∗)

Jakiela and Ozier (2019) Gendered Language, Slide 23

slide-42
SLIDE 42

Conceptual Framework: Educational Attainment

Proportion of girls who attend school: Fγ(γ∗) γ∗ γ = Female ability Proportion of boys who attend school: Fβ(β∗) β∗ β = Male ability

Jakiela and Ozier (2019) Gendered Language, Slide 24

slide-43
SLIDE 43

Conceptual Framework: Educational Attainment

Proportion of girls who attend school: Fγ(γ∗) γ∗ γ = Female ability Proportion of boys who attend school: Fβ(β∗) β∗ β = Male ability Proportion of students who are female: Pneutral

girls

= 1 − Fγ(γ∗) 2 − Fβ(β∗) − Fγ(γ∗) 1 λ 1 − λ Pneutral

girls

Jakiela and Ozier (2019) Gendered Language, Slide 24

slide-44
SLIDE 44

Conceptual Framework: Educational Attainment

Proportion of girls who attend school: Fγ(γ∗) γ∗ γ = Female ability Proportion of boys who attend school: Fβ(β∗) β∗ β = Male ability

Jakiela and Ozier (2019) Gendered Language, Slide 25

slide-45
SLIDE 45

Conceptual Framework: Educational Attainment

Proportion of girls who attend school: Fγ(γ∗) γ∗ γ = Female ability Proportion of boys who attend school: Fβ(β∗) β∗ β = Male ability Proportion of students who are female: Pneutral

girls

= 1 − Fγ(γ∗) 2 − Fβ(β∗) − Fγ(γ∗) 1 λ 1 − λ Pneutral

girls

Jakiela and Ozier (2019) Gendered Language, Slide 25

slide-46
SLIDE 46

Conceptual Framework: Educational Attainment

Proportion of girls who attend school: Fγ(γmasc) < Fγ(γ∗) γ∗ γmasc γ = Female ability Proportion of boys who attend school: Fβ(β∗) β∗ β = Male ability

Jakiela and Ozier (2019) Gendered Language, Slide 26

slide-47
SLIDE 47

Conceptual Framework: Educational Attainment

Proportion of girls who attend school: Fγ(γmasc) < Fγ(γ∗) γ∗ γmasc γ = Female ability Proportion of boys who attend school: Fβ(β∗) β∗ β = Male ability Proportion female: Pmasc

girls =

1 − Fγ(γmasc) 2 − Fβ(β∗) − Fγ(γmasc) 1 λ 1 − λ Pmasc

girls

Jakiela and Ozier (2019) Gendered Language, Slide 26

slide-48
SLIDE 48

Conceptual Framework: Educational Attainment

  • Multiple equilibria are possible

◮ Human capital attainment is lower in gendered equilibria

  • When schools are single-sex, (β∗, γ∗) is unique equilibrium

◮ Explains recent disappearance of gender gaps in education, persistence of gaps in labor force participation in the Middle East

  • Compulsory education (i.e. cost of non-attendance) has two effects:

◮ Moves some children into school directly ◮ Changes beliefs about proportion of students who are girls/boys, narrowing scope for gendered equilibria (under some assumptions)

Jakiela and Ozier (2019) Gendered Language, Slide 27

slide-49
SLIDE 49

Conceptual Framework: Labor Force Participation

A mom and a dad maximize consumption given wages wm = γ and wd = β, with someone (mom, dad, or a nanny earning wn) at home: C = γLm + βLd − wnHn where Hm + Lm = 1, Hd + Ld = 1, and Hm + Hd + Hn = 1 In the absence of grammatical gender:

  • A mom or a dad who earns more than a nanny always works
  • When one or both parents earn less than a nanny, we expect

specialization: the lower-earning parent does all the childcare

Jakiela and Ozier (2019) Gendered Language, Slide 28

slide-50
SLIDE 50

Conceptual Framework: Labor Force Participation

Without grammatical gender, households make decisions independently

Mom at home: β>γ and γ<wn Nanny at home: β>wn and γ>wn Dad at home: β<γ and β<wn wn

β = Father's Ability

wn

γ = Mother's Ability Jakiela and Ozier (2019) Gendered Language, Slide 29

slide-51
SLIDE 51

Conceptual Framework: Labor Force Participation

Proportion of households hiring a nanny: P∗

nanny =

β=βmax

β=wn

γ=γmax

γ=wn

fβ,γ (β, γ) Proportion of households where mother works, father stays home: P∗

mom =

β=wn

β=0

γ=β

γ=0

fβ,γ (β, γ) + β=βmax

β=wn

γ=wn

γ=0

fβ,γ (β, γ) Proportion of households where father works, mother stays home: P∗

dad =

β=γ

β=0

γ=wn

γ=0

fβ,γ (β, γ) + β=wn

β=wn

γ=γmax

γ=wn

fβ,γ (β, γ)

Jakiela and Ozier (2019) Gendered Language, Slide 30

slide-52
SLIDE 52

Conceptual Framework: Labor Force Participation

NN: home , work FM: home , work

a b c d e f g h i j k l m wn - φ wn wn + φ

β = Father's Ability

wn - φ wn wn + φ

γ = Mother's Ability

a b c d e f g h i j k l m wn - φ wn wn + φ

β = Father's Ability

wn - φ wn wn + φ

γ = Mother's Ability

When domains can be gendered, six different equilibria are possible

  • An equilibrium always exist, and multiple equilibria can exist

Jakiela and Ozier (2019) Gendered Language, Slide 31

slide-53
SLIDE 53

Conceptual Framework: Labor Force Participation

NN: home , work FM: home , work

a b c d e f g h i j k l m wn - φ wn wn + φ

β = Father's Ability

wn - φ wn wn + φ

γ = Mother's Ability

a b c d e f g h i j k l m wn - φ wn wn + φ

β = Father's Ability

wn - φ wn wn + φ

γ = Mother's Ability

When domains can be gendered, six different equilibria are possible

  • An equilibrium always exist, and multiple equilibria can exist

Psychic costs make decisions strategic without social costs, creating potential for gender-segregated equilibria (and a lower ability workforce)

Jakiela and Ozier (2019) Gendered Language, Slide 31

slide-54
SLIDE 54

Identifying Gender Languages

slide-55
SLIDE 55

The World’s Languages

The Ethnologue is the most comprehensive database of languages

  • Includes over 7,000; 6,190 of them living oral native languages

Jakiela and Ozier (2019) Gendered Language, Slide 33

slide-56
SLIDE 56

The World’s Languages

In many (LIC/LMIC) countries, the most widely spoken native language accounts for a small fraction of the population (e.g. 0.18 in Nigeria)

Jakiela and Ozier (2019) Gendered Language, Slide 34

slide-57
SLIDE 57

Classifying Gender Structures

We compile data on grammatical structures from a range of sources:

  • World Atlas of Language Structures
  • Linguistic Survey of India

◮ Compiled by George A. Grierson between 1891 and 1928

  • George L. Campbell’s Compendium of the World’s Languages
  • Language-specific data sources:

◮ Grammatical monographs ◮ Language textbooks and online learning materials ◮ Academic work by (modern) linguists ◮ Interviews with native speakers and translators

Jakiela and Ozier (2019) Gendered Language, Slide 35

slide-58
SLIDE 58

Classifying Gender Structures

For each language, we attempt to code two variables:

  • A indicator for using any system of grammatical gender
  • A indicator for using a dichotomous system of grammatical gender

◮ All nouns must be either masculine or feminine

Jakiela and Ozier (2019) Gendered Language, Slide 36

slide-59
SLIDE 59

Classifying Gender Structures

For each language, we attempt to code two variables:

  • A indicator for using any system of grammatical gender
  • A indicator for using a dichotomous system of grammatical gender

◮ All nouns must be either masculine or feminine

We do not attempt to determine:

  • The number of genders/classes, if there are more than two
  • The intensity of the agreement system (i.e. what must agree)
  • The presence of gendered personal pronouns (for humans)

Jakiela and Ozier (2019) Gendered Language, Slide 36

slide-60
SLIDE 60

Classifying Gender Structures

Languages positively identified as gender languages in two ways:

  • 1. Explicit statement about grammatical gender structure

Serbian: “Three grammatical genders (masculine, feminine, and neuter) and two numbers (singular and plural) are also distinguished.” Tigrinya: “Tigrinya nouns are either masculine or feminine and are inflected for number. Gender is not marked on the noun, but on nominal dependents like articles and adjectives. Verbs agree with their subjects and objects in person, number, and gender.”

Jakiela and Ozier (2019) Gendered Language, Slide 37

slide-61
SLIDE 61

Classifying Gender Structures

Languages positively identified as gender languages in two ways:

  • 1. Explicit statement about grammatical gender structure

Serbian: “Three grammatical genders (masculine, feminine, and neuter) and two numbers (singular and plural) are also distinguished.” Tigrinya: “Tigrinya nouns are either masculine or feminine and are inflected for number. Gender is not marked on the noun, but on nominal dependents like articles and adjectives. Verbs agree with their subjects and objects in person, number, and gender.”

  • 2. A textbook or language-specific grammar indicates that:

◮ There are masculine and feminine noun classes (genders), at least

  • ne of which includes nouns other than male/female animates

◮ Adjectives or another part of speech must agree in gender

Jakiela and Ozier (2019) Gendered Language, Slide 37

slide-62
SLIDE 62

Classifying Gender Structures

Languages identified as non-gender languages in the same ways:

  • 1. Explicit statement about grammatical gender structure

Gamo: “The use of gender is governed by non-linguistic factors — i.e. by the actual sex of the referent.” Maithili: “Modern Maithili, however, has no grammatical gender. In other words, in modern Maithili, distinctions of gender are determined soley by the sex of the animate noun.” Nuosu: “There is no grammatical gender, and such words as do not denote animate beings have no gender at all.”

Jakiela and Ozier (2019) Gendered Language, Slide 38

slide-63
SLIDE 63

Classifying Gender Structures

Languages identified as non-gender languages in the same ways:

  • 1. Explicit statement about grammatical gender structure

Gamo: “The use of gender is governed by non-linguistic factors — i.e. by the actual sex of the referent.” Maithili: “Modern Maithili, however, has no grammatical gender. In other words, in modern Maithili, distinctions of gender are determined soley by the sex of the animate noun.” Nuosu: “There is no grammatical gender, and such words as do not denote animate beings have no gender at all.”

  • 2. A textbook or language-specific grammar describes nouns or

nominals without mentioning any noun class system, or describes a system of classes that do not include either masculine or feminine

Jakiela and Ozier (2019) Gendered Language, Slide 38

slide-64
SLIDE 64

Classifying Gender Structures

We classify more than 95 percent of population in all but eight countries

Jakiela and Ozier (2019) Gendered Language, Slide 39

slide-65
SLIDE 65

The Distribution of Gender Languages

Native speakers of gender languages: 38 percent of world’s population

→ [Comparison with WALS]

Jakiela and Ozier (2019) Gendered Language, Slide 40

slide-66
SLIDE 66

Cross-Country Analysis

slide-67
SLIDE 67

Cross-Country Analysis: Data

  • 1. Labor force participation

◮ World Development Indicators ◮ Available for 177 countries

Jakiela and Ozier (2019) Gendered Language, Slide 42

slide-68
SLIDE 68

Cross-Country Analysis: Data

  • 1. Labor force participation

◮ World Development Indicators ◮ Available for 177 countries

  • 2. Educational attainment (primary and secondary school completion)

◮ Barro-Lee Educational Attainment Data ◮ Available for 142 countries

Jakiela and Ozier (2019) Gendered Language, Slide 42

slide-69
SLIDE 69

Cross-Country Analysis: Data

  • 1. Labor force participation

◮ World Development Indicators ◮ Available for 177 countries

  • 2. Educational attainment (primary and secondary school completion)

◮ Barro-Lee Educational Attainment Data ◮ Available for 142 countries

  • 3. Gender attitudes

◮ World Values Survey, Round 6 ◮ Available for 56 countries

Jakiela and Ozier (2019) Gendered Language, Slide 42

slide-70
SLIDE 70

Cross-Country Analysis: Empirical Specifications

We estimate OLS regressions of the form: Yc = α + βGenderc + δcontinent + λXc + εc where:

  • Genderc is the proportion of population speaking gender language
  • δcontinent is a vector of continent fixed effects
  • Xc is a vector of country-level geographic controls:

◮ Average rainfall, average temperature, proportion tropical, indicator for being landlocked, suitability for the plough

  • εc is a mean-zero error term

Jakiela and Ozier (2019) Gendered Language, Slide 43

slide-71
SLIDE 71

Cross-Country Analysis: Robust Inference

  • 1. Measurement error in country-level prevalence of gender languages

◮ Bounding exercise following Imbens and Manski (2004)

Jakiela and Ozier (2019) Gendered Language, Slide 44

slide-72
SLIDE 72

Cross-Country Analysis: Robust Inference

  • 1. Measurement error in country-level prevalence of gender languages

◮ Bounding exercise following Imbens and Manski (2004)

  • 2. Non-independence of languages with families

◮ Permutation test based on structure of the language tree

Jakiela and Ozier (2019) Gendered Language, Slide 44

slide-73
SLIDE 73

Cross-Country Analysis: Assessing Causality

  • 1. Examine within-country gender differences, where applicable

◮ Applies to LFP and education, not gender attitudes

Jakiela and Ozier (2019) Gendered Language, Slide 45

slide-74
SLIDE 74

Cross-Country Analysis: Assessing Causality

  • 1. Examine within-country gender differences, where applicable

◮ Applies to LFP and education, not gender attitudes

  • 2. Examine coefficient stability, robustness to observable controls

◮ Follow Altonji et al. (2005), Oster (forthcoming)

Jakiela and Ozier (2019) Gendered Language, Slide 45

slide-75
SLIDE 75

Cross-Country Analysis: Assessing Causality

  • 1. Examine within-country gender differences, where applicable

◮ Applies to LFP and education, not gender attitudes

  • 2. Examine coefficient stability, robustness to observable controls

◮ Follow Altonji et al. (2005), Oster (forthcoming)

  • 3. Replicate cross-country results using within-country variation

Jakiela and Ozier (2019) Gendered Language, Slide 45

slide-76
SLIDE 76

Cross-Country Analysis: Female LFP

20 40 60 80 100

LFPf

  • 80 -60 -40 -20

20

LFPf - LFPm Proportion gender < 0.1 0.1 < proportion gender < 0.9 Proportion gender > 0.9

Jakiela and Ozier (2019) Gendered Language, Slide 46

slide-77
SLIDE 77

Cross-Country Analysis: Female LFP

Dependent variable: LFPf LFPf - LFPm Specification: OLS OLS OLS OLS (1) (2) (3) (4) Proportion gender

  • 13.83
  • 11.92
  • 11.61
  • 14.66

(2.80) (3.34) (2.47) (3.25) [p < 0.001] [p < 0.001] [p < 0.001] [p < 0.001] Continent Fixed Effects No Yes No Yes Country-Level Geography Controls No Yes No Yes Observations 178 178 178 178 R2 0.15 0.33 0.12 0.47

Robust standard errors are clustered by the most widely spoken language in all specifications; they are reported in parentheses. P-values are reported in square brackets. LFPf is the percentage of women in the labor force, measured in 2011. LFPf - LFPm is the gender difference in labor force participation — i.e. the difference between female and male labor force participation, again measured in 2011. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough. Jakiela and Ozier (2019) Gendered Language, Slide 47

slide-78
SLIDE 78

Cross-Country Analysis: Female LFP

  • 80
  • 60
  • 40
  • 20

20

LFPmale - LFPfemale Dominican Republic: 30

th percentile

Jamaica: 48

th percentile

Estimated coefficients are economically significant:

  • Grammatical gender could fully explain the disparity in female labor

force participation between Jamaica and the Dominican Republic

  • Grammatical gender keeps 125 million women out of work force

Jakiela and Ozier (2019) Gendered Language, Slide 48

slide-79
SLIDE 79

Cross-Country Analysis: Female LFP

Robustness checks:

  • Marginal impact of stronger grammatical gender systems
  • Including “bad” controls
  • Omitting major world languages

Jakiela and Ozier (2019) Gendered Language, Slide 49

slide-80
SLIDE 80

Cross-Country Analysis: Educational Attainment

20 40 60 80 100

primaryf

  • 60 -40 -20

20 40

primaryf - primarym Proportion gender < 0.1 0.1 < proportion gender < 0.9 Proportion gender > 0.9

→ Primary education by continent

Jakiela and Ozier (2019) Gendered Language, Slide 50

slide-81
SLIDE 81

Cross-Country Analysis: Educational Attainment

20 40 60 80 100

secondaryf

  • 60 -40 -20

20 40

secondarym - secondaryf Proportion gender < 0.1 0.1 < proportion gender < 0.9 Proportion gender > 0.9

→ Secondary education by continent

Jakiela and Ozier (2019) Gendered Language, Slide 51

slide-82
SLIDE 82

Cross-Country Analysis: Educational Attainment

Dependent variable: PRIf PRIf - PRIm Specification: OLS OLS OLS OLS (1) (2) (3) (4) Proportion gender 14.79

  • 6.71

1.21

  • 3.72

(5.83) (4.40) (2.14) (2.16) [0.013] [0.130] [0.573] [0.088] Continent Fixed Effects No Yes No Yes Country-Level Geography Controls No Yes No Yes Observations 142 142 142 142 R2 0.06 0.61 0.00 0.20

Robust standard errors are clustered by the most widely spoken language in all specifications; they are reported in parentheses. P- values are reported in square brackets. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough. Jakiela and Ozier (2019) Gendered Language, Slide 52

slide-83
SLIDE 83

Cross-Country Analysis: Educational Attainment

Dependent variable: SECf SECf - SECm Specification: OLS OLS OLS OLS (1) (2) (3) (4) Proportion gender 14.52 0.43 0.48

  • 0.86

(5.77) (3.70) (1.93) (2.35) [0.013] [0.907] [0.802] [0.716] Continent Fixed Effects No Yes No Yes Country-Level Geography Controls No Yes No Yes Observations 142 142 142 142 R2 0.06 0.67 0.00 0.10

Robust standard errors are clustered by the most widely spoken language in all specifications; they are reported in parentheses. P- values are reported in square brackets. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough. Jakiela and Ozier (2019) Gendered Language, Slide 53

slide-84
SLIDE 84

Cross-Country Analysis: Gender Attitudes

World Values Survey includes 8 questions on gender attitudes:

  • When a mother works for pay, the children suffer [1]
  • When jobs are scarce, men should have more right to a job than women [1]
  • On the whole, men make better political leaders than women do [1]
  • On the whole, men make better business executives than women do [1]
  • Being a housewife is just as fulfilling as working for pay [1]
  • If a woman earns more money than her husband, it’s almost certain to cause problems [1]
  • A university education is more important for a boy than for a girl [1]
  • Having a job is the best way for a woman to be an independent person [0]

Jakiela and Ozier (2019) Gendered Language, Slide 54

slide-85
SLIDE 85

Cross-Country Analysis: Gender Attitudes

*** * ** *** *** ** ***

p = 0.685 p = 0.005 p = 0.081 p = 0.042 p = 0.009 p = 0.005 p = 0.012 p = 0.006

Having a job not best way to be independent University is more important for boys If a wife earns more, it causes problems Being a housewife as fulfilling as paid work When a mother works, the children suffer Men make better business executives Men have more right to a scarce job Men make better political leaders

.1 .2 .3 .4

Proportion speaking gender language

Jakiela and Ozier (2019) Gendered Language, Slide 55

slide-86
SLIDE 86

Cross-Country Analysis: Gender Attitudes

Dependent variable: Gender Attitude Index Specification: OLS OLS (1) (2) Proportion gender

  • 0.03
  • 0.12

(0.05) (0.04) [0.576] [0.002] Continent Fixed Effects No Yes Country-Level Geography Controls No Yes Observations 56 56 R2 0.01 0.78

Robust standard errors clustered by most widely spoken language in all specifications. The Gender Attitude Index is the first principal component of responses to the eight questions on gender attitudes included in the World Values Survey. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough. Jakiela and Ozier (2019) Gendered Language, Slide 56

slide-87
SLIDE 87

Cross-Country Analysis: Gender Attitudes

.2 .4 .6 .8 1 Gender Attitude Index

Yemen Jordan Egypt Libya Qatar Uzbekistan Pakistan Tunisia Algeria Kuwait Bahrain Iraq Azerbaijan Nigeria India Turkey Morocco Philippines Kyrgyzstan Ghana Malaysia Lebanon Kazakhstan Armenia Russia Georgia Belarus South Africa Rwanda Thailand China Ukraine Japan Singapore Zimbabwe South Korea Estonia Ecuador Poland Romania Mexico Brazil Colombia Cyprus Trinidad and Tobago Peru Chile Uruguay Slovenia United States New Zealand Spain Germany Australia Netherlands Sweden

Belarus: 49

th percentile

Trinidad and Tobago: 80

th percentile

Jakiela and Ozier (2019) Gendered Language, Slide 57

slide-88
SLIDE 88

Cross-Country Analysis: Gender Attitudes

Attitudes among Women: Attitudes among Men:

* * * ** ** ** **

p = 0.679 p = 0.053 p = 0.076 p = 0.082 p = 0.027 p = 0.047 p = 0.018 p = 0.022

Having a job not best way to be independent University is more important for boys If a wife earns more, it causes problems Being a housewife as fulfilling as paid work When a mother works, the children suffer Men make better business executives Men have more right to a scarce job Men make better political leaders

.1 .2 .3 .4

Proportion speaking gender language

*** ** *** *** ** ***

p = 0.224 p = 0.001 p = 0.122 p = 0.024 p = 0.004 p = 0.001 p = 0.016 p = 0.002

Having a job not best way to be independent University is more important for boys If a wife earns more, it causes problems Being a housewife as fulfilling as paid work When a mother works, the children suffer Men make better business executives Men have more right to a scarce job Men make better political leaders

.1 .2 .3 .4

Proportion speaking gender language

Jakiela and Ozier (2019) Gendered Language, Slide 58

slide-89
SLIDE 89

Cross-Country Analysis: Gender Attitudes

Sample: Attitude Index: Women Attitude Index: Men Specification: OLS OLS OLS OLS (1) (2) (3) (4) Proportion gender

  • 0.02
  • 0.10
  • 0.04
  • 0.14

(0.05) (0.04) (0.06) (0.04) [0.714] [0.012] [0.508] [p < 0.001] Continent Fixed Effects No Yes No Yes Geography Controls No Yes No Yes Observations 56 56 56 56 R2 0.00 0.73 0.02 0.78

Robust standard errors clustered by most widely spoken language in all specifications. The Gender Attitude Index is the first principal component of responses to the eight questions on gender attitudes included in the World Values Survey. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough. Jakiela and Ozier (2019) Gendered Language, Slide 59

slide-90
SLIDE 90

Cross-Country Analysis: Measurement Error

The problem: RHS variable is an interval for 85 of 193 countries

  • Analysis thus far assumes missingness is ignorable
  • Measurement error is not classical, could bias estimates

Jakiela and Ozier (2019) Gendered Language, Slide 60

slide-91
SLIDE 91

Cross-Country Analysis: Measurement Error

The problem: RHS variable is an interval for 85 of 193 countries

  • Analysis thus far assumes missingness is ignorable
  • Measurement error is not classical, could bias estimates

Our approach: calculate bounds following Imbens and Manski (2004)

  • 1. Identify highest and lowest coefficient estimates numerically
  • 2. Calculate associated na¨

ıve confidence intervals, take the union

  • 3. Symmetrically tighten the confidence interval for correct coverage

Jakiela and Ozier (2019) Gendered Language, Slide 60

slide-92
SLIDE 92

Cross-Country Analysis: Measurement Error

Full data vs WALS-only data

Women's Attitudes Men's Attitudes Attitude Index PRIfemale - PRImale PRIfemale LFPfemale - LFPmale LFPfemale

  • 75
  • 50
  • 25

25 50 75 95 percent confidence interval Naive OLS CI Imbens-Manski CI

Women's Attitudes Men's Attitudes Attitude Index PRIfemale - PRImale PRIfemale LFPfemale - LFPmale LFPfemale

  • 75
  • 50
  • 25

25 50 75 95 percent confidence interval Naive OLS CI Imbens-Manski CI

→ [Manski table]

Jakiela and Ozier (2019) Gendered Language, Slide 61

slide-93
SLIDE 93

Cross-Country Analysis: Independence

The problem: languages are not independent (Roberts et al. 2015)

  • Useful variation in grammatical structure within and between families
  • Intuitively, this is a clustering problem, but countries not nested

Jakiela and Ozier (2019) Gendered Language, Slide 62

slide-94
SLIDE 94

Cross-Country Analysis: Independence

The problem: languages are not independent (Roberts et al. 2015)

  • Useful variation in grammatical structure within and between families
  • Intuitively, this is a clustering problem, but countries not nested

Our approach: permutation tests based on the language tree

  • 1. Assign languages to largest possible homogeneous clusters
  • 2. Randomly permute treatment (grammatical gender) across clusters
  • 3. Replicate cross-country analysis for each hypothetical treatment

⇒ Allows us to calculate permutation-test p-values

Jakiela and Ozier (2019) Gendered Language, Slide 62

slide-95
SLIDE 95

Cross-Country Analysis: Permutation Tests

Dravidian Southern Tulu Tulu Koraga Korra Koraga Tamil-Kannada Tamil-Kodagu Tamil-Malayalam Tamil Yerukula Tamil Irula Malayalam Ravula Paniya Malayalam Kodagu Mullu Kurumba Kodava Kannada Kurumba Jennu Kurumba Kannada Kannada Badaga South-Central Telugu Gondi-Kui Konda-Kui Mukha-Dora Kuvi Kui Koya Konda-Dora Gondi Northern Gondi Aheri Gondi Adilabad Gondi Central Parji-Gadaba Pottangi Ollar Gadaba Duruwa Kolami-Naiki Northwest Kolami Northern Sauria Paharia Kurux Kumarbhag Paharia Brahui

Jakiela and Ozier (2019) Gendered Language, Slide 63

slide-96
SLIDE 96

Cross-Country Analysis: Permutation Tests

Dravidian Southern Tulu Tulu Koraga Korra Koraga Tamil-Kannada Tamil-Kodagu Tamil-Malayalam Tamil Yerukula Tamil Irula Malayalam Ravula Paniya Malayalam Kodagu Mullu Kurumba Kodava Kannada Kurumba Jennu Kurumba Kannada Kannada Badaga South-Central Telugu Gondi-Kui Konda-Kui Mukha-Dora Kuvi Kui Koya Konda-Dora Gondi Northern Gondi Aheri Gondi Adilabad Gondi Central Parji-Gadaba Pottangi Ollar Gadaba Duruwa Kolami-Naiki Northwest Kolami Northern Sauria Paharia Kurux Kumarbhag Paharia Brahui

Jakiela and Ozier (2019) Gendered Language, Slide 64

slide-97
SLIDE 97

Cross-Country Analysis: Permutation Tests

Dravidian Southern Tulu Tulu Koraga Korra Koraga Tamil-Kannada Tamil-Kodagu Tamil-Malayalam Tamil Yerukula Tamil Irula Malayalam Ravula Paniya Malayalam Kodagu Mullu Kurumba Kodava Kannada Kurumba Jennu Kurumba Kannada Kannada Badaga South-Central Telugu Gondi-Kui Konda-Kui Mukha-Dora Kuvi Kui Koya Konda-Dora Gondi Northern Gondi Aheri Gondi Adilabad Gondi Central Parji-Gadaba Pottangi Ollar Gadaba Duruwa Kolami-Naiki Northwest Kolami Northern Sauria Paharia Kurux Kumarbhag Paharia Brahui

Jakiela and Ozier (2019) Gendered Language, Slide 65

slide-98
SLIDE 98

Cross-Country Analysis: Permutation Tests

Dravidian Southern Tulu Tulu Koraga Korra Koraga Tamil-Kannada Tamil-Kodagu Tamil-Malayalam Tamil Yerukula Tamil Irula Malayalam Ravula Paniya Malayalam Kodagu Mullu Kurumba Kodava Kannada Kurumba Jennu Kurumba Kannada Kannada Badaga South-Central Telugu Gondi-Kui Konda-Kui Mukha-Dora Kuvi Kui Koya Konda-Dora Gondi Northern Gondi Aheri Gondi Adilabad Gondi Central Parji-Gadaba Pottangi Ollar Gadaba Duruwa Kolami-Naiki Northwest Kolami Northern Sauria Paharia Kurux Kumarbhag Paharia Brahui

Jakiela and Ozier (2019) Gendered Language, Slide 66

slide-99
SLIDE 99

Cross-Country Analysis: Permutation Tests

Dravidian Southern Tulu Tulu Koraga Korra Koraga Tamil-Kannada Tamil-Kodagu Tamil-Malayalam Tamil Yerukula Tamil Irula Malayalam Ravula Paniya Malayalam Kodagu Mullu Kurumba Kodava Kannada Kurumba Jennu Kurumba Kannada Kannada Badaga South-Central Telugu Gondi-Kui Konda-Kui Mukha-Dora Kuvi Kui Koya Konda-Dora Gondi Northern Gondi Aheri Gondi Adilabad Gondi Central Parji-Gadaba Pottangi Ollar Gadaba Duruwa Kolami-Naiki Northwest Kolami Northern Sauria Paharia Kurux Kumarbhag Paharia Brahui

Jakiela and Ozier (2019) Gendered Language, Slide 67

slide-100
SLIDE 100

Cross-Country Analysis: Permutation Tests

Female LFP: Gender Difference in LFP:

Jakiela and Ozier (2019) Gendered Language, Slide 68

slide-101
SLIDE 101

Cross-Country Analysis: Permutation Tests

Na¨ ıve OLS Permutation-based p-values p-values Female labor force participation 0.00050 0.01520 Gender difference in labor force participation 0.00001 0.00810 Female primary school completion 0.13012 0.16920 Gender difference in primary school completion 0.08773 0.08820 Female secondary school completion 0.90692 0.92410 Gender difference in secondary school completion 0.71638 0.73140 Gender attitudes index 0.00225 0.05030 Gender attitudes index among women 0.01223 0.09620 Gender attitudes index among men 0.00063 0.03040

P-values estimated using 10,000 permutations. For each outcome, the na¨ ıve p-value comes from the associated regression in a previous table. The permutation-based p-value is the fraction of permutations in which the magnitude of the estimated coefficient (from a hypothetical permutation of the gender indicator that respects the cluster structure of the language tree) exceeds the magnitude of the estimated coefficient in the true (non-permuted) data set. Jakiela and Ozier (2019) Gendered Language, Slide 69

slide-102
SLIDE 102

Cross-Country Analysis: Coefficient Stability

Altonji et al. (2005) and Oster (2017) propose using robustness to

  • bservable controls to assess the magnitude of omitted variable bias
  • Bias from unobservables is proportional to coefficient movements
  • Coefficient movements must be scaled by changes in R2

Jakiela and Ozier (2019) Gendered Language, Slide 70

slide-103
SLIDE 103

Cross-Country Analysis: Coefficient Stability

Altonji et al. (2005) and Oster (2017) propose using robustness to

  • bservable controls to assess the magnitude of omitted variable bias
  • Bias from unobservables is proportional to coefficient movements
  • Coefficient movements must be scaled by changes in R2

Consider a true model: Y = α + βX + ηWobservable + γWunobservable + ε

Jakiela and Ozier (2019) Gendered Language, Slide 70

slide-104
SLIDE 104

Cross-Country Analysis: Coefficient Stability

Altonji et al. (2005) and Oster (2017) propose using robustness to

  • bservable controls to assess the magnitude of omitted variable bias
  • Bias from unobservables is proportional to coefficient movements
  • Coefficient movements must be scaled by changes in R2

Consider a true model: Y = α + βX + ηWobservable + γWunobservable + ε Data on Y , X, and Wobservable tells us:

  • How much does ˆ

β change when Wobservable is included?

  • How much of the residual variation in Y is explained by Wobservable?

Jakiela and Ozier (2019) Gendered Language, Slide 70

slide-105
SLIDE 105

Cross-Country Analysis: Coefficient Stability

In this framework, δ is a proportional selection coefficient: δ denotes ratio of (i) covariance between treatment and unobserved controls to (ii) covariance between treatment and observed controls

Jakiela and Ozier (2019) Gendered Language, Slide 71

slide-106
SLIDE 106

Cross-Country Analysis: Coefficient Stability

In this framework, δ is a proportional selection coefficient: δ denotes ratio of (i) covariance between treatment and unobserved controls to (ii) covariance between treatment and observed controls Regression results with and without controls allow us to calculate:

  • True causal β∗ under the assumption that δ = 1
  • Value of δ∗ that would be required for omitted variable bias from

unobservables to fully explain observed association between X and Y

◮ Altonji et al. (2005) suggest results are robust if δ > 1

Jakiela and Ozier (2019) Gendered Language, Slide 71

slide-107
SLIDE 107

Cross-Country Analysis: Coefficient Stability

OLS Coefficients ˚ β ˜ β β∗(Rmax, 1) δ∗ Female LFP

  • 13.83
  • 11.92
  • 8.35

1.44 Gender difference in LFP

  • 11.61
  • 14.66
  • 17.87

3.24 Female primary completion 14.79

  • 6.71
  • 19.40

δ < 0 Gender difference in primary 1.21

  • 3.72
  • 6.27

δ < 0 Female secondary completion 14.52 0.43

  • 9.69

0.05 Gender difference in secondary 0.48

  • 0.86
  • 1.77

δ < 0 Gender attitude index

  • 0.03
  • 0.12
  • 0.20

δ < 0 Gender attitudes: women

  • 0.02
  • 0.10
  • 0.18

δ < 0 Gender attitudes: men

  • 0.04
  • 0.14
  • 0.23

δ < 0

Where:

  • β∗ = implied causal impact of X on Y if δ = 1
  • δ∗ = implied proportional selection coefficient under null

Jakiela and Ozier (2019) Gendered Language, Slide 72

slide-108
SLIDE 108

Within-Country Analysis

slide-109
SLIDE 109

Within-Country Analysis: Afrobarometer Data

Gender languages account for between 10 and 90 percent of population

  • f Chad, Kenya, Mauritania, Niger, Nigeria, S. Sudan, Uganda

Jakiela and Ozier (2019) Gendered Language, Slide 74

slide-110
SLIDE 110

Within-Country Analysis: Afrobarometer Data

We pool Afrobarometer data from Kenya, Niger, Nigeria, Uganda:

Survey Round Kenya Niger Nigeria Uganda Total Round 2: 2002–2003 2,353 2,116 2,238 6,707 Round 3: 2005 1,261 2,120 2,345 5,726 Round 4: 2008 1,092 2,291 2,420 5,803 Round 5: 2011–2013 2,373 1,192 2,366 2,379 8,310 Total 7,079 1,192 8,893 9,382 26,546

Identify grammatical gender structure for 99 percent of respondents

  • Respondents speak 167 different African languages

Jakiela and Ozier (2019) Gendered Language, Slide 75

slide-111
SLIDE 111

Within-Country Analysis: IHDS Data

62 percent of the Indian population speaks a gender native language

  • India Human Development Survey (IHDS) includes data on 75,966

household heads and spouses who speak 57 different languages

Jakiela and Ozier (2019) Gendered Language, Slide 76

slide-112
SLIDE 112

Within-Country Analysis: Empirical Specifications

When we restrict the sample to women: Yi = α + βGenderi + ζZi + ǫi where:

  • Genderi is an indicator for having a gender native language
  • Xi is a vector of individual-level controls

◮ Age, age2, religion indicators

  • Regressions of Afrobarometer data also include country×round FEs
  • ǫi is a mean-zero error term

Jakiela and Ozier (2019) Gendered Language, Slide 77

slide-113
SLIDE 113

Within-Country Analysis: Empirical Specifications

When we include data on both women and men: Yi = α+βGenderi+ηFemalei+θGender × Femalei+γcountry×round+ζZi+ǫi where:

  • Genderi is an indicator for having a gender native language
  • Femalei is an indicator for being female
  • Gender × Femalei is a Genderi × Femalei interaction
  • Xi is a vector of individual-level controls (age, religion, interactions)
  • Regressions of Afrobarometer data also include country×round FEs
  • ǫi is a mean-zero error term

Jakiela and Ozier (2019) Gendered Language, Slide 78

slide-114
SLIDE 114

Within-Country Analysis: Results

Labor Force Participation

  • .5
  • .4
  • .3
  • .2
  • .1

.1 Coefficient on grammatical gender Female Labor Force Participation Africa without controls Africa with controls India without controls India with controls

  • .5
  • .4
  • .3
  • .2
  • .1

.1 Coefficient on grammatical gender Gender Difference in LFP Africa without controls Africa with controls India without controls India with controls

Primary Completion Secondary Completion

  • .5
  • .4
  • .3
  • .2
  • .1

.1 Coefficient on grammatical gender Female Primary Completion Africa without controls Africa with controls India without controls India with controls

  • .5
  • .4
  • .3
  • .2
  • .1

.1 Coefficient on grammatical gender Gender Difference in Primary Africa without controls Africa with controls India without controls India with controls

  • .5
  • .4
  • .3
  • .2
  • .1

.1 Coefficient on grammatical gender Female Secondary Completion Africa without controls Africa with controls India without controls India with controls

  • .5
  • .4
  • .3
  • .2
  • .1

.1 Coefficient on grammatical gender Gender Difference in Secondary Africa without controls Africa with controls India without controls India with controls

Jakiela and Ozier (2019) Gendered Language, Slide 79

slide-115
SLIDE 115

Within-Country: Coefficient Stability

OLS Coefficients ˚ β ˜ β β∗(Rmax, 1) δ∗ Panel A. Afrobarometer Data from Kenya, Niger, Nigeria, and Uganda In labor force (women only)

  • 0.24
  • 0.18
  • 0.13

2.11 Female × in labor force (pooled)

  • 0.17
  • 0.11
  • 0.06

1.86 Completed primary (women only)

  • 0.31
  • 0.22
  • 0.15

2.18 Female × primary (pooled)

  • 0.12
  • 0.11
  • 0.10

4.64 Completed secondary (women only)

  • 0.19
  • 0.16
  • 0.14

3.47 Female × secondary (pooled)

  • 0.06
  • 0.06
  • 0.06

6.01 Panel B. India Human Development Survey III (IHDS) Data In labor force (women only)

  • 0.08
  • 0.07
  • 0.07

11.70 Female × in labor force (pooled)

  • 0.10
  • 0.08
  • 0.04

1.90 Completed primary (women only)

  • 0.14
  • 0.13
  • 0.12

12.14 Female × primary (pooled)

  • 0.13
  • 0.12
  • 0.11

13.19 Completed secondary (women only)

  • 0.03
  • 0.02
  • 0.02

7.20 Female × secondary (pooled)

  • 0.03
  • 0.03
  • 0.03

25.89

Jakiela and Ozier (2019) Gendered Language, Slide 80

slide-116
SLIDE 116

Conclusion

slide-117
SLIDE 117

Discussion & Conclusion

0 .1 .2 .3 .4

  • 2.5
  • 2
  • 1.5
  • 1
  • .5

.5 1 1.5 2 2.5 3 3.5

AFG ALB ARE ARG ARM AUS AUT BDI BEL BEN BGD BGR BHR BLZ BOL BRA BRB BRN BWA CAF CAN CHE CHL CHN CIV CMR COD COG COL CRI CUB CYP CZE DEU DNK DOM DZA ECU EGY ESP EST FIN FJI FRA GAB GBR GHA GMB GRC GTM GUY HND HRV HTI HUN IDN IND IRL IRN IRQ ISL ISR ITA JAM JOR JPN KAZ KEN KGZ KHM KOR KWT LAO LBR LBY LKA LSO LTU LUX LVA MAR MDA MDV MEX MLI MLT MMR MNG MOZ MRT MUS MWI MYS NAM NER NIC NLD NOR NPL NZL PAK PAN PER PHL PNG POL PRT PRY QAT ROU RUS RWA SAU SDN SEN SGP SLE SLV SRB SVK SVN SWE SWZ SYR TGO THA TJK TON TTO TUN TUR TZA UGA UKR URY USA VEN VNM YEM ZAF ZMB ZWE

  • 10
  • 5

5 10 15 20 25 30

Change in Gender Gap in LFP

  • 2.5
  • 2
  • 1.5
  • 1
  • .5

.5 1 1.5 2 2.5 3 3.5

Change in Gender Gap in Years of Education

  • 10
  • 5

5 10 15 20 25 30 0 .05.1.15 Jakiela and Ozier (2019) Gendered Language, Slide 82

slide-118
SLIDE 118

Discussion & Conclusion

Limiting high-ability women’s participation in the labor force is a misallocation of talent (Hsieh, Hurst, Jones, and Klenow 2018)

  • Costs may have been small historically, likely increasing over time
  • Many of the relevant constraints are social and psychological →

[Interventions 1] → [Interventions 2]

Jakiela and Ozier (2019) Gendered Language, Slide 83

slide-119
SLIDE 119

Discussion & Conclusion

Limiting high-ability women’s participation in the labor force is a misallocation of talent (Hsieh, Hurst, Jones, and Klenow 2018)

  • Costs may have been small historically, likely increasing over time
  • Many of the relevant constraints are social and psychological →

[Interventions 1] → [Interventions 2] We build a new data set that allows us to (better) test an old hypothesis about the relationship between language structure and gender equality

  • We characterize the grammatical gender structure of most of the

world’s living languages, accounting for 99 percent of the population

  • We find that grammatical gender predicts larger gender gaps in labor

force participation and education both across and within countries

Jakiela and Ozier (2019) Gendered Language, Slide 83

slide-120
SLIDE 120

Thank you!

slide-121
SLIDE 121

Additional Slides

slide-122
SLIDE 122

Motivation: Language Structures Thought

p=0.832 p=0.030 p=0.003 .5 .55 .6 .65 .7 Fraction successful German, N=29 Greek, N=28 Russian, N=46 Green on green Blue on blue Mixed color

Visual task: report presence and direction of triangular stimulus

Maier and Rahman (2018) results

→ Return to motivation

Jakiela and Ozier (2019) Gendered Language, Slide 86

slide-123
SLIDE 123

Motivation: Language Structures Thought

Fausey and Boroditsky (2011) show Spanish, English monolinguals videos depicting “intentional” and “accidental” versions of the same event

Action Intentional Accidental Knocks box Faces table, knocks box off table Knocks box off table while gesturing Breaks pencil Sits at table, breaks pencil in half Breaks pencil in half while writing

Jakiela and Ozier (2019) Gendered Language, Slide 87

slide-124
SLIDE 124

Motivation: Language Structures Thought

Fausey and Boroditsky (2011) show Spanish, English monolinguals videos depicting “intentional” and “accidental” versions of the same event

Action Intentional Accidental Knocks box Faces table, knocks box off table Knocks box off table while gesturing Breaks pencil Sits at table, breaks pencil in half Breaks pencil in half while writing

Experiment 1: subjects describe what happened

  • Spanish-speakers less likely to use agentive language to describe

accidental events, no differences observed for intentional acts

Jakiela and Ozier (2019) Gendered Language, Slide 87

slide-125
SLIDE 125

Motivation: Language Structures Thought

Fausey and Boroditsky (2011) show Spanish, English monolinguals videos depicting “intentional” and “accidental” versions of the same event

Action Intentional Accidental Knocks box Faces table, knocks box off table Knocks box off table while gesturing Breaks pencil Sits at table, breaks pencil in half Breaks pencil in half while writing

Experiment 1: subjects describe what happened

  • Spanish-speakers less likely to use agentive language to describe

accidental events, no differences observed for intentional acts Experiment 2: (other) subjects try to remember who did what

  • Both groups equally likely to remember intentional actors;

Spanish-speakers less likely to remember who caused accidents → Return to motivation

Jakiela and Ozier (2019) Gendered Language, Slide 87

slide-126
SLIDE 126

Marginal Impact of Dichotomous Gender Categories

Dependent variable: LFPf LFPf - LFPm Specification: OLS OLS OLS OLS (1) (2) (3) (4) Proportion (any) gender

  • 6.66
  • 7.19

4.29

  • 5.77

(2.54) (3.91) (1.65) (4.34) [0.010] [0.068] [0.010] [0.185] Proportion dichotomous gender

  • 10.58
  • 6.57
  • 23.44
  • 12.35

(4.78) (4.16) (3.54) (4.53) [0.029] [0.116] [p < 0.001] [0.007] Continent Fixed Effects No Yes No Yes Geography Controls No Yes No Yes Observations 178 178 178 178 R2 0.19 0.33 0.30 0.50

Robust standard errors clustered by most widely spoken language in all specifications. LFPf is the percentage of women in the labor force, measured in 2011. LFPm - LFPf is the difference between male and female labor force participation in 2011. Strong gender languages are those that partition the space of nouns into two gender categories, masculine and feminine. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough.

→ Return to robustness checks

Jakiela and Ozier (2019) Gendered Language, Slide 88

slide-127
SLIDE 127

Robustness to Potentially Endogenous Controls

Dependent variable: LFPf LFPf - LFPm Specification: OLS OLS (1) (2) Proportion speaking gender language

  • 6.66
  • 10.42

(2.80) (2.84) [p < 0.001] [p < 0.001] Continent Fixed Effects Yes Yes Country-Level Geography Controls Yes Yes Observations 176 176 R2 0.57 0.68

Robust standard errors clustered by most widely spoken language in all specifications. LFPf is the percentage of women in the labor force, measured in 2011. LFPm - LFPf is the difference between male and female labor force participation in 2011. Strong gender languages are those that partition the space of nouns into two gender categories, masculine and feminine. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough. Bad controls are log GDP per capita (in 2011), log population (in 2011), and the percentage Catholic, Protestant, other Christian, Muslim, and Hindu (taken from Alesina et al. 2013), and an indicator for former communist countries.

→ Return to robustness checks

Jakiela and Ozier (2019) Gendered Language, Slide 89

slide-128
SLIDE 128

Omitting Major World Languages

Dependent variable: LFPf LFPf – LFPm Omitted Language: Arabic English Spanish Arabic English Spanish Specification: OLS OLS OLS OLS OLS OLS (1) (2) (3) (4) (5) (6) Proportion speaking gender language

  • 6.18
  • 12.33
  • 10.10
  • 9.09
  • 15.31
  • 11.31

(3.56) (3.84) (3.87) (3.52) (3.59) (3.39) [0.085] [0.002] [0.010] [0.011] [p < 0.001] [0.001] Continent Fixed Effects Yes Yes Yes Yes Yes Yes Country-Level Geography Controls Yes Yes Yes Yes Yes Yes Observations 159 167 160 159 167 160 R2 0.21 0.34 0.37 0.31 0.49 0.51 Robust standard errors are clustered by the most widely spoken language in all specifications; they are reported in parentheses. P-values are reported in square brackets. LFPf is the percentage of women in the labor force, measured in 2011. LFPf – LFPm is the difference between male and female labor force participation in 2011. Geography controls are the percentage of land area in the tropics or subtropics, average yearly precipitation, average temperature, an indicator for being landlocked, and the Alesina et al. (2013) measure of suitability for the plough.

→ Return to robustness checks

Jakiela and Ozier (2019) Gendered Language, Slide 90

slide-129
SLIDE 129

Primary School Completion by Continent

20 40 60 80 100

primaryf

  • 60 -40 -20

20 40

primaryf-primarym Africa Asia Europe

→ Return to secondary education figure

Jakiela and Ozier (2019) Gendered Language, Slide 91

slide-130
SLIDE 130

Secondary School Completion by Continent

20 40 60 80 100

secondaryf

  • 60 -40 -20

20 40

secondaryf-secondarym Africa Asia Europe

→ Return to secondary education figure

Jakiela and Ozier (2019) Gendered Language, Slide 92

slide-131
SLIDE 131

Comparing country-level WALS data to full data

.2 .4 .6 .8 1 Measure based on WALS languages .2 .4 .6 .8 1 Measure based on full set of languages

Two measures of the fraction of a country speaking a gender language as their native language

→ Return to distribution of gender languages

Jakiela and Ozier (2019) Gendered Language, Slide 93

slide-132
SLIDE 132

Cross-Country Analysis: Measurement Error

Na¨ ıve OLS CI Imbens-Manski CI Female labor force participation [−18.533, −5.305] [−18.467, −5.013] Gender difference in labor force participation [−21.077, −8.233] [−20.916, −7.741] Female primary school completion [−15.431, 2.010] [−16.221, 1.673] Gender difference in primary school completion [−8.003, 0.559] [−8.446, 0.432] Female secondary school completion [−6.901, 7.769] [−8.261, 7.327] Gender difference in secondary school completion [−5.510, 3.799] [−5.401, 3.746] Gender attitudes index [−0.193, −0.045] [−0.194, −0.047] Gender attitudes index among women [−0.173, −0.022] [−0.173, −0.023] Gender attitudes index among men [−0.214, −0.063] [−0.215, −0.064]

footnotesizeFor each outcome, the na¨ ıve confidence interval comes from the associated regression in a previous table. The Imbens- Manski worst-case confidence interval is calculated by finding the minimum and maximum possible point estimates of the relevant coefficient based on the interval nature of the dataset (without complete data on the grammatical structure of all languages, the right-hand-side variable–the fraction of a country’s population speaking a gender language–is only observed up to an interval in some cases), then by tightening the confidence interval for correct coverage following Imbens and Manski (2004).

→ Return to Manski graph

Jakiela and Ozier (2019) Gendered Language, Slide 94

slide-133
SLIDE 133

Policy Implications: More Than Words

Interventions can leverage the salience of gender.

  • Porter and Serra (2018) show that having a female role model visit a

Principles of Economics class makes female students more likely to take Intermediate Micro, and to consider majoring in economics (no impact on male students).

  • Riley (2018) shows that watching Queen of Katwe causes Ugandan

secondary school students to perform better on a mathematics examination, with largest effects for female and lower-ability students.

Jakiela and Ozier (2019) Gendered Language, Slide 95

slide-134
SLIDE 134

Policy Implications: More Than Words

Interventions can leverage the salience of gender.

  • Porter and Serra (2018) show that having a female role model visit a

Principles of Economics class makes female students more likely to take Intermediate Micro, and to consider majoring in economics (no impact on male students).

  • Riley (2018) shows that watching Queen of Katwe causes Ugandan

secondary school students to perform better on a mathematics examination, with largest effects for female and lower-ability students.

  • sadietannerconference.org

“You can’t be what you can’t see” - Dr. Joycelyn Elders → [Back to discussion]

Jakiela and Ozier (2019) Gendered Language, Slide 95

slide-135
SLIDE 135

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

Jakiela and Ozier (2019) Gendered Language, Slide 96

slide-136
SLIDE 136

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

◮ Men in Saudi Arabia believe that other men are less supportive of female labor force participation than they really are;

Jakiela and Ozier (2019) Gendered Language, Slide 96

slide-137
SLIDE 137

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

◮ Men in Saudi Arabia believe that other men are less supportive of female labor force participation than they really are; ◮ Correcting beliefs experimently makes men more willing to forgo income so that wives can participate in online job-matching;

Jakiela and Ozier (2019) Gendered Language, Slide 96

slide-138
SLIDE 138

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

◮ Men in Saudi Arabia believe that other men are less supportive of female labor force participation than they really are; ◮ Correcting beliefs experimently makes men more willing to forgo income so that wives can participate in online job-matching; ◮ Months later, this increases the likelihood that women have participated in a job interview outside the home.

Jakiela and Ozier (2019) Gendered Language, Slide 96

slide-139
SLIDE 139

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

◮ Men in Saudi Arabia believe that other men are less supportive of female labor force participation than they really are; ◮ Correcting beliefs experimently makes men more willing to forgo income so that wives can participate in online job-matching; ◮ Months later, this increases the likelihood that women have participated in a job interview outside the home.

  • Patnaik (2018) shows that that the Quebec Parental Insurance

Program’s “daddy-only” label for some parts of parental leave

Jakiela and Ozier (2019) Gendered Language, Slide 96

slide-140
SLIDE 140

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

◮ Men in Saudi Arabia believe that other men are less supportive of female labor force participation than they really are; ◮ Correcting beliefs experimently makes men more willing to forgo income so that wives can participate in online job-matching; ◮ Months later, this increases the likelihood that women have participated in a job interview outside the home.

  • Patnaik (2018) shows that that the Quebec Parental Insurance

Program’s “daddy-only” label for some parts of parental leave

◮ increased fathers’ use of parental leave (53 percentage pts, 200 pct);

Jakiela and Ozier (2019) Gendered Language, Slide 96

slide-141
SLIDE 141

Policy Implications: More Than Words

Interventions can address misperceptions about common beliefs.

  • Bursztyn, Gonz´

alez, and Yanagizawa-Drott (2018) show that

◮ Men in Saudi Arabia believe that other men are less supportive of female labor force participation than they really are; ◮ Correcting beliefs experimently makes men more willing to forgo income so that wives can participate in online job-matching; ◮ Months later, this increases the likelihood that women have participated in a job interview outside the home.

  • Patnaik (2018) shows that that the Quebec Parental Insurance

Program’s “daddy-only” label for some parts of parental leave

◮ increased fathers’ use of parental leave (53 percentage pts, 200 pct); ◮ also increased fathers’ long-term share of household and child-rearing responsibilities, increasing mothers’ labor supply as well.

→ [Back to discussion]

Jakiela and Ozier (2019) Gendered Language, Slide 96