Statistical Analysis of Endorsement Experiments: Measuring Support - - PowerPoint PPT Presentation

statistical analysis of endorsement experiments measuring
SMART_READER_LITE
LIVE PREVIEW

Statistical Analysis of Endorsement Experiments: Measuring Support - - PowerPoint PPT Presentation

Statistical Analysis of Endorsement Experiments: Measuring Support for Militant Groups in Pakistan Kosuke Imai Department of Politics Princeton University Joint work with Will Bullock and Jacob Shapiro May 13, 2011 Kosuke Imai (Princeton)


slide-1
SLIDE 1

Statistical Analysis of Endorsement Experiments: Measuring Support for Militant Groups in Pakistan

Kosuke Imai Department of Politics Princeton University Joint work with Will Bullock and Jacob Shapiro

May 13, 2011

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 1 / 24

slide-2
SLIDE 2

Motivation

Survey is used widely in social sciences Validity of survey depends on the accuracy of self-reports Sensitive questions = ⇒ social desirability, privacy concerns e.g., racial prejudice, corruptions Lies and nonresponses How can we elicit truthful answers to sensitive questions? Survey methodology: protect privacy through indirect questioning Statistical methodology: efficiently recover underlying responses

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 2 / 24

slide-3
SLIDE 3

Survey Methodologies for Sensitive Questions

Randomized Response Technique

Most extensively studied Use randomization to protect privacy Difficulties: logistics, lack of understanding among respondents

List Experiments (item count technique)

Use aggregation to protect privacy New multivariate regression analysis method New methods to detect and correct failures (joint with G. Blair) Difficulties: design effects, ceiling and floor effects

Endorsement Experiments

Use randomized endorsements to measure support levels Develop a measurement model based on item response theory Difficulties: interpretation, need for modeling Applications:

1

Pakistanis’ support for Islamic militant groups

2

Afghanis’ support for Taliban and ISAF (joint with J. Lyall)

3

Nigerians’ support for insurgents (joint with G. Blair)

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 3 / 24

slide-4
SLIDE 4

Endorsement Experiments

Measuring support for political actors (e.g., candidates, parties) when studying sensitive questions Ask respondents to rate their support for a set of policies endorsed by randomly assigned political actors Experimental design:

1

Select policy questions

2

Randomly divide sample into control and treatment groups

3

Across respondents (and questions), randomly assign political actors for endorsement (no endorsement for the control group)

4

Compare support level for each policy endorsed by different actors

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 4 / 24

slide-5
SLIDE 5

The Pakistani Survey Experiment

6,000 person urban-rural sample in April 2009 Four militant groups:

Pakistani militants fighting in Kashmir (a.k.a. Kashmiri tanzeem) Militants fighting in Afghanistan (a.k.a. Afghan Taliban) Al-Qa’ida Firqavarana Tanzeems (a.k.a. sectarian militias)

Four policies:

WHO plan to provide universal polio vaccination across Pakistan Curriculum reform for religious schools Reform of FCR to make Tribal areas equal to rest of the country Peace jirgas to resolve disputes over Afghan border (Durand Line)

Response rate over 90%

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 5 / 24

slide-6
SLIDE 6

Endorsement Experiment Questions: Example

The script for the control group The World Health Organization recently announced a plan to introduce universal Polio vaccination across Pakistan. How much do you support such a plan? (1) A great deal (2) A lot (3) A moderate amount (4) A little (5) Not at all The script for a treatment group The World Health Organization recently announced a plan to introduce universal Polio vaccination across Pakistan, a policy that has received support from Al-Qa’ida. How much do you support such a plan? (1) A great deal (2) A lot (3) A moderate amount (4) A little (5) Not at all

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 6 / 24

slide-7
SLIDE 7

Distribution of Responses

Polio Vaccinations Curriculum Reform FCR Reforms Durand Line

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

Not At All A Little A Moderate Amount A Lot A Great Deal Control Group Pakistani militant groups in Kashmir Afghan Taliban Al−Qaida Firqavarana Tanzeems Control Group Pakistani militant groups in Kashmir Afghan Taliban Al−Qaida Firqavarana Tanzeems Control Group Pakistani militant groups in Kashmir Afghan Taliban Al−Qaida Firqavarana Tanzeems Control Group Pakistani militant groups in Kashmir Afghan Taliban Al−Qaida Firqavarana Tanzeems Punjab Sindh NWFP Balochistan

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 7 / 24

slide-8
SLIDE 8

Methodological Challenges and Proposed Solutions

1

How to combine responses from multiple questions? = ⇒ item response theory

2

How to recoup loss of statistical efficiency? = ⇒ hierarchical modeling

3

How to interpret the “support”? = ⇒ policy vs. valence

4

How to select policy questions?

Policies should belong to a single dimension Respondents should not have strong views Should one use well-known policies?: Statistical and substantive tradeoffs

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 8 / 24

slide-9
SLIDE 9

Endorsement Experiments Framework

N respondents J policy questions K political actors Yij ∈ {0, 1}: response of respondent i to policy question j Tij ∈ {0, 1, . . . , K}: political actor randomly assigned to endorse policy j for respondent i Yij(t): potential response given the endorsement by actor t Covariates measured prior to the treatment

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 9 / 24

slide-10
SLIDE 10

The Proposed Model

Quadratic random utility model (Clinton, Jackman, and Rivers): Ui(ζj1, k) = −(xi + s∗

ijk) − ζj12 + ηij

Ui(ζj0, k) = −(xi + s∗

ijk) − ζj02 + νij

xi is the ideal point and s∗

ijk is the “influence” of endorsement

The statistical model (item response theory): Pr(Yij = 1 | Tij = k) = Pr(Yij(k) = 1) = Pr(Ui(ζj1, k) > Ui(ζj0, k)) = Pr(αj + βj(xi + s∗

ijk) > ǫij)

Support level: greater support ⇐ ⇒ greater prob. of Yij = 1 sijk =

  • s∗

ijk

if βj ≥ 0 −s∗

ijk

  • therwise

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 10 / 24

slide-11
SLIDE 11

The Proposed Model (Continued)

Hierarchical modeling: xi

indep.

∼ N(Z ⊤

i δ, σ2 x)

sijk

indep.

∼ N(Z ⊤

i λjk, ω2 jk)

λjk

i.i.d.

∼ N(θk, Φk) “Noninformative” hyper prior on (αj, βj, δ, θk, ω2

jk, Φk)

Interpretation:

spacial model vs. factor analysis policy vs. valence aspects of support

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 11 / 24

slide-12
SLIDE 12

Quantities of Interest and Model Fitting

Average support level for each militant group k τjk(Zi) = Z ⊤

i λjk

for each policy j κk(Zi) = Z ⊤

i θk

averaging over all policies Standardize them by dividing the (posterior) standard deviation of ideal points Bayesian Markov chain Monte Carlo algorithm Multiple chains to monitor convergence Implementation via JAGS (Plummer)

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 12 / 24

slide-13
SLIDE 13

Model for the Division Level Support

Ordered response with an intercept αjl varying across divisions The model specification: xi

indep.

∼ N(δdivision[i], 1) sijk

indep.

∼ N(λk,division[i], ω2

k)

δdivision[i]

indep.

∼ N(µprovince[i], σ2

province[i])

λk,division[i]

indep.

∼ N(θk,province[i], Φk,province[i]) Averaging over policies Partial pooling across divisions within each province

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 13 / 24

slide-14
SLIDE 14

Estimated Division Level Support

−1.0 −0.5 0.0 0.5 1.0 Standardized Level of Support Pakistani militant groups in Kashmir Militants fighting in Afghanistan Al−Qaida Firqavarana Tanzeems

Bahawalpur n=118 Dera Ghazi Khan n=0 Faisalabad n=313 Gujranwala n=403 Lahore n=579 Multan n=495 Rawalpindi n=208 Sargodha n=131 Hyderabad n=203 Karachi n=473 Larkana n=311 Mirpurkhas n=0 Sukkur n=293 Bannu n=0 Dera Ismail Khan n=84 Hazara n=287 Kohat n=50 Malakand n=0 Mardan n=215 Peshawar n=288 Kalat n=103 Makran n=0 Nasirabad n=210 Quetta n=320 Sibi n=67 Zhob n=61

Punjab Sindh NWFP Balochistan Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 14 / 24

slide-15
SLIDE 15

Model with Individual Covariates

Ordered response with an intercept αjl varying across divisions The model specification: xi

indep.

∼ N(δdivision[i] + Z ⊤

i δZ, 1)

sijk

indep.

∼ N(λk,division[i] + Z ⊤

i λZ k , ω2 k)

δdivision[i]

indep.

∼ N(µprovince[i], σ2

province[i])

λk,division[i]

indep.

∼ N(θk,province[i], Φk,province[i]) Expands upon the division level model to include individual level covariates: gender, urban/rural, education, income Individual level covariate effects after accounting for differences across divisions Poststratification on these covariates using the census

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 15 / 24

slide-16
SLIDE 16

Estimated Effects of Individual Covariates

−0.2 −0.1 0.0 0.1 0.2

Pakistani militant groups in Kashmir

Standardized Level of Support

  • Female

Rural Income Education −0.2 −0.1 0.0 0.1 0.2 Militants fighting in Afghanistan Standardized Level of Support

  • Female

Rural Income Education −0.2 −0.1 0.0 0.1 0.2 Al−Qaida Standardized Level of Support

  • Female

Rural Income Education −0.2 −0.1 0.0 0.1 0.2 Firqavarana Tanzeems Standardized Level of Support

  • Female

Rural Income Education

Demographics play a small role in explaining support for groups

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 16 / 24

slide-17
SLIDE 17

Regional Clustering of the Support for Al-Qaida

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 17 / 24

slide-18
SLIDE 18

Some Non-Causal Observations

Comparison with the “knowledge on the ground”

Greatest support in Punjab: consistent Support in Gujranwala but not in Bahawalpur: surprising (US AID?)

Least tolerant where senior leadership resides

Hazara for Al-Qa’ida Quetta and Zhob for Taliban

Least support where many terrorist attacks before April 2009

Hazara, Kohat, Nasirabad, Peshawar, and Quetta all suffered from attacks in early 2009 Data on “politically motivated violence” from March 2008 through March 2009 (National Counterterrorism Center’s Worldwide Incident Tracking System)

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 18 / 24

slide-19
SLIDE 19

Association between Support and Violence

Strong negative association

  • −0.4

−0.3 −0.2 −0.1 0.0 0.1 0.2 50 100 150 200

Pakistani militant groups in Kashmir

Division−Level Estimated Support Number of Incidents

correlation = −0.574

  • −0.4

−0.3 −0.2 −0.1 0.0 0.1 0.2 50 100 150 200

Militants fighting in Afghanistan

Division−Level Estimated Support Number of Incidents

correlation = −0.594

  • −0.4

−0.3 −0.2 −0.1 0.0 0.1 0.2 50 100 150 200

Al−Qaida

Division−Level Estimated Support Number of Incidents

correlation = −0.468

  • −0.4

−0.3 −0.2 −0.1 0.0 0.1 0.2 50 100 150 200

Firqavarana Tanzeems

Division−Level Estimated Support Number of Incidents

correlation = −0.414

Weaker association for the standard ordered probit model (division dummies, treatment variables, their interactions)

  • −0.4

−0.2 0.0 0.2 0.4 0.6 50 100 150 200

Pakistani militant groups in Kashmir

Division−Level Estimated Support Number of Incidents

correlation = −0.061

  • −0.4

−0.2 0.0 0.2 0.4 0.6 50 100 150 200

Militants fighting in Afghanistan

Division−Level Estimated Support Number of Incidents

correlation = −0.365

  • −0.4

−0.2 0.0 0.2 0.4 0.6 50 100 150 200

Al−Qaida

Division−Level Estimated Support Number of Incidents

correlation = 0.021

  • −0.4

−0.2 0.0 0.2 0.4 0.6 50 100 150 200

Firqavarana Tanzeems

Division−Level Estimated Support Number of Incidents

correlation = −0.166

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 19 / 24

slide-20
SLIDE 20

Ideology, Support, and Violence

No strong relationship between:

ideology and violence ideology and support

  • −0.5

0.0 0.5 50 100 150 200 Division−Level Estimated Ideal Point Number of Incidents

correlation = −0.040

  • −0.5

0.0 0.5 −0.25 −0.20 −0.15 −0.10 −0.05 0.00 0.05 0.10 Division−Level Estimated Ideal Point Division−Level Average Estimated Support for Militant Groups

correlation = 0.087

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 20 / 24

slide-21
SLIDE 21

Simulation Studies

1

Based on the Pakistani Data

Same 2 models plus province-level issue ownership model Top-level parameters held constant across simulations Sample sizes and distribution same as before Ideal points, endorsements and responses follow IRT models

2

Varying sample sizes

Model for division-level estimates with no covariates Model for province-level estimates with no covariates but support varying across policies N = 1000, 1500, 2000 Again, top-level parameters held constant across simulations while ideal points, endorsements and responses follow IRT models

100 simulations under each scenario (3 chains, 60000 iterations) Frequentist evaluation of Bayesian estimators

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 21 / 24

slide-22
SLIDE 22

Monte Carlo Evidence based on the Pakistani Data

Density −0.10 −0.05 0.00 0.05 0.10 10 20 30 40 Density 0.80 0.85 0.90 0.95 1.00 5 10 15 20 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Effect Size Proportion Statistically Significant

  • Density

−0.10 −0.05 0.00 0.05 0.10 10 20 30 40 Density 0.80 0.85 0.90 0.95 1.00 5 10 15 20

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Effect Size Proportion Statistically Significant The Division Model With Individual Covariates The Division Model

Bias Coverage Rate of the 90% Confidence Intervals Statistical Power α level = 0.10

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 22 / 24

slide-23
SLIDE 23

Monte Carlo Evidence with Varying Sample Size

  • N=1000

N=1500 N=2000 −0.2 −0.1 0.0 0.1 0.2

Bias

Bias

  • N=1000

N=1500 N=2000 0.75 0.80 0.85 0.90 0.95 1.00 1.05

Coverage Rate of the 90% Confidence Intervals

Coverage Rate 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

Statistical Power

Effect Size Proportion Statistically Significant

  • N=2000

N=1500 N=1000

  • N=1000

N=1500 N=2000 −0.2 −0.1 0.0 0.1 0.2 Bias

  • N=1000

N=1500 N=2000 0.75 0.80 0.85 0.90 0.95 1.00 1.05 Coverage Rate 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Effect Size Proportion Statistically Significant

  • N=2000

N=1500 N=1000

The Division Model The Division Model With Individual Covariates α level = 0.10

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 23 / 24

slide-24
SLIDE 24

Concluding Remarks

Survey methodology to study sensitive questions Endorsement Experiments

Most indirect form of questioning Applicability limited to measuring support Analysis based on the item response framework Multilevel modeling to efficient estimation of spatial patterns

Design considerations:

Policies should belong to a single dimension Respondents should not have strong opinion Separating policy and valence aspects of support Statistical vs. substantive tradeoffs Could measure policy positions and political knowledge separately

JAGS code available at the dataverse

Kosuke Imai (Princeton) Endorsement Experiments NEMP (NYU) 24 / 24