SLIDE 1

Null Hypothesis Significance Testing and the Problem of Underpowered Studies in Economics

Le (Lyla) Zhang, Curtin University

(with Andreas Ortmann, UNSW)

2015 workshop in Experimental Methods: The replicability crisis in the social sciences and how to address it, November 2015

SLIDE 2

Outline

Null Hypothesis Significance Testing (NHST)

  • Commonly Used Procedure
  • Two Types of Errors

The Statistical Power Analysis

  • A Meta-analysis (to calculate effect size)
  • Statistical power of dictator game experiments
SLIDE 3

Null Hypothesis Significance Testing

  • Widely used routine
  • Set “no treatment effect” as null hypothesis
  • A commonly used (“conventional”) criterion: α = 5% (10%, 1%)

Null hypothesis → calculate test statistic → reject or fail to reject
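
The routine above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used in the talk: it uses a two-sided two-sample z-test (normal approximation), the function name `two_sample_z_test` is hypothetical, and the giving amounts are made-up numbers.

```python
from math import sqrt
from statistics import NormalDist, mean, variance

def two_sample_z_test(treatment, control, alpha=0.05):
    """Two-sided z-test of H0: no treatment effect (equal means).

    Assumption of this sketch: the normal approximation is acceptable,
    which is only reasonable for moderately large samples.
    """
    diff = mean(treatment) - mean(control)
    # Standard error of the difference in means (sample variances).
    se = sqrt(variance(treatment) / len(treatment)
              + variance(control) / len(control))
    z = diff / se
    # Two-sided p-value under the standard normal distribution.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    decision = "reject" if p_value < alpha else "fail to reject"
    return z, p_value, decision

# Hypothetical amounts given out of $10 (illustrative, not real data).
control = [2, 3, 0, 5, 1, 2, 4, 0, 3, 2]
treatment = [3, 4, 1, 5, 2, 3, 5, 1, 4, 3]
z, p_value, decision = two_sample_z_test(treatment, control)
print(f"z = {z:.2f}, p = {p_value:.3f}, decision: {decision} H0")
```

With samples this small the test fails to reject at α = 5%, even though the made-up treatment group gives more on average.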

SLIDE 4

Two Types of Errors

                   Null is true (H0)                    Null is false (H1)
Reject             α (Type I error, false positive)     1-β (power)
Fail to reject     1-α                                  β (Type II error, false negative)
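
One way to make the four cells concrete is to simulate many experiments and count how often the null is rejected. The sketch below is an illustration only: it assumes normally distributed outcomes with a known standard deviation of 1, a two-sided z-test, and hypothetical settings (30 subjects per group, a true effect of 0.5 standard deviations when H1 holds).

```python
import random
from math import sqrt
from statistics import NormalDist, mean

def rejection_rate(true_effect, n_per_group, alpha=0.05, n_sims=2000, seed=1):
    """Share of simulated experiments that reject H0.

    Two-sided z-test with known sd = 1 (an assumption of this sketch).
    """
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    se = sqrt(2 / n_per_group)  # s.e. of the difference in means when sd = 1
    rejections = 0
    for _ in range(n_sims):
        control = [rng.gauss(0, 1) for _ in range(n_per_group)]
        treatment = [rng.gauss(true_effect, 1) for _ in range(n_per_group)]
        z = (mean(treatment) - mean(control)) / se
        if abs(z) > z_crit:
            rejections += 1
    return rejections / n_sims

# Null is true: the rejection rate estimates alpha (Type I error rate).
type_i_rate = rejection_rate(true_effect=0.0, n_per_group=30)
# Null is false: the rejection rate estimates 1 - beta (power).
power = rejection_rate(true_effect=0.5, n_per_group=30)
# The remaining share under H1 estimates beta (Type II error rate).
type_ii_rate = 1 - power
print(type_i_rate, power, type_ii_rate)
```

Under these assumptions the first rate should land near α, while the second shows how often a real effect is detected at this sample size.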

SLIDE 5

Dictator Game Experiments

SLIDE 6

Dictator Game Experiments

e.g., $10

SLIDE 7

Dictator Game Experiments

  • Over the past 15 years, hundreds of dictator game experiments have been conducted (Engel, 2010; Zhang & Ortmann, 2014).
  • These studies vary in experimental design variables (e.g., asset legitimacy, real money, etc.) and substantive variables (e.g., country, student, age).
  • Some of them are published, while others are not.
SLIDE 8

A meta-analysis of dictator game experiments

Action Space, Uncertainty, Incentive, Student, Efficiency, Asset Legitimacy, Deserving Recipient, Double Blind, Identification, Group Decision, Social Cue, Repeated Game, Age, Country, Communication, Paper Quality

SLIDE 9

Dictator Game Experiments

Often used threshold

SLIDE 10

The severe situation of under-powered studies

  • Large variation in the statistical power of the studies included in the meta-analysis of dictator game (DG) experiments (130 studies): min 5%, max 100%, median 22.5%.
  • The majority of them are under-powered (less likely to find an effect that exists).
  • Power depends on the sample size and the variables of interest (various design and implementation characteristics).
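
The dependence of power on sample size and effect size can be written down directly. The sketch below is a normal-approximation formula for a two-sided two-sample comparison, not the calculation used in the meta-analysis; `power_two_sample` is a hypothetical helper and the effect size of 0.3 is an arbitrary illustrative value.

```python
from math import sqrt
from statistics import NormalDist

def power_two_sample(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-sample z-test.

    effect_size is the standardized mean difference (Cohen's d).
    Normal approximation; an assumption of this sketch.
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    # Expected value of the z statistic when the true effect is effect_size.
    shift = effect_size * sqrt(n_per_group / 2)
    # Probability of landing in either rejection region.
    return std_normal.cdf(shift - z_crit) + std_normal.cdf(-shift - z_crit)

# Same (hypothetical) effect size, increasing sample size per group.
for n in (20, 50, 100, 200):
    print(n, round(power_two_sample(0.3, n), 2))
```

When the effect size is zero the function returns α, which matches the Type I error rate in the table of errors.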

SLIDE 11

Dictator Game Experiments

  • Large ES (effect size): high statistical power
  • Medium ES: statistical power varies and depends on sample size
  • Small ES: need a large sample to achieve the required statistical power
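
To see how quickly the required sample grows as the effect shrinks, the sketch below inverts the normal-approximation power formula. Several things here are assumptions rather than values from the slides: Cohen's conventional benchmarks (d = 0.2, 0.5, 0.8) stand in for small, medium and large ES, the target is 80% power at α = 5%, and `required_n_per_group` is a hypothetical helper.

```python
from math import ceil
from statistics import NormalDist

def required_n_per_group(effect_size, power=0.80, alpha=0.05):
    """Subjects per group for a two-sided two-sample z-test.

    Normal approximation; a t-test needs slightly more subjects.
    """
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha / 2)
    z_power = std_normal.inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)

# Cohen's benchmarks, used here as an assumption for "small/medium/large".
benchmarks = {"small": 0.2, "medium": 0.5, "large": 0.8}
sample_sizes = {label: required_n_per_group(d) for label, d in benchmarks.items()}
print(sample_sizes)
```

Under these assumptions a large effect needs about 25 subjects per group, a medium effect about 63, and a small effect about 393.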

SLIDE 12

Dictator Game Experiments

SLIDE 13

What can we do?

  • Rules of thumb: List et al. (EE, 2010). However, they do not guarantee a high level of statistical power.
  • Include a meta-analysis in the literature review, if possible.
  • Use the average effect size in the meta-analysis for power analysis of future projects. This requires open data.
  • If there is no extant study, pilot sessions would be helpful.
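
The step from a meta-analytic average to a planned sample size might look like the sketch below. It is one possible reading, not the authors' method: it assumes a fixed-effect (inverse-variance weighted) average, a two-sided two-sample z-test with 80% power at α = 5%, hypothetical helper names, and three made-up study results.

```python
from math import ceil, sqrt
from statistics import NormalDist

def pooled_effect_size(effects, variances):
    """Fixed-effect (inverse-variance weighted) average and its standard error."""
    weights = [1 / v for v in variances]
    pooled = sum(w * d for w, d in zip(weights, effects)) / sum(weights)
    return pooled, sqrt(1 / sum(weights))

def planned_n_per_group(effect_size, power=0.80, alpha=0.05):
    """Subjects per group under the normal approximation (an assumption)."""
    std_normal = NormalDist()
    z_sum = std_normal.inv_cdf(1 - alpha / 2) + std_normal.inv_cdf(power)
    return ceil(2 * (z_sum / effect_size) ** 2)

# Made-up standardized effects and sampling variances from three prior studies.
effects = [0.10, 0.35, 0.25]
variances = [0.04, 0.02, 0.01]

pooled, pooled_se = pooled_effect_size(effects, variances)
n_needed = planned_n_per_group(pooled)
print(f"pooled ES = {pooled:.3f} (s.e. {pooled_se:.3f}), n per group = {n_needed}")
```

More precise studies get more weight in the average, which is why the calculation needs each study's effect size and variance and therefore depends on open data.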

SLIDE 14

Thank you!