False-Positives, p-Hacking, Statistical Power, and Evidential Value
Leif D. Nelson
University of California, Berkeley Haas School of Business Summer Institute June 2014
False-Positives, p-Hacking, Statistical Power, and Evidential Value - - PowerPoint PPT Presentation
False-Positives, p-Hacking, Statistical Power, and Evidential Value Leif D. Nelson University of California, Berkeley Haas School of Business Summer Institute June 2014 Who am I? Experimental psychologist who studies judgment and
University of California, Berkeley Haas School of Business Summer Institute June 2014
2
3
[not a rhetorical question]
4
5
6
7
8
9
10
11
12
13
14
NOISE How much TV per day?
37
38
39
40
41
42
– 30%
– 5%
– 4%
– 3%
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
61
62
63
64
65
66
68
69
70
Highlights: More power at 5 Certain with 80%
71
Highlights
‘never’ wrong
powered studies.
72
73
74
75
76
.72 .75 .79 .85 .93 0.0 0.2 0.4 0.6 0.8 1.0
d=0 d=.2 d=.4 d=.6 d=.8
Estimated effect Size
Cohen's d
77
78
79
80
0.2 0.4 0.6 0.8 1 1.2 5 10 20 30 40 50
Estimated effect size (cohen-d) Number of studies in p-curve
True d = 0
Sample size of each study n=20 n=50
0.2 0.4 0.6 0.8 1 1.2 5 10 20 30 40 50
Estimated effect size (cohen-d) Number of studies in p-curve
True d = .3
Sample size of each study n=20 n=50
0.2 0.4 0.6 0.8 1 1.2 5 10 20 30 40 50
Estimated effect size (cohen-d) Number of studies in p-curve
True d = .6
Sample size of each study n=20 n=50
0.2 0.4 0.6 0.8 1 1.2 5 10 20 30 40 50
Estimated effect size (cohen-d) Number of studies in p-curve
True d = .9
Sample size of each study n=20 n=50
81
82
83
84
85
Choice is bad Choice is good ** **
86
87
88
89
datacolada.org p-curve.com