Statistics for the terrified
Amanda Burls
Evidence-Based Teachers and Developers Conference, Taormina, Sicily October 2013
Statistics for the terrified Amanda Burls Evidence-Based Teachers - - PowerPoint PPT Presentation
Statistics for the terrified Amanda Burls Evidence-Based Teachers and Developers Conference, Taormina, Sicily October 2013 Post-prandial session End 0:59 0:14 0:26 0:25 0:24 0:23 0:22 0:21 0:20 0:19 0:18 0:17 0:16 0:15 0:13
Statistics for the terrified
Amanda Burls
Evidence-Based Teachers and Developers Conference, Taormina, Sicily October 2013
Post-prandial session
1:00 0:59 0:58 0:57 0:56 0:55 0:54 0:53 0:52 0:51 0:50 0:49 0:48 0:47 0:46 0:45 0:44 0:43 0:42 0:41 0:40 0:39 0:38 0:37 0:36 0:35 0:34 0:33 0:32 0:31 0:30 0:29 0:28 0:27 0:26 0:25 0:24 0:23 0:22 0:21 0:20 0:19 0:18 0:17 0:16 0:15 0:14 0:13 0:12 0:11 0:10 0:09 0:08 0:07 0:06 0:05 0:04 0:03 0:02 0:01
Hypothermia vs. control
In severe head injury Mortality or incapacity (n=158) RR 0.63 (0.46, 0.87) Marion 1997
.1 .2 1 5 10
Total (95%CI) Clifton 1992 Hirayama 1994 Clifton 1993 Favours intervention RR Favours control
1:00 0:59 0:58 0:57 0:56 0:55 0:54 0:53 0:52 0:51 0:50 0:49 0:48 0:47 0:46 0:45 0:44 0:43 0:42 0:41 0:40 0:39 0:38 0:37 0:36 0:35 0:34 0:33 0:32 0:31 0:30 0:29 0:28 0:27 0:26 0:25 0:24 0:23 0:22 0:21 0:20 0:19 0:18 0:17 0:16 0:15 0:14 0:13 0:12 0:11 0:10 0:09 0:08 0:07 0:06 0:05 0:04 0:03 0:02 0:01
1 minute to discuss with your neighbour Then write down what you think this graphic tells you
Learning objectives
– Know how measures of effect are reported – Be able to interpret p-values – Be able to interpret confidence intervals – Be able to calculate relative risks (RR, OR) – Be able to explain the difference between statistical significance clinical significance – Like to use blobbograms and be able to interpret then with ease
Statistics without fear
Before we start, let’s remind ourselves What are the important things to think about when we are using research evidence to help inform your decisions?
Validity for an intervention study?
2:00 1:59 1:58 1:57 1:56 1:55 1:54 1:53 1:52 1:51 1:50 1:49 1:48 1:47 1:46 1:45 1:44 1:43 1:42 1:41 1:40 1:39 1:38 1:37 1:36 1:35 1:34 1:33 1:32 1:31 1:30 1:29 1:28 1:27 1:26 1:25 1:24 1:23 1:22 1:21 1:20 1:19 1:18 1:17 1:16 1:15 1:14 1:13 1:12 1:11 1:10 1:09 1:08 1:07 1:06 1:05 1:04 1:03 1:02 1:01 1:00 0:59 0:58 0:57 0:56 0:55 0:54 0:53 0:52 0:51 0:50 0:49 0:48 0:47 0:46 0:45 0:44 0:43 0:42 0:41 0:40 0:39 0:38 0:37 0:36 0:35 0:34 0:33 0:32 0:31 0:30 0:29 0:28 0:27 0:26 0:25 0:24 0:23 0:22 0:21 0:20 0:19 0:18 0:17 0:16 0:15 0:14 0:13 0:12 0:11 0:10 0:09 0:08 0:07 0:06 0:05 0:04 0:03 0:02 0:01 End
Validity for an RCT – Getting similar groups and keeping them similar
Appraisal of any study must consider
– Can the results be trusted?
– What are the results – How are they (or can they be) expressed – Could they have occurred by chance
– Do these results apply to the local context or to me or to my patient?
There are two sorts of error
Systematic error (Bias) Random error
Warning!
Everything I say from now
being considered come from an unbiased study!
(assumes NO systematic errors)
How are results summarised?
two alternatives.
comparisons be expressed?
Well-conducted RCT (no bias)
2:00 1:59 1:58 1:57 1:56 1:55 1:54 1:53 1:52 1:51 1:50 1:49 1:48 1:47 1:46 1:45 1:44 1:43 1:42 1:41 1:40 1:39 1:38 1:37 1:36 1:35 1:34 1:33 1:32 1:31 1:30 1:29 1:28 1:27 1:26 1:25 1:24 1:23 1:22 1:21 1:20 1:19 1:18 1:17 1:16 1:15 1:14 1:13 1:12 1:11 1:10 1:09 1:08 1:07 1:06 1:05 1:04 1:03 1:02 1:01 1:00 0:59 0:58 0:57 0:56 0:55 0:54 0:53 0:52 0:51 0:50 0:49 0:48 0:47 0:46 0:45 0:44 0:43 0:42 0:41 0:40 0:39 0:38 0:37 0:36 0:35 0:34 0:33 0:32 0:31 0:30 0:29 0:28 0:27 0:26 0:25 0:24 0:23 0:22 0:21 0:20 0:19 0:18 0:17 0:16 0:15 0:14 0:13 0:12 0:11 0:10 0:09 0:08 0:07 0:06 0:05 0:04 0:03 0:02 0:01 End
Expressing results: What did the study show?
– 10 randomised to receive Potters – 10 randomised to receive placebo
– 2 get better on Potters – 1 get better on placebo
at least three different ways
Summarise
Potters, one case of back pain is improved
Less
moRe
?
Relative Risk
group is to recover than the
Potters means the relative risk is 2, or RR = 2.0
Less
moRe
1
Risk difference
recovering – the proportion of patients benefitting from treatment
improved on placebo, so the risk difference is 10%
Less
moRe
Number needed to treat (NNT)
the new intervention needs to be given to produce one extra patient who is helped
How were the results summarised?
Two basic ways to summarise results of studies that compare groups:
Do you think this study proves that Potters works?
“It could have happened by chance!”
“It could have happened by chance!”
So how many would you want before you believe the results?
What is the minimum number you would want in each arm to believe the trial?
Assume similar effect size: 10% better with placebo 20% with Potters
Scores
Quantifying uncertainty due to chance
p-value
The Null Hypothesis
… is the assumption of no difference between treatments being compared
1 Impossible Absolutely certain
1 Blue 19 Green
Bag of 20 sweets
10 Blue 10 Green
Bag of 20 sweets
20 Blue 10 Green
Bag of 30 sweets
“Statistical significance”
by chance on fewer than 1 in 20
Potters Placebo P-value 2/10 1/10 P = 0.531 4/20 2/20 P = 0.376 6/30 3/30 P = 0.278 8/40 4/40 P = 0.210 10/50 5/50 P = 0.161 12/60 6/60 P = 0.125 14/70 7/70 P = 0.097 16/80 8/80 P = 0.076 18/90 9/90 P = 0.060 20/100 10/100 P = 0.048 100/500 50/500 P < 0.0001 200/1000 100/1000 P < 0.0001
Why p<0.05 as the cut-off?
significant” and not
Toss a coin 8 times in a row and record the number of heads
0 1 2 3 4 5 6
P<0.016
2 4 6 8 10 12 14 16 18
10 20 30 40 50 60 70 80 90 100
Pre and Post Workshop Scores Percentage
5 4 3 2 1
“Odds ratio”
Do you think this is likely to have happened by chance?
1.Yes 2.Don’t know 3.No
Do you think this is likely to have happened by chance?
1.Yes 2.Don’t know (~1000) 3.No
P<0.00001
10 20 30 40 50 60 70 80 90 100
Pre and Post Workshop Scores Percentage
5 4 3 2 1
“MAAG”
Do you think this is likely to have happened by chance?
1.Yes 2.No
P<0.00001
Statistical significance does not imply clinical significance!
Limitations of the p-value Any genuine difference between two groups, no matter how small, can be made to be “statistically significant”
taking a sufficiently large sample.
We need a better way to express uncertainty due to chance….. [?]
Introduction to confidence intervals
surrounding a point estimate.
How many Red sweets did I pick?
More likely Less likely Less likely
P< 0.000001
Hypothermia vs. control
In severe head injury Mortality or incapacity (n=158) RR 0.63 (0.46, 0.87) Marion 1997
.1 .2 1 5 10
Total (95%CI) Clifton 1992 Hirayama 1994 Clifton 1993 Favours intervention RR Favours control
Hypothermia vs. control
In severe head injury Mortality or incapacity (n=158) RR 0.63 (0.46, 0.87) Marion 1997
.1 .2 1 5 10
Total (95%CI) Clifton 1992 Hirayama 1994 Clifton 1993 Favours intervention RR Favours control
When making health care decisions we often want to know
certain risk factor
likely to develop a particular outcome
better (or be harmed)
How are these things summarised?
comparison and association as you can
2:00 1:59 1:58 1:57 1:56 1:55 1:54 1:53 1:52 1:51 1:50 1:49 1:48 1:47 1:46 1:45 1:44 1:43 1:42 1:41 1:40 1:39 1:38 1:37 1:36 1:35 1:34 1:33 1:32 1:31 1:30 1:29 1:28 1:27 1:26 1:25 1:24 1:23 1:22 1:21 1:20 1:19 1:18 1:17 1:16 1:15 1:14 1:13 1:12 1:11 1:10 1:09 1:08 1:07 1:06 1:05 1:04 1:03 1:02 1:01 1:00 0:59 0:58 0:57 0:56 0:55 0:54 0:53 0:52 0:51 0:50 0:49 0:48 0:47 0:46 0:45 0:44 0:43 0:42 0:41 0:40 0:39 0:38 0:37 0:36 0:35 0:34 0:33 0:32 0:31 0:30 0:29 0:28 0:27 0:26 0:25 0:24 0:23 0:22 0:21 0:20 0:19 0:18 0:17 0:16 0:15 0:14 0:13 0:12 0:11 0:10 0:09 0:08 0:07 0:06 0:05 0:04 0:03 0:02 0:01 End
On the next slide
someone who cannot see the field
Possible summaries
are goats
Calculate (you have I minute)
(note that in epidemiology we use the word “risk” to mean “chance” – it doesn’t necessarily mean something unwanted)
Answers
– 10/16 – 0.625
– 10:6 – 1.66
Odds
Odds are a way of describing how many people in a population have a disease, risk factor or other
e.g. The odds could be the number of people with the disease compared to the number without the disease: number with disease
Comparing two or more groups…
exposed/not expose)
Mountain side
Summarise the relative risk of being in the field compared to the mountain in terms of
3:7 = 0.43
3/10 = 0.3
Is it reasonable to conclude that there is an association between being in the field and being a sheep?
P-value
How likely is it that we would get a result as big (or as small) as the one
The answer is given as a p-value
A. P = 0.24 B. P<0.5
E. P<0.01 F. P>0.001
How likely is it that we would get a result as big (or as small) as the one
A. P = 0.24 B. P<0.5
E. P<0.01 F. P>0.001
Meta-analysis
More fun!
Fixed effects
have been drawn from the same barrel Random effects
drawn from a random barrel and all we know about the proportions of sweets in the barrels in the world is what we can deduce from our samples
|
“Hey, no problem!”
Statistics scary? – Nah, all bark and no bite
31 October, 2013 Critical Appraisal Skills Workshop 84