Sample size determination: why, when, how? @graemeleehickey - PowerPoint PPT Presentation

Graeme L. Hickey University of Liverpool Sample size determination: why, when, how? @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk

Wh Why? y? Scientific: might miss out on an important discovery (testing too few), or find a clinically irrelevant effect size (testing too many) Ethical: might sacrifice subjects (testing too many) or unnecessarily expose too few when study success chance low (testing too few) Economical: might waste money and time (testing too many) or have to repeat the experiment again (testing too few) Also, generally required for study grant proposals

Whe When? n? • Should be determined in advance of the study • For randomised control trials (RCTs), must be determined and specified in the study protocol before recruitment starts

Wha What no not to do do Use same sample size as another (possibly similar) study Might have just gotten lucky Base sample size on what is available Extend study period, seek more money, pool study Use a nice whole number and hope no one notices Unless you want your paper rejected Avoid calculating a sample size because you couldn’t estimate the parameters needed Do a pilot study or use approximate formulae, e.g. SD ≈ (max – min) / 4 Avoid calculating a sample size because you couldn’t work one out Speak to a statistician

Ex Exampl ple • A physician wants to set a study to compare a new antihypertensive drug relative to a placebo • Participants are randomized into two treatment groups: • Group N: new drug • Group P: placebo • The primary endpoint is taken as the mean reduction in systolic blood pressure (BP sys ) after four weeks

Wha What do do we ne need? d? Item Definition Specified value Type I error ( ⍺ ) Power (1 – β) Minimal clinically relevant difference Variation

Er Errors Hypothesis test We will use the conventional No evidence of a Evidence of a values of ⍺ =0.05 difference difference and β=0.20 True Negative False positive Type I error ( 𝛽 ) No difference Truth False negative True Positive Type II error (β) Difference

Wha What do do we ne need? d? Item Definition Specified value The probability of falsely rejecting Type I error ( ⍺ ) 0.05 H 0 (false positive rate) The probability of correctly Power (1 – β) 0.80 rejecting H 0 (true positive rate) Minimal clinically relevant difference Variation

Mi Minima mal clinically relevant di differ erenc ence • Minimal difference between the studied groups that the investigator wishes to detect • Referred to as minimal clinically relevant difference (MCRD) – different from statistical significance • MCRD should be biologically plausible • Sample size ∝ MCRD -2 • E.g. if n =100 required to detect MCRD = 1, then n =400 required to detect MCRD = 0.5 • Note : some software / formula define the ‘effect size’ as the standardized effect size = MCRD / σ

Whe Where to get MCRD or variation n value ues • Biological / medical expertise • Review the literature • Pilot studies • If unsure, get a the range of values and explore using sensitivity analyses

Ex Exampl ple: continue nued • From previous studies, the mean BP sys of hypertensive patients is 145 mmHg (SD = 5 mmHg) • Histograms also suggest that the distribution of BP is normally distributed in the population • An expert says the new drug would need to lower BP sys by 5 mmHg for it to be clinically significant, otherwise the side effects outweigh the benefit • He assumes the standard deviation of BP sys will be the same in the treatment group

Wha What do do we ne need? d? Item Definition Specified value The probability of falsely rejecting Type I error ( ⍺ ) 0.05 H 0 (false positive rate) The probability of correctly Power (1 – β) 0.80 rejecting H 0 (true positive rate) The smallest (biologically plausible) Minimal clinically relevant 5 mmHg difference in the outcome that is difference clinically relevant Variability in the outcome (SD for Variation 5 mmHg continuous outcomes)

Sa Samp mple size formu rmula* . 𝜏 . 𝑎 #,- . + 𝑎 #,0 𝑜 ≈ 2 𝜈 # − 𝜈 % . • 𝜈 # − 𝜈 % is the MCRD • 𝑎 ' is the quantile from a standard normal distribution • 𝜏 is the common standard deviation *based on a two-sided test assuming 𝜏 is known

Sa Samp mple size calculation 𝑜 ≈ 2 1.96 + 0.84 . 5 . 5 . 𝑜 = 2 1.96 + 0.84 . 5 . = 15.7 5 . Therefore we need 16 patients per treatment group NB: we always round up, never down

Se Sensitivity analyses • Sample size sensitive to changes in ⍺ , β, MCRD, σ • Generally a good idea to consider sensitivity of calculation to parameter choices • If unsure, generally choose the largest sample size

Sa Samp mple size calculation software • Standalone tools: G*Power (http://www.gpower.hhu.de/) • Many statistics software packages have built-in functions • Lots of web-calculators available • Lots of formulae published in (bio)statistics papers

Pr Practical limitat ations • What if the study duration is limited; the disease rare; financial resources stretched; etc.? • Calculate the power from the maximum sample size possible (reverse calculation) • Possible solutions: • change outcome (e.g. composite) • use as an argument for more funding • don’t perform the study • reduce variation, e.g. change scope of study • pool resources with other centres

� Es Estimation n pr probl blems • Study objective may be to estimate a parameter (e.g. a prevalence) rather than perform a hypothesis test • Sample size, n , chosen to control the width of the confidence interval (CI) • E.g. if a prevalence, the approximate 95% CI is given by < (1 – 𝑞 < ) < ± 1.96 𝑞 𝑞 𝑜 Margin of error (MOE) where 𝑞̂ is the estimated proportion

� Exa xample • David and Boris want to estimate how support among cardiothoracic surgeons for the UK to leave the EU • They want the MOE to be <3% #.@A • SE maximized when 𝑞̂ = 0.5 , so need < 0.03 . B • So need to (randomly) poll n = 1068 members

Dr Drop-ou outs / m / missing d data • Sample size calculation is for the number of subjects providing data • Drop-outs / missing data are generally inevitable • If we anticipate losing x % of subjects to drop-out / missing data, then inflate the calculated sample size, n , to be: 𝑜 𝑜 ⋆ = 1 − 𝑦 100

Samp Sa mple size formu rmula and software available fo for ot other… • Effects: • Hypotheses: • Comparing two proportions • Non-inferiority • Hazard ratios • Superiority • Odds ratios • … • … • Study designs: • Cluster RCTs • Cross-over studies • Repeated measures (ANCOVA) • …

Ob Obse servational studies How to approach (depending on Issues study objective) • Study design features: • Start from assuming randomization as a reference • Non-randomized ⇒ bias • Missing data • Correction factors (e.g. [1,2]) • Assignment proportions • Inflate sample size for PSM to unbalanced account for potential unmatched • Far fewer ‘closed-form’ formulae subjects • … [1] Hsieh FY et al . Stat Med . 1998; 17: 1623–34. [2] Lipsitz SR & Parzen M. The Statistician . 1995; 1: 81-90.

Re Reporting • Six high-impact journals in 2005-06*: • 5% reported no calculation details • 43% did not report all required parameters • Similar reporting inadequacies in papers submitted to EJCTS/ICVTS • Information provided should (in most cases) allow the statistical reviewer to reproduce the calculation • CONSORT Statement requirement * Charles et al. BMJ 2009;338:b1732

Final Final commen ents ts • All sample size formulae depend on significance, power, MCRD, variability (+ possible additional assumptions / parameters, e.g. number of events, correlations, …) no matter how complex • Lots of published formula (search Google Sc )), books, software, and of course… statisticians – need to find the one right for your study • A post hoc power calculation is worthless • Instead report effect size + 95% CI

I just cannae do it, I need more Thanks for listening Captain. I dinnae power, Scotty have the poower ! Any questions? Statistical Primer article to be published soon! Slides available (shortly) from: www.glhickey.com

Sample size determination: why, when, how? @graemeleehickey - PowerPoint PPT Presentation

Graeme L. Hickey University of Liverpool Sample size determination: why, when, how? @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Wh Why? y? Scientific: might miss out on an important discovery (testing too few), or find

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

SAMPLE SIZE IN TRIAXIAL LOADS How sample size affects the frictional behavior Photo by H.

Sample 2 Inlet in western (Sunset) Bay 0 Sample 3 Inlet behind Christian Island 1 Sample

Sample Size Power, Sample Size, and the FDR How many observations do we need? Depends on

AN INTRODUCTION TO CONTENT DETERMINATION Gerard Casamayor Chris Mellish Contents 1. The place

Agglomeration of Ash Particles due to Flue Gas Conditioning (a) Sample CA8S12F1 (b) Sample

Sample Preparation Sample Preparation Sample Size 6 mm x 12 mm x 50 mm 10 mm x 12 mm

Math 1710 Class 24 Examples Power 2-Sample CIs Dr. Allen Back and HTs 2-Sample

Determination of Dry Coating Determination of Dry Coating A B C D E F G H I J K L M

Lumber Size Lumber Size Control Control Studies Studies Lumber Size Control Lumber Size

Lab 2 discussion Last Time Debugging Its a science use experiments to refine

SEM Photographs of Activated ash samples SEM Micrographs (Original ash samples) (a) Sample S1F1

SCALING-UP SELF-DETERMINATION Tools from the National Gateway to Self-Determination A National

Draft decision: SP AusNet transmission determination 2008-14 Pre-determination conference 11

Sex Determination Read pp65-75 in text Outline 1.Different mechanisms for sex determination 2.

Planning Sample Size for Randomized Evaluations Jed Friedman, World Bank Based on slides from

EGAP Learning Days: Power Analysis Gareth Nellis Preliminaries: Average Treatment Effect

Bias and Equity Implicit Bias & Health Care Disparities Two sides of the same coin Clinical

Brownian motors in the micro-scale domain: Enhancement of efficiency by noise Part of Phys. Rev. E

Multiband RF-Interconnect for Reconfigurable Network-on-Chip Communications Jason Cong

CS654 Advanced Computer Architecture Lec 4 - Introduction Peter Kemper Adapted from the slides

Null Hypothesis Significance Testing p -values, significance level, power, t -tests 18.05 Spring

Optimization of Power Analysis Using Neural Network Zdenek Martinasek, Jan Hajny and Lukas Malina

Fundamentals of Signals Overview Definition Examples Energy and power Signal

Sambuz

Useful Links

Newsletter

Mail Us

Sample size determination: why, when, how? @graemeleehickey - PowerPoint PPT Presentation

Graeme L. Hickey University of Liverpool Sample size determination: why, when, how? @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Wh Why? y? Scientific: might miss out on an important discovery (testing too few), or find

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

SAMPLE SIZE IN TRIAXIAL LOADS How sample size affects the frictional behavior Photo by H.

Sample 2 Inlet in western (Sunset) Bay 0 Sample 3 Inlet behind Christian Island 1 Sample

Sample Size Power, Sample Size, and the FDR How many observations do we need? Depends on

AN INTRODUCTION TO CONTENT DETERMINATION Gerard Casamayor Chris Mellish Contents 1. The place

Agglomeration of Ash Particles due to Flue Gas Conditioning (a) Sample CA8S12F1 (b) Sample

Sample Preparation Sample Preparation Sample Size 6 mm x 12 mm x 50 mm 10 mm x 12 mm

Math 1710 Class 24 Examples Power 2-Sample CIs Dr. Allen Back and HTs 2-Sample

Determination of Dry Coating Determination of Dry Coating A B C D E F G H I J K L M

Lumber Size Lumber Size Control Control Studies Studies Lumber Size Control Lumber Size

Lab 2 discussion Last Time Debugging Its a science use experiments to refine

SEM Photographs of Activated ash samples SEM Micrographs (Original ash samples) (a) Sample S1F1

SCALING-UP SELF-DETERMINATION Tools from the National Gateway to Self-Determination A National

Draft decision: SP AusNet transmission determination 2008-14 Pre-determination conference 11

Sex Determination Read pp65-75 in text Outline 1.Different mechanisms for sex determination 2.

Planning Sample Size for Randomized Evaluations Jed Friedman, World Bank Based on slides from

EGAP Learning Days: Power Analysis Gareth Nellis Preliminaries: Average Treatment Effect

Bias and Equity Implicit Bias &amp; Health Care Disparities Two sides of the same coin Clinical

Brownian motors in the micro-scale domain: Enhancement of efficiency by noise Part of Phys. Rev. E

Multiband RF-Interconnect for Reconfigurable Network-on-Chip Communications Jason Cong

CS654 Advanced Computer Architecture Lec 4 - Introduction Peter Kemper Adapted from the slides

Null Hypothesis Significance Testing p -values, significance level, power, t -tests 18.05 Spring

Optimization of Power Analysis Using Neural Network Zdenek Martinasek, Jan Hajny and Lukas Malina

Fundamentals of Signals Overview Definition Examples Energy and power Signal

Sambuz

Useful Links

Newsletter

Mail Us

Bias and Equity Implicit Bias & Health Care Disparities Two sides of the same coin Clinical