Estimating CIs on proportions How confident can I be in my estimate? - PowerPoint PPT Presentation

Is this “population” free of infection? Estimating CI’s on proportions ❖ How confident can I be in my estimate? (e.g., 0 of 10 vs. 0 of 30) ❖ How different is the estimate of prevalence in two species, populations, times, … (quick and dirty) ❖ Skip the “simple” normal approximations ❖ will always be a little wrong, sometimes nonsensical) ❖ with modern stats packages, there is no need to resort to such a bad approximation Estimating CI’s on proportions Use Wilson score interval (w/o continuity correction)… h q 4 n 2 z 2 i 2 n z 2 ± 1 1 1 1 CL = p + ˆ n ˆ p (1 − ˆ p ) + 1+ 1 n z 2 where z = 1 − α / 2 = 1 . 96 it’s ugly, but it works well binom.confint() function in binom package http://vassarstats.net/prop1.html

Adjusting prevalence estimates for imperfect tests φ T rue = φ Apparent + specificity − 1 sensitivity + specificity − 1 CL Adjusted = CL Apparent + specificity − 1 sensitivity + specificity − 1 epi.prev() function in epiR package Rogan W, Gladen B (1978). Estimating prevalence from results of a screening test. American Journal of Epidemiology 107: 71 - 76. Detection varies with titer! Probability of detecting infection in tail clip 1.00 0.75 0.50 ❖ We treat infections as binary (at 0.25 least for microparasites) 0.00 0 1 2 3 4 5 6 log 10 virus titer in liver + kidney ❖ Virus titers vary by orders of magnitude Probability of detecting infection in swab 1.00 ❖ The P(detect ranavirus) 0.75 increases with titer 0.50 0.25 0.00 0 1 2 3 4 5 6 log 10 virus titer in liver + kidney Take care in interpreting prevalence data Just a snapshot in time High incidence ≠ lots of disease at least some individuals of many species are tolerant of RV Low incidence ≠ lack of disease or impact if individuals die or recover quickly, they will not be sampled and so will not be part of prevalence estimate

Take care in interpreting prevalence data Prevalence — the proportion infected (or diseased) at some time point Incidence — the rate of new infections (or occurrence of disease) over an interval Take care in interpreting prevalence data Scenario A: Long-lasting infections (e.g., long time course, low mortality & recovery) Prevalence = 7/10 Incidence = 7 (or 7/8 at risk) Scenario B: Short infections (e.g., rapid recovery) Prevalence = 4/10 Incidence = 7 (or 7/8 at risk) Incidence Prevalence ≈ Incidence × Duration ≈ Incidence × 1/rate of loss Loss (Assuming constant population size, incidence, and duration)

Take care in interpreting prevalence data Combining prevalence with other data is usually more informative: Are there dead or dying animals? P(disease) often increases with intensity of infection low prevalence of high intensity infections is more consistent with a die- off than low intensity infections Susceptibility of the species of interest low prevalence in a very susceptible species would be interpreted differently than similar prevalence in a very tolerant species Timing/phenology low prevalence in young larvae could mean low susceptibility/ transmission OR very early in an epidemic Comparing prevalence: Chi-square tests Pop A Pop B Total n ( O i − E i ) 2 χ 2 = X Infected 30 10 20 E i Not infected 50 i =1 25 25 Total 80 35 45 ❖ Can accommodate multiple groups (e.g., ponds, species, whatnot) ❖ Simple to calculate (even by hand) ❖ Requires that expected count in all cells be ≥ 5 which may be difficult with low sample sizes and/or low (or very high) prevalence Comparing prevalence: Chi-square tests n Pop A Pop B Total ( O i − E i ) 2 χ 2 = X Infected 30 10 20 E i Not infected 50 i =1 25 25 Total 35 45 80 If there is no difference between the two populations, we would expect the proportion infected to be the same in both: 30/80=0.375 Of the 35 sampled in Pop A, we expect 35 × 0.375 = 13.125 infections. Similarly we would expect 4 5 × 0.375 = 16.875 infected in Pop B. The expected number of uninfected in each pond is calculated similarly: 35 × (50/80) = 35 × 0.625 = 21.875 uninfected in population A, and 4 5 × (50/80) = 45 × 0.625 = 28.125

Comparing prevalence: Chi-square tests n ( O i − E i ) 2 χ 2 = X E i i =1 Pop A Pop B Infected (10-13.125) 2 /13.125 (20-16.875) 2 /16.875 Not infected (25-21.875) 2 /21.875 (25-28.125) 2 /28.125 Pop A Pop B Infected 0.7440476 0.5787037 Not infected 0.4464286 0.3472222 Sum = 2.116402 Compare to Chi-square distribution with (rows-1) (columns-1) = (2-1)(2-1) = 1 d.f. so P = 0.1457 Note: with 2x2 table, a correction is usually applied by stats packages Comparing prevalence: Margins & test options chisq.test() function in R stats Pop A Pop B Total NOTE: when simulate.p.value=TRUE assumes both R & C fixed Infected 10 20 30 fisher.test() function in R stats GTest() function in R package DescTools or Not infected 25 25 50 G.test() function in R package RVAideMemoire Total 35 45 80 barnardw.test() function in R package Barnard Experimental What is fixed? Large sample Small sample Design Chi-square G-test with Yates Model I Total sample size, N G-test correction Chi-square G-test with Yates Model II Either row totals ( R ) or G-test correction column totals ( C ) Barnard’s test Barnard’s test Model III Both row totals ( R ) & Chi-square Fisher’s exact Fisher’s exact column totals ( C ) Comparing/modeling prevalence: logistic regression Accommodates one many categorical (e.g., pond, species) or continuous predictors (e.g., pond size, salinity) Models the probability of some binary outcome (i.e., infection, death) in a pond (or individual)

Comparing/modeling prevalence: logistic regression The logit transform of this probability is a linear function of the predictors ✓ p i ◆ logit( p i ) = ln = β 0 + β 1 x i + · · · + β n x i 1 − p i Comparing/modeling prevalence: logistic regression ✓ p i ◆ logit( p i ) = ln = β 0 + β 1 x i + · · · + β n x i 1 − p i We can recover the probability by simple back- transformations ✓ ◆ p i = e β 0 + β 1 x i + ··· + β n x i exp(logit( p i )) = 1 − p i e β 0 + β 1 x i + ··· + β n x i 1 p i = 1 + e β 0 + β 1 x i + ··· + β n x i = e − ( β 0 + β 1 x i + ··· + β n x i ) Can make statements about how the probability or odds of infection (or death) change with the predictor Be careful about the units! Detecting die-offs or other temporary events Sampling times Die-off Die-off Time (e.g., days) P ( observe ) = duration of event 1.00 time between surveys Probability of observing die − off 0.75 Duration of die − off 2d 0.50 5d 8d 0.25 0.00 0 7 14 21 28 Days between surveys

General advice ❖ Remember that P =0.05 is not a magic threshold for what does/does not matter! ❖ Present effect sizes (change in prevalence between populations or with some predictor) to give a sense of biological importance ❖ Provide confidence intervals to give an idea of certainty in the estimate General advice ❖ Graph your data in a way that ❖ Honestly illustrates effects and confidence ❖ Include zero and one when graphing prevalence ❖ Show confidence intervals or confidence envelopes (logistic regression) ❖ Allows the raw data can be recovered for future (e.g., meta) analyses ❖ e.g., if you show prevalence as points on a graph, provide sample sizes ❖ Provide context: prevalence is only part of the story

Estimating CIs on proportions How confident can I be in my estimate? - PowerPoint PPT Presentation

Is this population free of infection? Estimating CIs on proportions How confident can I be in my estimate? (e.g., 0 of 10 vs. 0 of 30) How different is the estimate of prevalence in two species, populations, times, (quick and

Estimating Variance under Estimating Mean . . . Interval and Fuzzy Estimating Variance . . .

Estimating proportions of elements in finite symmetric and classical groups Alice Niemeyer UWA,

Estimating Estimating Covariance . . . Statistical Characteristics Estimating . . . Proof of

Small area estimation of proportions of Small area estimation of proportions of Arsenic affected

Lecture 22/Chapter 19 Part 4. Statistical Inference Ch. 19 Diversity of Sample Proportions

11/11/2014 Chapter 21 COMPARING TWO PROPORTIONS 1 THE STANDARD DEVIATION OF THE DIFFERENCE

Planning III-A: Planning III-A: Estimating Software Size - Estimating Software Size -

Estimating Frequency Moments Estimating F 0 Algorithm Correctness Further Anil Maheshwari

Estimating Frequency Moments Moments Estimating F 0 Algorithm Correctness Anil Maheshwari

Lecture 24/Chapter 20 Estimating Proportions with Confidence Example: Importance of Margin of

Estimating Relative Expression Mark Voorhies 4/6/2011 Mark Voorhies Estimating Relative

Ratios, Rates & Proportions Slide 2 / 130 Table of Contents Click on the topic to go to that

Factor Proportions and the Structure of Commodity Trade John Romalis - Chicago GSB, August

PCA and Admixture proportions for low depth NGS data Anders Albrechtsen Structured

Inference for Proportions Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Based on

ACMS 20340 Statistics for Life Sciences Chapter 20: Comparing Two Proportions Two sample tests

Cluster approach for EEG analysis: predicting upcoming sensorimotor event. Maria Luiza Rangel

Feature Selection ZHI LI Fenys Lab October 3, 2019 What is Feature? X (Independent)

Insights into the Treatment of SBD With Imaging Richard J. Schwab, M.D. Professor of Medicine

Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where

The intersection axiom of conditional independence : some new results Richard D. Gill

When its better to ask forgiveness than get permission Chris Thompson, Maritza Johnson, Serge

Users Really Do Plug in USB Drives They Find Matthew Tischer, Zakir Durumeric, Sam Foster, Sunny

Sta$s$cal Significance Tes$ng In Theory and In Prac$ce Ben

Estimating CIs on proportions How confident can I be in my estimate? - PowerPoint PPT Presentation

Is this population free of infection? Estimating CIs on proportions How confident can I be in my estimate? (e.g., 0 of 10 vs. 0 of 30) How different is the estimate of prevalence in two species, populations, times, (quick and

Estimating Variance under Estimating Mean . . . Interval and Fuzzy Estimating Variance . . .

Estimating proportions of elements in finite symmetric and classical groups Alice Niemeyer UWA,

Estimating Estimating Covariance . . . Statistical Characteristics Estimating . . . Proof of

Small area estimation of proportions of Small area estimation of proportions of Arsenic affected

Lecture 22/Chapter 19 Part 4. Statistical Inference Ch. 19 Diversity of Sample Proportions

11/11/2014 Chapter 21 COMPARING TWO PROPORTIONS 1 THE STANDARD DEVIATION OF THE DIFFERENCE

Planning III-A: Planning III-A: Estimating Software Size - Estimating Software Size -

Estimating Frequency Moments Estimating F 0 Algorithm Correctness Further Anil Maheshwari

Estimating Frequency Moments Moments Estimating F 0 Algorithm Correctness Anil Maheshwari

Lecture 24/Chapter 20 Estimating Proportions with Confidence Example: Importance of Margin of

Estimating Relative Expression Mark Voorhies 4/6/2011 Mark Voorhies Estimating Relative

Ratios, Rates &amp; Proportions Slide 2 / 130 Table of Contents Click on the topic to go to that

Factor Proportions and the Structure of Commodity Trade John Romalis - Chicago GSB, August

PCA and Admixture proportions for low depth NGS data Anders Albrechtsen Structured

Inference for Proportions Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Based on

ACMS 20340 Statistics for Life Sciences Chapter 20: Comparing Two Proportions Two sample tests

Cluster approach for EEG analysis: predicting upcoming sensorimotor event. Maria Luiza Rangel

Feature Selection ZHI LI Fenys Lab October 3, 2019 What is Feature? X (Independent)

Insights into the Treatment of SBD With Imaging Richard J. Schwab, M.D. Professor of Medicine

Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where

The intersection axiom of conditional independence : some new results Richard D. Gill

When its better to ask forgiveness than get permission Chris Thompson, Maritza Johnson, Serge

Users Really Do Plug in USB Drives They Find Matthew Tischer, Zakir Durumeric, Sam Foster, Sunny

Sta$s$cal Significance Tes$ng In Theory and In Prac$ce Ben

Ratios, Rates & Proportions Slide 2 / 130 Table of Contents Click on the topic to go to that