bingroup
play

binGroup: Homogeneous Population A Package for Heterogeneous - PowerPoint PPT Presentation

Outline What is group testing? binGroup: Homogeneous Population A Package for Heterogeneous Population Group Testing Matrix Pooling Christopher R. Bilder 1 Boan Zhang 1 Frank Schaarschmidt 2 Joshua M. Tebbs 3 1 University of


  1. Outline � What is group testing? binGroup: � Homogeneous Population A Package for � Heterogeneous Population Group Testing � Matrix Pooling Christopher R. Bilder 1 Boan Zhang 1 Frank Schaarschmidt 2 Joshua M. Tebbs 3 1 University of Nebraska-Lincoln 2 Leibniz Universität Hannover, Germany 3 University of South Carolina This research is supported in part by NIH grant R01AI067373 Slide 1 of 20

  2. What is group testing? Outline • Testing a person for a disease � What is group testing? • Individual testing � Homogeneous Population � Heterogeneous Population � Matrix Pooling – Problem: Cost and Time • Group testing Slide 2 of 20

  3. What is group testing? Outline • Group testing (continued) � What is group testing? − Save time and resources � Homogeneous − Applications in screening blood donations, drug Population � Heterogeneous discovery experiments, veterinary and public health Population studies � Matrix Pooling − Estimation • Probability of disease − Identification • Which individuals are positive Slide 3 of 20

  4. Homogeneous Population Outline • Notation � What is group testing? – Individual responses � Homogeneous • Y ik are independent Bernoulli( p ) random variables Population � Heterogeneous for item i in group k ( i = 1, …, I k , k = 1, …, K ) Population • Need to estimate p = P ( Y ik = 1) � Matrix Pooling • p is the “prevalence in a population” • Y ik are unobserved – Group responses • Z k are independent Bernoulli( θ k ) random variables • θ k = P ( Z k = 1) θ = − − • 1 (1 ) k I p k Slide 4 of 20

  5. Homogeneous Population Outline • Estimate p � What is group testing? − Likelihood function � Homogeneous Population � Heterogeneous Population − Common group size I � Matrix Pooling • MLE for p : − Unequal group sizes • Iterative numerical methods needed Slide 5 of 20

  6. Homogeneous Population Outline • bgtCI() � What is group testing? – Calculates point estimate and confidence interval for p � Homogeneous • Common group size Population � Heterogeneous – Different types of confidence intervals Population • Example (Ornaghi et al., 1999) � Matrix Pooling − The purpose is to estimate the probability that female planthopper transfer the MRC virus to maize crops Slide 6 of 20

  7. Homogeneous Population Outline • Example (continued) � What is group testing? − 24 plants with 7 planthoppers on each � Homogeneous − 3 plants test positive for the virus Population � Heterogeneous > bgtCI(n=24, y=3, s=7, conf.level=0.95, Population + alternative="two.sided", method="Score") � Matrix Pooling The 95 percent Score confidence interval is: [ 0.006325 0.05164 ] point estimator = 0.0189 Slide 7 of 20

  8. Homogeneous Population Outline • estDesign() � What is group testing? – Find the optimal group size � Homogeneous − Example Population � Heterogeneous > estDesign(n = 24, smax = 100, p.tr = 0.0189) Population � Matrix Pooling group size s with minimal mse(p) = 43 $varp [1] 3.239869e-05 $mse [1] 3.2808e-05 $bias [1] 0.0006397784 $exp [1] 0.01953978 • Other functions include: − bgtvs(), bgtTest(), bgtPower(), nDesign(), sDesign() Slide 8 of 20

  9. Heterogeneous Population Outline • Notation � What is group testing? – Individual responses � Homogeneous • Y ik are independent Bernoulli( p ik ) random variables Population � Heterogeneous for item i in group k ( i = 1, …, I k , k = 1, …, K ) Population • Need to estimate p ik = P ( Y ik = 1) � Matrix Pooling • Y ik are unobserved – Group responses • Z k are independent Bernoulli( θ k ) random variables • θ k = P ( Z k = 1) for group k ∏ I θ = − − 1 (1 ) • k p = k ik 1 i Slide 9 of 20

  10. Heterogeneous Population Outline • Notation (continued) � What is group testing? – Covariates � Homogeneous • x ik1 , x ik2 , …, x ikp for the i th item in the k th group Population � Heterogeneous • Incorporate factors which influence disease status Population • Model: logit( p ik ) = β 0 + β 1 x ik1 + … + β p x ikp � Matrix Pooling • Estimation of β 0 , β 1 , β 2 , …, β p – Note that Y ik are not directly observed – Vansteelandt et al. ( Biometrics , 2000) • Likelihood function is written in terms of the Z k Slide 10 of 20

  11. Heterogeneous Population Outline • Estimation of β 0 , β 1 , β 2 , …, β p (continued): � What is group testing? – Xie ( Statistics in Medicine , 2001) � Homogeneous • Likelihood function is written in terms of the Population � Heterogeneous unobserved Y ik Population I K ∏∏ � Matrix Pooling k − = − 1 y (1 ) y L p p ik ik ik ik = = 1 1 k i • EM algorithm used p = = ik E Y ( | Z 1) ik k I k − − 1 (1 p ) ∏ ik = i 1 Z = = E Y ( | 0) 0 ik k Slide 11 of 20

  12. Heterogeneous Population Outline • HIV surveillance (Verstraeten et al., Tropical Medicine � What is group and International Health , 1998) testing? � Homogeneous AGE EDUC. groupres gnum Population � Heterogeneous 21 4 0 1 Population 16 2 0 1 � Matrix Pooling 17 1 0 1 17 2 0 1 18 1 0 1 ⁞ 25 2 1 85 29 3 1 85 17 2 1 85 18 2 1 85 Slide 12 of 20 18 2 1 85

  13. Heterogeneous Population Outline • Model HIV with two covariates: AGE and EDUC. � What is group testing? > fit1 <- gtreg(formula = groupres ~ AGE + EDUC., data = hivsurv, groupn = gnum, sens = 1, spec = 1 � Homogeneous Population method = "Vansteelandt") � Heterogeneous • Result is a list that contains a lot of information: Population > class(fit1) � Matrix Pooling [1] "gt" > names(fit1) [1] "coefficients" "hessian" "fitted.group.values" [4] "deviance" "df.residual" "null.deviance" [7] "df.null" "aic" "counts" [10] "residuals" "z" "call" [13] "formula" "method" "link" [16] "terms" • Summarize the results: > summary(fit1) Slide 13 of 20

  14. Heterogeneous Population Outline (Continued) � What is group testing? Call: gtreg(formula = groupres ~ AGE + EDUC., data = hivsurv, groupn � Homogeneous Population = gnum, linkf = "logit", method = "Vansteelandt") Deviance Residuals: � Heterogeneous Population Min 1Q Median 3Q Max � Matrix Pooling -1.1868 -0.9376 -0.8197 1.3223 1.6826 Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) -2.78115 1.45576 -1.910 0.0561 . AGE -0.04921 0.06224 -0.791 0.4292 EDUC. 0.67646 0.40087 1.687 0.0915 . --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 − − + exp( 2.781 0.049 0.676 ) age educ = + ˆ p − − + 1 exp( 2.781 0.049 0.676 ) age educ Slide 14 of 20

  15. Heterogeneous Population Outline (Continued) � What is group testing? � Homogeneous Null deviance: 112.1 on 428 degrees of freedom Population Residual deviance: 109.3 on 425 degrees of freedom � Heterogeneous AIC: 115.3 Population � Matrix Pooling Number of iterations in optim(): 194 summary.gt(), predict.gt(), residuals.gt() • − Similar to those method functions for glm class • sim.g() − Simulates data in group testing form Slide 15 of 20

  16. Matrix Pooling Outline • Regression in the matrix pooling group testing scheme � What is group testing? − Put individual specimens in square arrays and test each � Homogeneous row and column pool (Phatarfod and Sudbury, 1994) Population − A simple example of 4 × 4 square array: � Heterogeneous Population � Matrix Pooling Slide 16 of 20

  17. Matrix Pooling Outline • Regression in the matrix pooling group testing scheme � What is group (continued) testing? � Homogeneous − EM algorithm Population � Heterogeneous − E( Y ij | Row and column responses) in the E-step cannot Population be explicitly expressed � Matrix Pooling − Gibbs sampling in each E-step, suggested by Xie ( Statistics in Medicine , 2001) Slide 17 of 20

  18. Matrix Pooling Outline • gtreg.mp() � What is group testing? − Fit the group testing regression model in the matrix � Homogeneous pooling setting Population � Heterogeneous − Individual retests can be included Population • A set of method functions available � Matrix Pooling • Example > head(sa1) x col.resp row.resp coln rown arrayn retest 1 29.96059 0 0 1 1 1 NA 2 61.28205 0 1 1 2 1 NA 3 34.27341 0 1 1 3 1 NA 4 46.19001 0 0 1 4 1 NA 5 39.43801 0 1 1 5 1 NA 6 45.88038 1 0 2 1 1 NA Slide 18 of 20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend