Using non-parametric methods in the context
- f multiple testing to determine differentially
Using non-parametric methods in the context of multiple testing to - - PowerPoint PPT Presentation
Using non-parametric methods in the context of multiple testing to determine differentially expressed genes Greg Grant, Elisabetta Manduchi, Chris Stoeckert Penn Center for Bioinformatics CAMDA 2000 Outline Differential Expression
log scale
B and T
log scale
log scale
distributions and non-deterministic differential expression, thus complicating the problem of assigning confidence to predictions of differential expression. Only by including the absent calls do we see the difference in genes we expect to be differentially expressed, such as the following T-cell antigen CD7 precursor (Id: D007499)
i i > , 1 , 2
i i
, 1 , 1
1 1 1
1 , 1 , , 1
+ − − n X X
i j i
> C X X
i i i i 1 1 2 2
Prob µ µ
hypothetical data: assuming variance proportional to
necessary. real data:
intensities.
effect.
sates and reduces false positives at low and high intensity.
– Any predictions are false positives.
biological subclassing, with the latter effect diminishing as the number of replicates increases.
exact instead of conservative, the numbers in each column would converge to 0.1.
genes does not dramatically worsen the multiple testing problem of subclassing.
is fully represented in the replicates. If a class is very heterogeneous (e.g. B-cells) then many replicates might be needed to avoid over- representing a subclass by chance and therefore introducing false positives.
Brian Brunk Eugen Buehler Jonathan Crabtree Sue Davidson Sharon Diskin Georgi Kostov Phillip Le Joan Mazzarelli Shannon McWeeney Colleen Petrelli Debbie Pinney Angel Pizarro Jonathan Schug Jim Wolff