Multiple Testing
Applied Multivariate Statistics – Spring 2012
Multiple Testing Applied Multivariate Statistics Spring 2012 - - PowerPoint PPT Presentation
Multiple Testing Applied Multivariate Statistics Spring 2012 Overview Problem of multiple testing Controlling the FWER: - Bonferroni - Bonferroni-Holm Controlling the FDR: - Benjamini-Hochberg Case study 1 Package
Applied Multivariate Statistics – Spring 2012
1
“library(…, lib.loc = ‘path where you saved the folder of the package’)”
Bioconductor
2
3
Weight, blood pressure, heart rate, blood parameters, etc.
variable on the 5% significance level
significant effect !!
By definition, type 1 error is (at most) 5%
In example: Declare that wonder-pill changes variable, if in reality there is no change
Then: Every variable has a 5% chance of being “significantly changed by the drug”
4
Test 1 Test 2 Test 100
All tests 5% chance Significant tests Test 5 Test 19 Test 43 Test 77
significance (= one false positive test)
requires FWER to be less than 5%
5
Declared non-sign. Declared sign. Total True H0 U V M0 False H0 T S M1 Total M-R R M
(assuming independence among variables)
6
corrected p-value is less than significance level
corresponding p-value Pi holds: M ∗ 𝑄𝑗< 𝛽
7
H0(1): 0.005, H0(2): 0.011, H0(3): 0.02, H0(4): 0.04, H0(5): 0.13
Reject H0(1) , don’t reject H0(2) , H0(3) , H0(4) , H0(5)
8
corrected p-value is less than significance level
H0(i) denotes the null hypothesis for p-value P(i)
If at some point H0(j) can not be rejected, stop and don’t reject H0(j), H0(j+1), …, H0(M)
and is often better; still conservative
9
H0(1): 0.005, H0(2): 0.011, H0(3): 0.02, H0(4): 0.04, H0(5): 0.13
stop
Reject H0(1) and H0(2) , don’t reject H0(3) , H0(4) , H0(5)
10
We might be willing to accept A FEW false positives
significant results you found
11
Declared non-sign. Declared sign. Total True H0 U V M0 False H0 T S M1 Total M-R R M
corrected p-value is less than significance level
12
Bioconductor
13
Exploratory analysis; when generating hypothesis Report the number of tests you do (e.g.: “We investigated 40 features, but only report
Exploratory analysis; Screening: Select some features for further, more expensive investigation Balance between high power and low number of false positives
Confirmatory analysis; use if you really don’t want any false positives
14
Many hits / many False Pos. Few hits / few False Pos.
27 acute lymphoblastic leukemia (ALL) cases (code 0) 11 acute myeloid leukemia (AML) cases (code 1)
15
16
17
ulttest.html
18