Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits - - PowerPoint PPT Presentation

adaptive monte carlo multiple testing via multi armed
SMART_READER_LITE
LIVE PREVIEW

Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits - - PowerPoint PPT Presentation

Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits Martin Zhang joint work with: David Tse & James Zou Stanford University Problem | Monte Carlo Multiple Hypothesis Testing SNP 1 SNP 2 SNP m Problem | Monte Carlo Multiple


slide-1
SLIDE 1

Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits

Martin Zhang

joint work with:

David Tse & James Zou Stanford University

slide-2
SLIDE 2

Problem | Monte Carlo Multiple Hypothesis Testing

SNP1

SNP2 SNPm

slide-3
SLIDE 3

Problem | Monte Carlo Multiple Hypothesis Testing

P1 P2 Pm

SNP1

SNP2 SNPm

slide-4
SLIDE 4

Problem | Monte Carlo Multiple Hypothesis Testing

P1 P2 Pm

SNP1

SNP2 SNPm

× ×

slide-5
SLIDE 5

Problem | Monte Carlo Multiple Hypothesis Testing

Monte Carlo test

P1 P2 Pm

SNP1

SNP2 SNPm

× ×

P1 ∼ 1 n

n

j=1

𝕁{Tnull

1,j ≥ tobs 1 }

slide-6
SLIDE 6

Problem | Monte Carlo Multiple Hypothesis Testing

Monte Carlo test Benjamini Hochberg procedure

P1 P2 Pm

SNP1

SNP2 SNPm

× ×

P1 ∼ 1 n

n

j=1

𝕁{Tnull

1,j ≥ tobs 1 }

Data-dependent # of discoveries Control FDR = 𝔽 [

false discovery discovery ]

slide-7
SLIDE 7

Monte Carlo test Benjamini Hochberg procedure

P1 P2 Pm

SNP1

SNP2 SNPm

× ×

P1 ∼ 1 n

n

j=1

𝕁{Tnull

1,j ≥ tobs 1 }

Data-dependent # of discoveries Control FDR = 𝔽 [

false discovery discovery ]

Problem | Monte Carlo Multiple Hypothesis Testing

Computational cost: nm

slide-8
SLIDE 8

Monte Carlo test Benjamini Hochberg procedure

P1 P2 Pm

SNP1

SNP2 SNPm

× ×

P1 ∼ 1 n

n

j=1

𝕁{Tnull

1,j ≥ tobs 1 }

Data-dependent # of discoveries Control FDR = 𝔽 [

false discovery discovery ]

Problem | Monte Carlo Multiple Hypothesis Testing

Computational cost: nm

hypothesis tests

m

MC samples per test

n ×

slide-9
SLIDE 9

Genome-wide association studies

m = 500,000 n = 50,000,000

MC samples per test hypothesis tests

m n

Problem | Monte Carlo Multiple Hypothesis Testing

slide-10
SLIDE 10

Genome-wide association studies

m = 500,000 n = 50,000,000

T

  • tal MC samples:

nm = 2.5 × 1013

T ypical computation time: ~2 months MC samples per test hypothesis tests

m n

Problem | Monte Carlo Multiple Hypothesis Testing

slide-11
SLIDE 11

Genome-wide association studies

m = 500,000 n = 50,000,000

T

  • tal MC samples:

nm = 2.5 × 1013

T ypical computation time: ~2 months

Can we make it faster?

MC samples per test hypothesis tests

m n

Problem | Monte Carlo Multiple Hypothesis Testing

slide-12
SLIDE 12

Results | Adaptive Monte Carlo Multiple Testing (AMT)

same discoveries with high probability; information theoretically optimal

Theorem (informal):

baseline: nm

Expected # of MC samples:

nm

slide-13
SLIDE 13

Results | Adaptive Monte Carlo Multiple Testing (AMT)

same discoveries with high probability; information theoretically optimal

Theorem (informal):

baseline: nm

Expected # of MC samples:

nm 2 months 1 hour with the same discoveries

GWAS example:

slide-14
SLIDE 14

rank k p-value

1 2 3 4 5 6 7 8 1

Quantities to estimate:

Results | Adaptive Monte Carlo Multiple Testing (AMT)

slide-15
SLIDE 15

rank k p-value

1 2 3 4 5 6 7 8 1

τ*

BH threshold τ*

Quantities to estimate:

Results | Adaptive Monte Carlo Multiple Testing (AMT)

slide-16
SLIDE 16

How each p-value compares with τ*

rank k p-value

1 2 3 4 5 6 7 8 1

τ*

BH threshold τ*

Quantities to estimate:

Results | Adaptive Monte Carlo Multiple Testing (AMT)

slide-17
SLIDE 17

How each p-value compares with τ*

rank k p-value

1 2 3 4 5 6 7 8 1

τ*

BH threshold τ*

Quantities to estimate:

More MC samples

Results | Adaptive Monte Carlo Multiple Testing (AMT)

slide-18
SLIDE 18

How each p-value compares with τ*

rank k p-value

1 2 3 4 5 6 7 8 1

τ*

BH threshold τ*

Quantities to estimate:

More MC samples Less MC samples

Results | Adaptive Monte Carlo Multiple Testing (AMT)

slide-19
SLIDE 19

How each p-value compares with τ*

rank k p-value

1 2 3 4 5 6 7 8 1

τ*

BH threshold τ*

Quantities to estimate:

More MC samples

Adaptive Estimation via Multi-Armed Bandits

Less MC samples

Results | Adaptive Monte Carlo Multiple Testing (AMT)