sizes of non-null hypotheses Jennifer Brennan, Ramya Korlakai - - PowerPoint PPT Presentation

β–Ά
sizes of non null hypotheses
SMART_READER_LITE
LIVE PREVIEW

sizes of non-null hypotheses Jennifer Brennan, Ramya Korlakai - - PowerPoint PPT Presentation

Estimating the number and effect sizes of non-null hypotheses Jennifer Brennan, Ramya Korlakai Vinayak, Kevin Jamieson jrb@cs.washington.edu ICML 2020 Example: Fruit Fly Genetics Hao et al. (2008) measured the effect of 13,000 fruit fly genes


slide-1
SLIDE 1

Estimating the number and effect sizes of non-null hypotheses

Jennifer Brennan, Ramya Korlakai Vinayak, Kevin Jamieson jrb@cs.washington.edu ICML 2020

slide-2
SLIDE 2

Example: Fruit Fly Genetics

Hao et al. (2008) measured the effect of 13,000 fruit fly genes on susceptibility to influenza Measurements were distributed N(0,1) under the null, higher indicates protection from influenza

More protection from influenza

slide-3
SLIDE 3

Example: Fruit Fly Genetics

Hao et al. (2008) measured the effect of 13,000 fruit fly genes on susceptibility to influenza Measurements were distributed N(0,1) under the null, higher indicates protection from influenza

Multiple hypothesis testing identifies few discoveries

Significant Genes

slide-4
SLIDE 4

Example: Fruit Fly Genetics

Hao et al. (2008) measured the effect of 13,000 fruit fly genes on susceptibility to influenza Measurements were distributed N(0,1) under the null, higher indicates protection from influenza

Observed distribution does not match theoretical null

𝑂 0, 1

slide-5
SLIDE 5

Example: Fruit Fly Genetics

Hao et al. (2008) measured the effect of 13,000 fruit fly genes on susceptibility to influenza Measurements were distributed N(0,1) under the null, higher indicates protection from influenza

Observed distribution does not match theoretical null

𝑂 0, 1

Too many small, positive measurements for chance alone

slide-6
SLIDE 6

Example: Fruit Fly Genetics

Hao et al. (2008) measured the effect of 13,000 fruit fly genes on susceptibility to influenza Measurements were distributed N(0,1) under the null, higher indicates protection from influenza

Observed distribution does not match theoretical null

𝑂 0, 1

Too small to claim individual significance

slide-7
SLIDE 7

Example: Fruit Fly Genetics

Idea: These genes can be counted, even though they can’t be identified

slide-8
SLIDE 8

Example: Fruit Fly Genetics

Idea: These genes can be counted, even though they can’t be identified

>7% of genes have effect size >1/4 (at least 8% increase in influenza resistance)

Our Estimator

slide-9
SLIDE 9

Example: Fruit Fly Genetics

Idea: These genes can be counted, even though they can’t be identified

>2% of genes have effect size >1 (at least 28% increase in influenza resistance) >7% of genes have effect size >1/4 (at least 8% increase in influenza resistance)

Our Estimator

slide-10
SLIDE 10

Example: Fruit Fly Genetics

Idea: These genes can be counted, even though they can’t be identified Enables power analysis for future experimental designs

>2% of genes have effect size >1 (at least 28% increase in influenza resistance) >7% of genes have effect size >1/4 (at least 8% increase in influenza resistance)

Our Estimator

slide-11
SLIDE 11

Example: Fruit Fly Genetics

Idea: These genes can be counted, even though they can’t be identified

Next Experiment: Take precise measurements (e.g., use many replications) to identify these genes

Enables power analysis for future experimental designs

>2% of genes have effect size >1 (at least 28% increase in influenza resistance) >7% of genes have effect size >1/4 (at least 8% increase in influenza resistance)

Our Estimator

slide-12
SLIDE 12

Example: Fruit Fly Genetics

Idea: These genes can be counted, even though they can’t be identified

Next Experiment: Take precise measurements (e.g., use many replications) to identify these genes Next Experiment: Take less precise measurements, identify fewer genes

Enables power analysis for future experimental designs

>2% of genes have effect size >1 (at least 28% increase in influenza resistance) >7% of genes have effect size >1/4 (at least 8% increase in influenza resistance)

Our Estimator

slide-13
SLIDE 13

Formal problem statement

slide-14
SLIDE 14

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions

slide-15
SLIDE 15

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ—

slide-16
SLIDE 16

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size

slide-17
SLIDE 17

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size Observe π‘Œπ‘— ∼ 𝑔(πœˆπ‘—)

slide-18
SLIDE 18

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size Observe π‘Œπ‘— ∼ 𝑔(πœˆπ‘—) E.g. 𝑔 πœˆπ‘— = 𝑂(πœˆπ‘—, 1)

slide-19
SLIDE 19

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size Observe π‘Œπ‘— ∼ 𝑔(πœˆπ‘—) E.g. 𝑔 πœˆπ‘— = 𝑂(πœˆπ‘—, 1) Identification: Which πœˆπ‘— > 0? Counting: What is the probability 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 0)?

slide-20
SLIDE 20

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size Observe π‘Œπ‘— ∼ 𝑔(πœˆπ‘—) E.g. 𝑔 πœˆπ‘— = 𝑂(πœˆπ‘—, 1) Identification: Which πœˆπ‘— > 0? Counting: What is the probability 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿), for all 𝛿?

slide-21
SLIDE 21

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size Observe π‘Œπ‘— ∼ 𝑔(πœˆπ‘—) E.g. 𝑔 πœˆπ‘— = 𝑂(πœˆπ‘—, 1) Identification: Which πœˆπ‘— > 0? (Returns a set in [n]) Counting: What is the probability 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿), for all 𝛿?

(Returns a fraction)

slide-22
SLIDE 22

Formal problem statement

We view multiple hypothesis testing from the perspective of learning mixture distributions For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— πœˆπ‘— is the (unknown) effect size Observe π‘Œπ‘— ∼ 𝑔(πœˆπ‘—) E.g. 𝑔 πœˆπ‘— = 𝑂(πœˆπ‘—, 1) Goal Estimate πœ‚πœ‰βˆ— 𝛿 = 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿), for all 𝛿

Constraint Never overestimate the true fraction

slide-23
SLIDE 23

Related work

Estimating the number of non-nulls (𝜈 β‰  0)

Early techniques [Schweder and SpjΓΈtvoll, 1982; Genovese et al., 2004; Meinshausen et al., 2006] relied on uniformity of p-values under the null Techniques do not extend to arbitrary thresholds (β€œHow many genes improved influenza resistance by at least 20%?”)

Plug-in estimators

Estimate the entire density πœ‰, then compute 𝑄

πœ‰(𝜈 > 𝛿)

Does not respect our constraint, that we cannot overestimate

Connections to False Discovery Rate (FDR) control

Tighter FDR control can be obtained by knowing number of non-nulls Previous methods either do not satisfy our constraint [Storey, 2002; Li and Barber,

2019], or perform poorly in our regime of interest (many hypotheses, small

effect sizes) [Stephens, 2016; Katsevich and Ramdas, 2018]

slide-24
SLIDE 24

Related work

Estimating the number of non-nulls (𝜈 β‰  0)

Early techniques [Schweder and SpjΓΈtvoll, 1982; Genovese et al., 2004; Meinshausen et al., 2006] relied on uniformity of p-values under the null Techniques do not extend to arbitrary thresholds (β€œHow many genes improved influenza resistance by at least 20%?”)

Plug-in estimators

Estimate the entire density πœ‰, then compute 𝑄

πœ‰(𝜈 > 𝛿)

Does not respect our constraint, that we cannot overestimate

Connections to False Discovery Rate (FDR) control

Tighter FDR control can be obtained by knowing number of non-nulls Previous methods either do not satisfy our constraint [Storey, 2002; Li and Barber,

2019], or perform poorly in our regime of interest (many hypotheses, small

effect sizes) [Stephens, 2016; Katsevich and Ramdas, 2018]

slide-25
SLIDE 25

Related work

Estimating the number of non-nulls (𝜈 β‰  0)

Early techniques [Schweder and SpjΓΈtvoll, 1982; Genovese et al., 2004; Meinshausen et al., 2006] relied on uniformity of p-values under the null Techniques do not extend to arbitrary thresholds (β€œHow many genes improved influenza resistance by at least 20%?”)

Plug-in estimators

Estimate the entire density πœ‰, then compute 𝑄

πœ‰(𝜈 > 𝛿)

Does not respect our constraint, that we cannot overestimate

Connections to False Discovery Rate (FDR) control

Tighter FDR control can be obtained by knowing number of non-nulls Previous methods either do not satisfy our constraint [Storey, 2002; Li and Barber,

2019], or perform poorly in our regime of interest (many hypotheses, small

effect sizes) [Stephens, 2016; Katsevich and Ramdas, 2018]

slide-26
SLIDE 26

Our Estimator

slide-27
SLIDE 27

Our Estimator

Goal Estimate Constraint Never overestimate Step 1 Consider the empirical CDF (Cumulative Distribution Function)

slide-28
SLIDE 28

Our Estimator

Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF Goal Estimate Constraint Never overestimate

DKW Inequality

slide-29
SLIDE 29

Our Estimator

Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF Goal Estimate Constraint Never overestimate

With high probability, the true CDF lives within this interval DKW Inequality

slide-30
SLIDE 30

Our Estimator

Goal Estimate Constraint Never overestimate Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF

With high probability, the true CDF lives within this interval DKW Inequality

slide-31
SLIDE 31

Our Estimator

Goal Estimate Constraint Never overestimate Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF

With high probability, the true CDF lives within this interval This could be the true CDF N(0,1) could not be the true CDF

slide-32
SLIDE 32

Our Estimator

Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF Step 3 Return the smallest amount of mass that could feasibly have generated the empirical CDF Goal Estimate Constraint Never overestimate

With high probability, the true CDF lives within this interval This is the CDF in this interval with the least mass above Ξ³

slide-33
SLIDE 33

Our Estimator

Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF Step 3 Return the smallest amount of mass that could feasibly have generated the empirical CDF Goal Estimate Constraint Never overestimate

A convex program! (efficiently computable) Constraint is satisfied with high probability Our sample complexity results match known lower bounds

slide-34
SLIDE 34

Our Estimator

Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF Step 3 Return the smallest amount of mass that could feasibly have generated the empirical CDF Goal Estimate Constraint Never overestimate

A convex program! (efficiently computable) Constraint is satisfied with high probability Our sample complexity results match known lower bounds

slide-35
SLIDE 35

Our Estimator

Step 1 Consider the empirical CDF (Cumulative Distribution Function) Step 2 Generate confidence intervals on the true CDF Step 3 Return the smallest amount of mass that could feasibly have generated the empirical CDF Goal Estimate Constraint Never overestimate

Our sample complexity results match known lower bounds A convex program! (efficiently computable) Constraint is satisfied with high probability

slide-36
SLIDE 36

Theorem

Our estimator provides the following guarantees:

With probability 1 βˆ’ 𝛽, does not overestimate πœ‚πœ‰βˆ— 𝛿 for any 𝛿

For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— Observe π‘Œπ‘— ∼ 𝑔 πœˆπ‘— Estimate πœ‚πœ‰βˆ— 𝛿 = 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿)

slide-37
SLIDE 37

Theorem

Our estimator provides the following guarantees:

With probability 1 βˆ’ 𝛽, does not overestimate πœ‚πœ‰βˆ— 𝛿 for any 𝛿 With probability 1 βˆ’ πœ€, estimate is at most 𝜻 from the truth whenever

For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— Observe π‘Œπ‘— ∼ 𝑔 πœˆπ‘— Estimate πœ‚πœ‰βˆ— 𝛿 = 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿)

slide-38
SLIDE 38

Theorem

Our estimator provides the following guarantees:

With probability 1 βˆ’ 𝛽, does not overestimate πœ‚πœ‰βˆ— 𝛿 for any 𝛿 With probability 1 βˆ’ πœ€, estimate is at most 𝜻 from the truth whenever

𝐺

πœ‰βˆ—

CDFs 𝐺

πœ‰ corresponding to

all mixing distributions πœ‰ with less than πœ‚πœ‰βˆ— 𝛿 βˆ’ 𝜁 probability mass above 𝛿 For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— Observe π‘Œπ‘— ∼ 𝑔 πœˆπ‘— Estimate πœ‚πœ‰βˆ— 𝛿 = 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿)

slide-39
SLIDE 39

Theorem

Our estimator provides the following guarantees:

With probability 1 βˆ’ 𝛽, does not overestimate πœ‚πœ‰βˆ— 𝛿 for any 𝛿 With probability 1 βˆ’ πœ€, estimate is at most 𝜻 from the truth whenever

𝐺

πœ‰βˆ—

Minimum β„“βˆž distance CDFs 𝐺

πœ‰ corresponding to

all mixing distributions πœ‰ with less than πœ‚πœ‰βˆ— 𝛿 βˆ’ 𝜁 probability mass above 𝛿 For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— Observe π‘Œπ‘— ∼ 𝑔 πœˆπ‘— Estimate πœ‚πœ‰βˆ— 𝛿 = 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿)

slide-40
SLIDE 40

Theorem

Our estimator provides the following guarantees:

With probability 1 βˆ’ 𝛽, does not overestimate πœ‚πœ‰βˆ— 𝛿 for any 𝛿 With probability 1 βˆ’ πœ€, estimate is at most 𝜻 from the truth whenever

𝐺

πœ‰βˆ—

Minimum β„“βˆž distance CDFs 𝐺

πœ‰ corresponding to

all mixing distributions πœ‰ with less than πœ‚πœ‰βˆ— 𝛿 βˆ’ 𝜁 probability mass above 𝛿

Goal: lower bound this distance

For 𝑗 = 1, 2, … , π‘œ Draw πœˆπ‘— ∼ πœ‰βˆ— Observe π‘Œπ‘— ∼ 𝑔 πœˆπ‘— Estimate πœ‚πœ‰βˆ— 𝛿 = 𝑄

πœˆβˆΌπœ‰βˆ—(𝜈 > 𝛿)

slide-41
SLIDE 41

Counting at least half of the discoveries

Let π‘Œπ‘—~𝑂(πœˆπ‘—, 1) be drawn from a mixture of Gaussians, with πœ‚βˆ— alternate hypotheses of effect size π›Ώβˆ— < 1 With probability at least 1 βˆ’ πœ€, our estimator detects over half of the alternate hypotheses (i.e., መ πœ‚π‘œ 0 >

1 2 πœ‚βˆ—), whenever

Matches a novel lower bound

π›Ώβˆ— πœ‚βˆ— How much mass is (strictly) above 0?

slide-42
SLIDE 42

Experiments

slide-43
SLIDE 43

Comparisons to baselines

slide-44
SLIDE 44

Comparisons to baselines

What fraction of genes are non-null?

slide-45
SLIDE 45

Comparisons to baselines

What fraction of genes are non-null? What fraction of genes have effect at least Β½ the true alternate?

slide-46
SLIDE 46

Comparisons to baselines

What fraction of genes are non-null? What fraction of genes have effect at least Β½ the true alternate?

slide-47
SLIDE 47

Comparisons to baselines

What fraction of genes are non-null? What fraction of genes have effect at least Β½ the true alternate?

slide-48
SLIDE 48

Other applications of this estimator

Standardized Testing

Each school has some πœˆπ‘— indicating its students’ true performance We observe π‘Œπ‘—, a noisy measurement of πœˆπ‘— (e.g., students’ average exam score) Our estimator: β€œat least Y% of schools are below proficient in math” Interesting on its own, or to suggest further testing to identify these schools

slide-49
SLIDE 49

Other applications of this estimator

Standardized Testing

Each school has some πœˆπ‘— indicating its students’ true performance We observe π‘Œπ‘—, a noisy measurement of πœˆπ‘— (e.g., students’ average exam score) Our estimator: β€œat least Y% of schools are below proficient in math” Interesting on its own, or to suggest further testing to identify these schools

Public Health*

Each person has some πœˆπ‘— indicating their susceptibility to the flu (variable due to age, health, etc.) We observe π‘Œπ‘—, the number of flu seasons they were sick, in the past five years Our estimator: β€œat most Y% of people have a 25% chance or greater of getting sick in a given year” (Impossible to identify these people with confidence)

*Example due to Tian, Kong and Valiant (2017)