ABC Methods for Bayesian Model Choice Christian P. Robert Universit - - PowerPoint PPT Presentation

abc methods for bayesian model choice
SMART_READER_LITE
LIVE PREVIEW

ABC Methods for Bayesian Model Choice Christian P. Robert Universit - - PowerPoint PPT Presentation

ABC Methods for Bayesian Model Choice ABC Methods for Bayesian Model Choice Christian P. Robert Universit e Paris-Dauphine, IuF, & CREST http://www.ceremade.dauphine.fr/~xian Bayes-250, Edinburgh, September 6, 2011 ABC Methods for


slide-1
SLIDE 1

ABC Methods for Bayesian Model Choice

ABC Methods for Bayesian Model Choice

Christian P. Robert

Universit´ e Paris-Dauphine, IuF, & CREST http://www.ceremade.dauphine.fr/~xian

Bayes-250, Edinburgh, September 6, 2011

slide-2
SLIDE 2

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Approximate Bayesian computation

Approximate Bayesian computation ABC for model choice Gibbs random fields Generic ABC model choice

slide-3
SLIDE 3

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Regular Bayesian computation issues

When faced with a non-standard posterior distribution π(θ|y) ∝ π(θ)L(θ|y) the standard solution is to use simulation (Monte Carlo) to produce a sample θ1, . . . , θT from π(θ|y) (or approximately by Markov chain Monte Carlo methods) [Robert & Casella, 2004]

slide-4
SLIDE 4

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Untractable likelihoods

Cases when the likelihood function f(y|θ) is unavailable and when the completion step f(y|θ) =

  • Z

f(y, z|θ) dz is impossible or too costly because of the dimension of z c MCMC cannot be implemented!

slide-5
SLIDE 5

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Untractable likelihoods

c MCMC cannot be implemented!

slide-6
SLIDE 6

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

The ABC method

Bayesian setting: target is π(θ)f(x|θ)

slide-7
SLIDE 7

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

The ABC method

Bayesian setting: target is π(θ)f(x|θ) When likelihood f(x|θ) not in closed form, likelihood-free rejection technique:

slide-8
SLIDE 8

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

The ABC method

Bayesian setting: target is π(θ)f(x|θ) When likelihood f(x|θ) not in closed form, likelihood-free rejection technique:

ABC algorithm

For an observation y ∼ f(y|θ), under the prior π(θ), keep jointly simulating θ′ ∼ π(θ) , z ∼ f(z|θ′) , until the auxiliary variable z is equal to the observed value, z = y. [Tavar´ e et al., 1997]

slide-9
SLIDE 9

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

A as approximative

When y is a continuous random variable, equality z = y is replaced with a tolerance condition, ̺(y, z) ≤ ǫ where ̺ is a distance

slide-10
SLIDE 10

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

A as approximative

When y is a continuous random variable, equality z = y is replaced with a tolerance condition, ̺(y, z) ≤ ǫ where ̺ is a distance Output distributed from π(θ) Pθ{̺(y, z) < ǫ} ∝ π(θ|̺(y, z) < ǫ)

slide-11
SLIDE 11

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

ABC algorithm

Algorithm 1 Likelihood-free rejection sampler for i = 1 to N do repeat generate θ′ from the prior distribution π(·) generate z from the likelihood f(·|θ′) until ρ{η(z), η(y)} ≤ ǫ set θi = θ′ end for where η(y) defines a (maybe in-sufficient) statistic

slide-12
SLIDE 12

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Output

The likelihood-free algorithm samples from the marginal in z of: πǫ(θ, z|y) = π(θ)f(z|θ)IAǫ,y(z)

  • Aǫ,y×Θ π(θ)f(z|θ)dzdθ ,

where Aǫ,y = {z ∈ D|ρ(η(z), η(y)) < ǫ}.

slide-13
SLIDE 13

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Output

The likelihood-free algorithm samples from the marginal in z of: πǫ(θ, z|y) = π(θ)f(z|θ)IAǫ,y(z)

  • Aǫ,y×Θ π(θ)f(z|θ)dzdθ ,

where Aǫ,y = {z ∈ D|ρ(η(z), η(y)) < ǫ}. The idea behind ABC is that the summary statistics coupled with a small tolerance should provide a good approximation of the posterior distribution: πǫ(θ|y) =

  • πǫ(θ, z|y)dz ≈ π(θ|y) .
slide-14
SLIDE 14

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

MA example

Consider the MA(q) model xt = ǫt +

q

  • i=1

ϑiǫt−i Simple prior: uniform prior over the identifiability zone, e.g. triangle for MA(2)

slide-15
SLIDE 15

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

MA example (2)

ABC algorithm thus made of

  • 1. picking a new value (ϑ1, ϑ2) in the triangle
  • 2. generating an iid sequence (ǫt)−q<t≤T
  • 3. producing a simulated series (x′

t)1≤t≤T

slide-16
SLIDE 16

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

MA example (2)

ABC algorithm thus made of

  • 1. picking a new value (ϑ1, ϑ2) in the triangle
  • 2. generating an iid sequence (ǫt)−q<t≤T
  • 3. producing a simulated series (x′

t)1≤t≤T

Distance: basic distance between the series ρ((x′

t)1≤t≤T , (xt)1≤t≤T ) = T

  • t=1

(xt − x′

t)2

  • r between summary statistics like the first q autocorrelations

τj =

T

  • t=j+1

xtxt−j

slide-17
SLIDE 17

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Comparison of distance impact

Evaluation of the tolerance on the ABC sample against both distances (ǫ = 100%, 10%, 1%, 0.1%) for an MA(2) model

slide-18
SLIDE 18

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Comparison of distance impact

0.0 0.2 0.4 0.6 0.8 1 2 3 4 θ1 −2.0 −1.0 0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 θ2

Evaluation of the tolerance on the ABC sample against both distances (ǫ = 100%, 10%, 1%, 0.1%) for an MA(2) model

slide-19
SLIDE 19

ABC Methods for Bayesian Model Choice Approximate Bayesian computation

Comparison of distance impact

0.0 0.2 0.4 0.6 0.8 1 2 3 4 θ1 −2.0 −1.0 0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 θ2

Evaluation of the tolerance on the ABC sample against both distances (ǫ = 100%, 10%, 1%, 0.1%) for an MA(2) model

slide-20
SLIDE 20

ABC Methods for Bayesian Model Choice ABC for model choice

ABC for model choice

Approximate Bayesian computation ABC for model choice Gibbs random fields Generic ABC model choice

slide-21
SLIDE 21

ABC Methods for Bayesian Model Choice ABC for model choice

Bayesian model choice

Principle Several models M1, M2, . . . are considered simultaneously for dataset y and model index M central to inference. Use of a prior π(M = m), plus a prior distribution on the parameter conditional on the value m of the model index, πm(θm) Goal is to derive the posterior distribution of M, a challenging computational target when models are complex.

slide-22
SLIDE 22

ABC Methods for Bayesian Model Choice ABC for model choice

Generic ABC for model choice

Algorithm 2 Likelihood-free model choice sampler (ABC-MC) for t = 1 to T do repeat Generate m from the prior π(M = m) Generate θm from the prior πm(θm) Generate z from the model fm(z|θm) until ρ{η(z), η(y)} < ǫ Set m(t) = m and θ(t) = θm end for [Toni, Welch, Strelkowa, Ipsen & Stumpf, 2009]

slide-23
SLIDE 23

ABC Methods for Bayesian Model Choice ABC for model choice

ABC estimates

Posterior probability π(M = m|y) approximated by the frequency

  • f acceptances from model m

1 T

T

  • t=1

Im(t)=m . Early issues with implementation:

◮ should tolerances ǫ be the same for all models? ◮ should summary statistics vary across models? incl. their

dimension?

◮ should the distance measure ρ vary across models?

slide-24
SLIDE 24

ABC Methods for Bayesian Model Choice ABC for model choice

ABC estimates

Posterior probability π(M = m|y) approximated by the frequency

  • f acceptances from model m

1 T

T

  • t=1

Im(t)=m . Early issues with implementation:

◮ ǫ then needs to become part of the model ◮ Varying statistics incompatible with Bayesian model choice

proper

◮ ρ then part of the model

Extension to a weighted polychotomous logistic regression estimate

  • f π(M = m|y), with non-parametric kernel weights

[Cornuet et al., DIYABC, 2009]

slide-25
SLIDE 25

ABC Methods for Bayesian Model Choice ABC for model choice

The great ABC controversy

On-going controvery in phylogeographic genetics about the validity

  • f using ABC for testing

Against: Templeton, 2008, 2009, 2010a, 2010b, 2010c, &tc argues that nested hypotheses cannot have higher probabilities than nesting hypotheses (!)

slide-26
SLIDE 26

ABC Methods for Bayesian Model Choice ABC for model choice

The great ABC controversy

On-going controvery in phylogeographic genetics about the validity

  • f using ABC for testing

Against: Templeton, 2008, 2009, 2010a, 2010b, 2010c, &tc argues that nested hypotheses cannot have higher probabilities than nesting hypotheses (!) Replies: Fagundes et al., 2008, Beaumont et al., 2010, Berger et al., 2010, Csill` ery et al., 2010 point out that the criticisms are addressed at [Bayesian] model-based inference and have nothing to do with ABC...

slide-27
SLIDE 27

ABC Methods for Bayesian Model Choice Gibbs random fields

Potts model

Potts model

  • c∈C Vc(y) is of the form
  • c∈C

Vc(y) = θS(y) = θ

  • l∼i

δyl=yi where l∼i denotes a neighbourhood structure

slide-28
SLIDE 28

ABC Methods for Bayesian Model Choice Gibbs random fields

Potts model

Potts model

  • c∈C Vc(y) is of the form
  • c∈C

Vc(y) = θS(y) = θ

  • l∼i

δyl=yi where l∼i denotes a neighbourhood structure In most realistic settings, summation Zθ =

  • x∈X

exp{θTS(x)} involves too many terms to be manageable and numerical approximations cannot always be trusted [Cucala et al., 2009]

slide-29
SLIDE 29

ABC Methods for Bayesian Model Choice Gibbs random fields

Neighbourhood relations

Setup Choice to be made between M neighbourhood relations i m ∼ i′ (0 ≤ m ≤ M − 1) with Sm(x) =

  • im

∼i′

I{xi=xi′} driven by the posterior probabilities of the models.

slide-30
SLIDE 30

ABC Methods for Bayesian Model Choice Gibbs random fields

Model index

Computational target: P(M = m|x) ∝

  • Θm

fm(x|θm)πm(θm) dθm π(M = m)

slide-31
SLIDE 31

ABC Methods for Bayesian Model Choice Gibbs random fields

Model index

Computational target: P(M = m|x) ∝

  • Θm

fm(x|θm)πm(θm) dθm π(M = m) If S(x) sufficient statistic for the joint parameters (M, θ0, . . . , θM−1), P(M = m|x) = P(M = m|S(x)) .

slide-32
SLIDE 32

ABC Methods for Bayesian Model Choice Gibbs random fields

Sufficient statistics in Gibbs random fields

slide-33
SLIDE 33

ABC Methods for Bayesian Model Choice Gibbs random fields

Sufficient statistics in Gibbs random fields

Each model m has its own sufficient statistic Sm(·) and S(·) = (S0(·), . . . , SM−1(·)) is also (model-)sufficient.

slide-34
SLIDE 34

ABC Methods for Bayesian Model Choice Gibbs random fields

Sufficient statistics in Gibbs random fields

Each model m has its own sufficient statistic Sm(·) and S(·) = (S0(·), . . . , SM−1(·)) is also (model-)sufficient. Explanation: For Gibbs random fields, x|M = m ∼ fm(x|θm) = f1

m(x|S(x))f2 m(S(x)|θm)

= 1 n(S(x))f2

m(S(x)|θm)

where n(S(x)) = ♯ {˜ x ∈ X : S(˜ x) = S(x)} c S(x) is therefore also sufficient for the joint parameters

slide-35
SLIDE 35

ABC Methods for Bayesian Model Choice Gibbs random fields

Toy example

iid Bernoulli model versus two-state first-order Markov chain, i.e. f0(x|θ0) = exp

  • θ0

n

  • i=1

I{xi=1}

  • {1 + exp(θ0)}n ,

versus f1(x|θ1) = 1 2 exp

  • θ1

n

  • i=2

I{xi=xi−1}

  • {1 + exp(θ1)}n−1 ,

with priors θ0 ∼ U(−5, 5) and θ1 ∼ U(0, 6) (inspired by “phase transition” boundaries).

slide-36
SLIDE 36

ABC Methods for Bayesian Model Choice Gibbs random fields

Toy example (2)

−40 −20 10 −5 5 BF01 BF ^

01

−40 −20 10 −10 −5 5 10 BF01 BF ^

01

(left) Comparison of the true BF m0/m1(x0) with BF m0/m1(x0) (in logs) over 2, 000 simulations and 4.106 proposals from the

  • prior. (right) Same when using tolerance ǫ corresponding to the

1% quantile on the distances.

slide-37
SLIDE 37

ABC Methods for Bayesian Model Choice Generic ABC model choice

Back to sufficiency

‘Sufficient statistics for individual models are unlikely to be very informative for the model probability. This is already well known and understood by the ABC-user community.’ [Scott Sisson, Jan. 31, 2011, ’Og]

slide-38
SLIDE 38

ABC Methods for Bayesian Model Choice Generic ABC model choice

Back to sufficiency

‘Sufficient statistics for individual models are unlikely to be very informative for the model probability. This is already well known and understood by the ABC-user community.’ [Scott Sisson, Jan. 31, 2011, ’Og] If η1(x) sufficient statistic for model m = 1 and parameter θ1 and η2(x) sufficient statistic for model m = 2 and parameter θ2, (η1(x), η2(x)) is not always sufficient for (m, θm)

slide-39
SLIDE 39

ABC Methods for Bayesian Model Choice Generic ABC model choice

Back to sufficiency

‘Sufficient statistics for individual models are unlikely to be very informative for the model probability. This is already well known and understood by the ABC-user community.’ [Scott Sisson, Jan. 31, 2011, ’Og] If η1(x) sufficient statistic for model m = 1 and parameter θ1 and η2(x) sufficient statistic for model m = 2 and parameter θ2, (η1(x), η2(x)) is not always sufficient for (m, θm) c Potential loss of information at the testing level

slide-40
SLIDE 40

ABC Methods for Bayesian Model Choice Generic ABC model choice

Limiting behaviour of B12 (T → ∞)

ABC approximation

  • B12(y) =

T

t=1 Imt=1 Iρ{η(zt),η(y)}≤ǫ

T

t=1 Imt=2 Iρ{η(zt),η(y)}≤ǫ

, where the (mt, zt)’s are simulated from the (joint) prior

slide-41
SLIDE 41

ABC Methods for Bayesian Model Choice Generic ABC model choice

Limiting behaviour of B12 (T → ∞)

ABC approximation

  • B12(y) =

T

t=1 Imt=1 Iρ{η(zt),η(y)}≤ǫ

T

t=1 Imt=2 Iρ{η(zt),η(y)}≤ǫ

, where the (mt, zt)’s are simulated from the (joint) prior As T go to infinity, limit Bǫ

12(y)

=

  • Iρ{η(z),η(y)}≤ǫπ1(θ1)f1(z|θ1) dz dθ1
  • Iρ{η(z),η(y)}≤ǫπ2(θ2)f2(z|θ2) dz dθ2

=

  • Iρ{η,η(y)}≤ǫπ1(θ1)fη

1 (η|θ1) dη dθ1

  • Iρ{η,η(y)}≤ǫπ2(θ2)fη

2 (η|θ2) dη dθ2

, where fη

1 (η|θ1) and fη 2 (η|θ2) distributions of η(z)

slide-42
SLIDE 42

ABC Methods for Bayesian Model Choice Generic ABC model choice

Limiting behaviour of B12 (ǫ → 0)

When ǫ goes to zero, Bη

12(y) =

  • π1(θ1)fη

1 (η(y)|θ1) dθ1

  • π2(θ2)fη

2 (η(y)|θ2) dθ2

slide-43
SLIDE 43

ABC Methods for Bayesian Model Choice Generic ABC model choice

Limiting behaviour of B12 (ǫ → 0)

When ǫ goes to zero, Bη

12(y) =

  • π1(θ1)fη

1 (η(y)|θ1) dθ1

  • π2(θ2)fη

2 (η(y)|θ2) dθ2

Bayes factor based on the sole observation of η(y)

slide-44
SLIDE 44

ABC Methods for Bayesian Model Choice Generic ABC model choice

Limiting behaviour of B12 (under sufficiency)

If η(y) sufficient statistic in both models, fi(y|θi) = gi(y)fη

i (η(y)|θi)

Thus B12(y) =

  • Θ1 π(θ1)g1(y)fη

1 (η(y)|θ1) dθ1

  • Θ2 π(θ2)g2(y)fη

2 (η(y)|θ2) dθ2

= g1(y)

  • π1(θ1)fη

1 (η(y)|θ1) dθ1

g2(y)

  • π2(θ2)fη

2 (η(y)|θ2) dθ2

= g1(y) g2(y) Bη

12(y) .

[Didelot, Everitt, Johansen & Lawson, 2011]

slide-45
SLIDE 45

ABC Methods for Bayesian Model Choice Generic ABC model choice

Limiting behaviour of B12 (under sufficiency)

If η(y) sufficient statistic in both models, fi(y|θi) = gi(y)fη

i (η(y)|θi)

Thus B12(y) =

  • Θ1 π(θ1)g1(y)fη

1 (η(y)|θ1) dθ1

  • Θ2 π(θ2)g2(y)fη

2 (η(y)|θ2) dθ2

= g1(y)

  • π1(θ1)fη

1 (η(y)|θ1) dθ1

g2(y)

  • π2(θ2)fη

2 (η(y)|θ2) dθ2

= g1(y) g2(y) Bη

12(y) .

[Didelot, Everitt, Johansen & Lawson, 2011] c No discrepancy only when cross-model sufficiency

slide-46
SLIDE 46

ABC Methods for Bayesian Model Choice Generic ABC model choice

Poisson/geometric example

Sample x = (x1, . . . , xn) from either a Poisson P(λ) or from a geometric G(p) Sum S =

n

  • i=1

xi = η(x) sufficient statistic for either model but not simultaneously Discrepancy ratio g1(x) g2(x) = S!n−S/

i xi!

1 n+S−1

S

slide-47
SLIDE 47

ABC Methods for Bayesian Model Choice Generic ABC model choice

Poisson/geometric discrepancy

Range of B12(x) versus Bη

12(x): The values produced have

nothing in common.

slide-48
SLIDE 48

ABC Methods for Bayesian Model Choice Generic ABC model choice

Formal recovery

Creating an encompassing exponential family f(x|θ1, θ2, α1, α2) ∝ exp{θT

1 η1(x) + θT 1 η1(x) + α1t1(x) + α2t2(x)}

leads to a sufficient statistic (η1(x), η2(x), t1(x), t2(x)) [Didelot, Everitt, Johansen & Lawson, 2011]

slide-49
SLIDE 49

ABC Methods for Bayesian Model Choice Generic ABC model choice

Formal recovery

Creating an encompassing exponential family f(x|θ1, θ2, α1, α2) ∝ exp{θT

1 η1(x) + θT 1 η1(x) + α1t1(x) + α2t2(x)}

leads to a sufficient statistic (η1(x), η2(x), t1(x), t2(x)) [Didelot, Everitt, Johansen & Lawson, 2011] In the Poisson/geometric case, if

i xi! is added to S, no

discrepancy

slide-50
SLIDE 50

ABC Methods for Bayesian Model Choice Generic ABC model choice

Formal recovery

Creating an encompassing exponential family f(x|θ1, θ2, α1, α2) ∝ exp{θT

1 η1(x) + θT 1 η1(x) + α1t1(x) + α2t2(x)}

leads to a sufficient statistic (η1(x), η2(x), t1(x), t2(x)) [Didelot, Everitt, Johansen & Lawson, 2011] Only applies in genuine sufficiency settings... c Inability to evaluate loss brought by summary statistics

slide-51
SLIDE 51

ABC Methods for Bayesian Model Choice Generic ABC model choice

The Pitman–Koopman lemma

Efficient sufficiency is not such a common occurrence:

Lemma

A necessary and sufficient condition for the existence of a sufficient statistic with fixed dimension whatever the sample size is that the sampling distribution belongs to an exponential family. [Pitman, 1933; Koopman, 1933]

slide-52
SLIDE 52

ABC Methods for Bayesian Model Choice Generic ABC model choice

The Pitman–Koopman lemma

Efficient sufficiency is not such a common occurrence:

Lemma

A necessary and sufficient condition for the existence of a sufficient statistic with fixed dimension whatever the sample size is that the sampling distribution belongs to an exponential family. [Pitman, 1933; Koopman, 1933] Provision of fixed support (consider U(0, θ) counterexample)

slide-53
SLIDE 53

ABC Methods for Bayesian Model Choice Generic ABC model choice

Meaning of the ABC-Bayes factor

‘This is also why focus on model discrimination typically (...) proceeds by (...) accepting that the Bayes Factor that one obtains is only derived from the summary statistics and may in no way correspond to that of the full model.’ [Scott Sisson, Jan. 31, 2011, ’Og]

slide-54
SLIDE 54

ABC Methods for Bayesian Model Choice Generic ABC model choice

Meaning of the ABC-Bayes factor

‘This is also why focus on model discrimination typically (...) proceeds by (...) accepting that the Bayes Factor that one obtains is only derived from the summary statistics and may in no way correspond to that of the full model.’ [Scott Sisson, Jan. 31, 2011, ’Og] In the Poisson/geometric case, if E[yi] = θ0 > 0, lim

n→∞ Bη 12(y) = (θ0 + 1)2

θ0 e−θ0

slide-55
SLIDE 55

ABC Methods for Bayesian Model Choice Generic ABC model choice

MA example

1 2 0.0 0.1 0.2 0.3 0.4 0.5 0.6 1 2 0.0 0.1 0.2 0.3 0.4 0.5 0.6 1 2 0.0 0.1 0.2 0.3 0.4 0.5 0.6 1 2 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7

Evolution [against ǫ] of ABC Bayes factor, in terms of frequencies of visits to models MA(1) (left) and MA(2) (right) when ǫ equal to 10, 1, .1, .01% quantiles on insufficient autocovariance distances. Sample

  • f 50 points from a MA(2) with θ1 = 0.6, θ2 = 0.2. True Bayes factor

equal to 17.71.

slide-56
SLIDE 56

ABC Methods for Bayesian Model Choice Generic ABC model choice

MA example

1 2 0.0 0.2 0.4 0.6 1 2 0.0 0.2 0.4 0.6 1 2 0.0 0.2 0.4 0.6 1 2 0.0 0.2 0.4 0.6 0.8

Evolution [against ǫ] of ABC Bayes factor, in terms of frequencies of visits to models MA(1) (left) and MA(2) (right) when ǫ equal to 10, 1, .1, .01% quantiles on insufficient autocovariance distances. Sample

  • f 50 points from a MA(1) model with θ1 = 0.6. True Bayes factor B21

equal to .004.

slide-57
SLIDE 57

ABC Methods for Bayesian Model Choice Generic ABC model choice

Further comments

‘There should be the possibility that for the same model, but different (non-minimal) [summary] statistics (so different η’s: η1 and η∗

1) the ratio of evidences may no

longer be equal to one.’ [Michael Stumpf, Jan. 28, 2011, ’Og] Using different summary statistics [on different models] may indicate the loss of information brought by each set but agreement does not lead to trustworthy approximations.

slide-58
SLIDE 58

ABC Methods for Bayesian Model Choice Generic ABC model choice

A population genetics evaluation

Population genetics example with

◮ 3 populations ◮ 2 scenari ◮ 15 individuals ◮ 5 loci ◮ single mutation parameter

slide-59
SLIDE 59

ABC Methods for Bayesian Model Choice Generic ABC model choice

A population genetics evaluation

Population genetics example with

◮ 3 populations ◮ 2 scenari ◮ 15 individuals ◮ 5 loci ◮ single mutation parameter ◮ 24 summary statistics ◮ 2 million ABC proposal ◮ importance [tree] sampling alternative

slide-60
SLIDE 60

ABC Methods for Bayesian Model Choice Generic ABC model choice

Stability of importance sampling

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

  • 0.0

0.2 0.4 0.6 0.8 1.0

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

slide-61
SLIDE 61

ABC Methods for Bayesian Model Choice Generic ABC model choice

Comparison with ABC

Use of 24 summary statistics and DIY-ABC logistic correction

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 importance sampling ABC direct and logistic

  • ● ●
slide-62
SLIDE 62

ABC Methods for Bayesian Model Choice Generic ABC model choice

Comparison with ABC

Use of 15 summary statistics and DIY-ABC logistic correction

  • −4

−2 2 4 6 −4 −2 2 4 6 importance sampling ABC direct

slide-63
SLIDE 63

ABC Methods for Bayesian Model Choice Generic ABC model choice

Comparison with ABC

Use of 15 summary statistics and DIY-ABC logistic correction

  • −4

−2 2 4 6 −4 −2 2 4 6 importance sampling ABC direct and logistic

slide-64
SLIDE 64

ABC Methods for Bayesian Model Choice Generic ABC model choice

Comparison with ABC

Use of 15 summary statistics and DIY-ABC logistic correction

  • 20

40 60 80 100 −2 −1 1 2 index log−ratio

slide-65
SLIDE 65

ABC Methods for Bayesian Model Choice Generic ABC model choice

A second population genetics experiment

◮ three populations, two divergent 100 gen. ago ◮ two scenarios [third pop. recent admixture between first two

  • pop. / diverging from pop. 1 5 gen. ago]

◮ In scenario 1, admixture rate 0.7 from pop. 1 ◮ 100 datasets with 100 diploid individuals per population, 50

independent microsatellite loci.

◮ Effective population size of 1000 and mutation rates of

0.0005.

◮ 6 parameters: admixture rate (U[0.1, 0.9]), three effective

population sizes (U[200, 2000]), the time of admixture/second divergence (U[1, 10]) and time of first divergence (U[50, 500]).

slide-66
SLIDE 66

ABC Methods for Bayesian Model Choice Generic ABC model choice

Results

IS algorithm performed with 100 coalescent trees per particle and 50,000 particles [12 calendar days using 376 processors] Using ten times as many loci and seven times as many individuals degrades the confidence in the importance sampling approximation because of an increased variability in the likelihood.

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Importance Sampling estimates of P(M=1|y) with 50,000 particles Importance Sampling estimates of P(M=1|y) with 1,000 particles

slide-67
SLIDE 67

ABC Methods for Bayesian Model Choice Generic ABC model choice

Results

Blurs potential divergence between ABC and genuine posterior probabilities because both are overwhelmingly close to one, due to the high information content of the data.

  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 ABC estimates of P(M=1|y) Importance Sampling estimates of P(M=1|y) with 50,000 particles

slide-68
SLIDE 68

ABC Methods for Bayesian Model Choice Generic ABC model choice

The only safe cases???

Besides specific models like Gibbs random fields, using distances over the data itself escapes the discrepancy... [Toni & Stumpf, 2010;Sousa et al., 2009]

slide-69
SLIDE 69

ABC Methods for Bayesian Model Choice Generic ABC model choice

The only safe cases???

Besides specific models like Gibbs random fields, using distances over the data itself escapes the discrepancy... [Toni & Stumpf, 2010;Sousa et al., 2009] ...and so does the use of more informal model fitting measures [Ratmann, Andrieu, Richardson and Wiujf, 2009]