Approximate Bayesian Computation Chris Drovandi, Charisse Farr - - PowerPoint PPT Presentation

approximate bayesian computation
SMART_READER_LITE
LIVE PREVIEW

Approximate Bayesian Computation Chris Drovandi, Charisse Farr - - PowerPoint PPT Presentation

ABC Examples Approximate Bayesian Computation Chris Drovandi, Charisse Farr October 24, 2012 Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation ABC Examples Approximate Bayesian Computation (ABC) Bayesian statistics


slide-1
SLIDE 1

ABC Examples

Approximate Bayesian Computation

Chris Drovandi, Charisse Farr

October 24, 2012

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-2
SLIDE 2

ABC Examples

Approximate Bayesian Computation (ABC)

Bayesian statistics involves inference based on the posterior distribution π(θ|y) ∝ f (y|θ)π(θ). What happens when likelihood f (y|θ) unavailable?

can’t use analytic Bayesian inference, Bayesian sampling schemes or Variational Bayes techniques these techniques all require knowledge of, or evaluation of, the likelihood function

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-3
SLIDE 3

ABC Examples

Approximate Bayesian Computation (ABC) techniques provide a means for drawing samples from a (approximate) posterior distribution without evaluating the likelihood function.

computer simulation models simulate the measurements that could be received for a given set of parameters

Potential samples from the posterior distribution are proposed. However, when accepting, rejecting and weighting these samples, rather than calculating the likelihood function, the algorithms compare simulated data with the actual measured data. ABC uses model simulations and compares simulated with

  • bserved

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-4
SLIDE 4

ABC Examples

ABC Samplers

three of the most common forms of ABC samplers are

ABC Rejection Sampling ABC MCMC Sampling ABC Population Monte Carlo Sampling

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-5
SLIDE 5

ABC Examples

ABC Rejection Sampling Algorithm I

Sample θ ∼ π(·)

sample a point θ from the prior π(·)

Simulate x ∼ f (·|θ)

simulate x from the measurement model using the sampled parameters x

Accept θ if ρ(y, x) ≤ ǫ

accept θ if the distance between the actual measurement and the simulated measurement ρ(y, x) is less than or equal to ǫ, the ABC distance tolerance

Repeat the above until we have N draws, θ1, . . . , θN

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-6
SLIDE 6

ABC Examples

The Choice of ǫ

The quality of the approximation increases as ǫ > 0 decreases

using a very small value of ǫ with continuous data is likely lead to a large number of samples being rejected, making the procedure computationally expensive

The choice of ǫ is a trade-off between accuracy of the approximation, and the computational effort.

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-7
SLIDE 7

ABC Examples

Summary Statistic

To reduce the computational expense of the procedure, we use instead statistics to summarise the data, rather than the full data. So S(·) = S1(·), . . . , Sp(·) (low dimensional) ρ(y, x) = S(y) − S(x) The effect of the approximation

Errors from insufficient summaries and ǫ > 0

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-8
SLIDE 8

ABC Examples

ABC Algorithms

Advantages

Simplicity Independent draws Easy to implement

Disadvantages

Acceptance rate too low High rejection rate if the prior and posterior distributions are not similar

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-9
SLIDE 9

ABC Examples

Poisson Distribution Example

Poisson Distribution

True data: λ = 2, n = 1000

Investigate three sets of summary statistics

sample mean (minimial sufficient) sample variance (insufficient, low dimensional)

  • rder statistics (sufficient, high dimensional)

Prior Simulation

T=104 Gamma prior ǫ selected by keeping best 100 simulations

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-10
SLIDE 10

ABC Examples

Gamma Renewal Process Example

N independent stochastic process with Gamma inter-arrival times First observation Y1 = min(X1, ..., XN) where Xi are iid from Gamma(α,β). Take the process that has Y1. Simulate its next time. Y2 is the new minimum out of the N values. Repeat... Intractable Likelihood

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-11
SLIDE 11

ABC Examples

Gamma Renewal Process Example (continued...)

Data: 100 observations with α = 0.3, β = 0.5 and N = 5 Priors: α ∼ G(1.3, 1), β ∼ G(1.5, 1) and N discrete uniform N ∈ {1, 2, . . . , 20} Compare 2 sets of summaries

All data (sufficient but high-dimensional) Sample mean and variance of sqrt of diff of data (insufficient but low dimensional) (why?)

Algorithm Settings:

105 prior simulations Keep the best 1000

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-12
SLIDE 12

ABC Examples

Gamma Renewal Process Example (Results)

−0.5 0.5 1.5 2.5 0.0 0.2 0.4 0.6

density.default(x = alpha[distabc3 < eps3])

N = 1000 Bandwidth = 0.1177 Density 1 2 3 4 0.0 0.1 0.2 0.3 0.4 0.5 0.6

density.default(x = beta[distabc3 < eps3])

N = 1000 Bandwidth = 0.1844 Density N Density 5 10 15 20 0.00 0.01 0.02 0.03 0.04 0.05 0.06

Figure : Summary Stats: All Data

−0.5 0.5 1.5 2.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0

density.default(x = alpha[distabc2 < eps2])

N = 1000 Bandwidth = 0.02889 Density 1 2 3 4 0.0 0.2 0.4 0.6 0.8

density.default(x = beta[distabc2 < eps2])

N = 1000 Bandwidth = 0.1647 Density N Density 5 10 15 20 0.00 0.02 0.04 0.06 0.08

Figure : Summary Stats: sample mean and variance of sqrt of differenced data

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation

slide-13
SLIDE 13

ABC Examples

Closing Remarks

Rejection sampling no good for vague priors Can embed ABC within MCMC (Marjoram et al (2003)) or SMC (Sisson et al 2007, Drovandi+Pettitt 2011) to reduce tolerances Biggest issue is (automatic) selection of summary statistics

Choose best subset out of large collection of summaries (e.g. Nunes+Balding 2010) Use (estimates) of posterior means as summaries (Fearnhead+Prangle 2012) Indirect Inference (Drovandi et al 2011)

Chris Drovandi, Charisse Farr BRAG - Approximate Bayesian Computation