SLIDE 1

Multitree Generalized Steppingstone Sampling – A New MCMC Method for Estimating the Marginal Likelihood of a Model

Mark T. Holder, Paul O. Lewis, David L. Swofford, and David Bryant

KU, UConn, Duke, U. Otago (NZ)

Feb 16, 2013 – Austin, TX

SLIDE 2

Context

  • We rarely know the “true” model to use for analyses.
  • Model-averaging methods can be difficult to implement and use.
  • We can use the marginal likelihood to choose between models.

SLIDE 3

The marginal likelihood is the denominator of Bayes’ Rule

p(θ | D, M) = P(D | θ, M) p(θ | M) / P(D | M)

θ is a set of parameter values in the model M. D is the data. P(D | θ, M) is the likelihood of θ. p(θ | M) is the prior of θ.

SLIDE 4

The marginal likelihood assesses the fit of the model to the data by considering all parameter values

P(D | M) = ∫ P(D | θ, M) p(θ | M) dθ

SLIDE 5

MCMC avoids the marginal likelihood calculation

p(θ∗ | D, M) / p(θ | D, M) = [P(D | θ∗, M) p(θ∗ | M) / P(D | M)] / [P(D | θ, M) p(θ | M) / P(D | M)]

p(θ∗ | D, M) / p(θ | D, M) = P(D | θ∗, M) p(θ∗ | M) / [P(D | θ, M) p(θ | M)]
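To make the cancellation concrete, here is a minimal sketch (ours, not from the talk) of a Metropolis–Hastings step: the acceptance ratio only ever touches likelihood × prior, so P(D | M) never has to be computed. The log_likelihood and log_prior callables are hypothetical stand-ins for a real model.

```python
import math
import random

def mh_step(theta, log_likelihood, log_prior, proposal_sd=0.1):
    """One Metropolis-Hastings update with a symmetric normal proposal.

    The acceptance ratio is
        [P(D|theta*) p(theta*)] / [P(D|theta) p(theta)]
    because the marginal likelihood P(D|M) appears in both the numerator
    and the denominator of the posterior ratio and cancels.
    """
    theta_star = theta + random.gauss(0.0, proposal_sd)
    log_ratio = (log_likelihood(theta_star) + log_prior(theta_star)
                 - log_likelihood(theta) - log_prior(theta))
    if math.log(random.random()) < log_ratio:
        return theta_star  # accept the proposal
    return theta           # reject: keep the current state
```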

SLIDE 6

Bayesian model selection

Bayes Factor between two models: B10 = P(D | M1) / P(D | M0). We could estimate P(D | M1) by drawing points from the prior on θ and calculating the mean likelihood.
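A sketch of that naive estimator on a toy model of our own choosing (θ ~ Normal(0, 1) prior, each datum ~ Normal(θ, 1)); nothing here is specific to the talk’s software.

```python
import math
import random

def prior_mean_estimate(data, n_draws=100_000):
    """Estimate P(D|M) = E_prior[P(D|theta)] by averaging likelihoods over
    draws from the prior (toy model: theta ~ N(0,1), x_i ~ N(theta,1))."""
    def log_lik(theta):
        return sum(-0.5 * math.log(2.0 * math.pi) - 0.5 * (x - theta) ** 2
                   for x in data)
    total = 0.0
    for _ in range(n_draws):
        theta = random.gauss(0.0, 1.0)   # a draw from the prior
        total += math.exp(log_lik(theta))
    return total / n_draws
```

The next two slides show why this breaks down in phylogenetics: nearly every prior draw lands where the likelihood is negligible.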

SLIDE 7

[Figure: sharp posterior (black) and prior (red); density vs. x]

SLIDE 8

Drawing from the prior will often miss all of the trees with high posterior density – massive sample sizes would be needed. Perhaps we can use samples from the posterior?

SLIDE 9

We can use the harmonic mean of the likelihoods of MCMC samples to estimate P(D|M1). However…

SLIDE 10

“The Harmonic Mean of the Likelihood: Worst Monte Carlo Method Ever”

A post on Dr. Radford Neal’s blog

http://radfordneal.wordpress.com/2008/08/17/the-harmonic-mean-of-the-likelihood-worst-monte-carlo-method-ever

“The total unsuitability of the harmonic mean estimator should have been apparent within an hour of its discovery.”

SLIDE 11

Harmonic mean estimator of the marginal likelihood

  • appealing because it comes “for free” after we have sampled the posterior using MCMC,
  • unfortunately the estimator can have a huge variance associated with it in some (very common) cases, for example if:
  • the vast majority of parameter space has very low likelihood, and
  • a very small region has high likelihoods.

SLIDE 12

Importance sampling to approximate a difficult target distribution

1. Simulate points from an easy distribution.
2. Reweight the points by the ratio of densities between the easy and target distributions.
3. Treat the reweighted samples as draws from the target distribution.
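A sketch of the three steps for a one-dimensional example of our own (target N(0, 0.5²), importance N(0, 1)); the self-normalized weights make the weighted sample usable for target expectations.

```python
import math
import random

def normal_pdf(x, sd):
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))

def importance_sample(target_pdf, importance_pdf, draw, n=10_000):
    """Steps 1-3: simulate from the easy distribution, reweight each point
    by the density ratio, and return points with normalized weights."""
    xs = [draw() for _ in range(n)]
    ws = [target_pdf(x) / importance_pdf(x) for x in xs]
    total = sum(ws)
    return xs, [w / total for w in ws]

# Example: target is N(0, 0.5^2); importance distribution is N(0, 1).
xs, ws = importance_sample(lambda x: normal_pdf(x, 0.5),
                           lambda x: normal_pdf(x, 1.0),
                           lambda: random.gauss(0.0, 1.0))
# A weighted average approximates a target expectation, e.g. E[x^2] ~ 0.25.
print(sum(w * x * x for x, w in zip(xs, ws)))
```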

SLIDE 13

[Figure: density plot, x vs. density]

SLIDE 14

[Figure: importance and target densities; importance weights]

SLIDE 15

[Figure: importance and target densities; importance weights; samples from importance distribution; weighted samples]

SLIDE 16

Importance sampling

The method works well if the importance distribution is:

  • fairly similar to the target distribution, and
  • not so “tight” that it prevents sampling the full range of the target distribution.

SLIDE 17

In phylogenetics our posterior distribution is too peaked and our prior is too vague to allow us to use them in importance sampling:

[Figure: sharp posterior (black) and prior (red); density]

SLIDE 18

Steppingstone sampling uses a series of importance sampling runs

[Figure: steppingstone densities, x vs. density]

SLIDE 19

Steppingstone sampling (Xie et al. 2011, Fan et al. 2011) blends two distributions:

  • the posterior, proportional to P(D | θ, M1) p(θ | M1), and
  • a tractable reference distribution, π(θ):

pβ(θ | D, M1) = [P(D | θ, M1) p(θ | M1)]^β [π(θ)]^(1−β) / cβ

p1(θ | D, M1) is the posterior. c1 is the marginal likelihood of the model. p0(θ | D, M1) is the reference distribution. c0 is 1.
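In code, the blended density is just a log-linear interpolation between the reference and the unnormalized posterior; a minimal sketch with hypothetical log-density callables:

```python
def log_p_beta(theta, beta, log_lik, log_prior, log_ref):
    """Unnormalized log density of the steppingstone blend
    [P(D|theta) p(theta|M1)]^beta * [pi(theta)]^(1-beta).

    beta = 1: the (unnormalized) posterior, with constant c1 = P(D|M1).
    beta = 0: the reference distribution, with constant c0 = 1.
    """
    return (beta * (log_lik(theta) + log_prior(theta))
            + (1.0 - beta) * log_ref(theta))
```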

SLIDE 20

Steppingstone sampling (Xie et al. 2011, Fan et al. 2011) blends two distributions:

  • the posterior, proportional to P(D | θ, M1) p(θ | M1), and
  • a tractable reference distribution, π(θ):

pβ(θ | D, M1) = [P(D | θ, M1) p(θ | M1)]^β [π(θ)]^(1−β) / cβ

P(D | M1) = c1 / c0 = (c1 / c0.38) × (c0.38 / c0.1) × (c0.1 / c0.01) × (c0.01 / c0)

Each intermediate normalizing constant appears once in a numerator and once in a denominator, so the product telescopes; each adjacent ratio can be estimated by importance sampling.
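A sketch (our simplification; the published method includes refinements we omit) of how each ratio in the telescoping product is estimated: run MCMC at βk, average importance weights toward βk+1, and sum the log ratios.

```python
import math

def log_step_ratio(samples_k, beta_k, beta_next, log_lik, log_prior, log_ref):
    """Estimate log(c_{beta_next} / c_{beta_k}) from MCMC samples drawn at
    p_{beta_k}. Each sample's importance weight is
    (likelihood * prior / reference) ** (beta_next - beta_k)."""
    d = beta_next - beta_k
    log_ws = [d * (log_lik(t) + log_prior(t) - log_ref(t)) for t in samples_k]
    m = max(log_ws)  # log-sum-exp for numerical stability
    return m + math.log(sum(math.exp(w - m) for w in log_ws) / len(log_ws))

def log_marginal_likelihood(samples_by_beta, betas, log_lik, log_prior, log_ref):
    """Telescoping sum: log P(D|M) = sum of adjacent log ratios.
    betas ascend from 0.0 to 1.0; samples are needed at every beta
    except the last."""
    return sum(log_step_ratio(samples_by_beta[b0], b0, b1,
                              log_lik, log_prior, log_ref)
               for b0, b1 in zip(betas, betas[1:]))
```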

SLIDE 21

Run MCMC with different β values

[Figure: steppingstone densities, x vs. density]

SLIDE 22

P(D | M) = (P(D | M) / c0.38) × (c0.38 / c0.1) × (c0.1 / c0.01) × (c0.01 / 1)

Photo by Johan Nobel, http://www.flickr.com/photos/43147325@N08/4326713557/, downloaded from Wikimedia.

SLIDE 23

Reference distributions in Steppingstone sampling

In the original steppingstone (Xie et al. 2011) the reference is the prior. In generalized steppingstone (Fan et al. 2011) it can be any distribution that:

  • is centered around the areas with high probability,
  • is a probability distribution with a known normalizing constant, and
  • is easy to draw samples from.
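One simple way to build such a reference (a sketch of the general idea; Fan et al. 2011 use parameter-appropriate families rather than the plain normals assumed here) is to fit an independent distribution to each parameter from a pilot run:

```python
import math
import random
import statistics

class IndependentNormalReference:
    """Product-of-independent-normals reference fit to pilot MCMC samples:
    centered on high-probability regions, with a known normalizing
    constant, and easy to sample from."""

    def __init__(self, pilot_samples):
        # pilot_samples: list of parameter vectors from a pilot posterior run
        columns = list(zip(*pilot_samples))
        self.means = [statistics.mean(c) for c in columns]
        self.sds = [statistics.stdev(c) for c in columns]

    def log_pdf(self, theta):
        return sum(-0.5 * math.log(2.0 * math.pi * s * s)
                   - 0.5 * ((x - m) / s) ** 2
                   for x, m, s in zip(theta, self.means, self.sds))

    def draw(self):
        return [random.gauss(m, s) for m, s in zip(self.means, self.sds)]
```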

SLIDE 24

The log Bayes Factor for a complex model compared to a simple (true) model, estimated twice (by Fan et al., 2011) with two different seeds. Harmonic mean vs. original steppingstone:

[Figure: paired estimates (log BF, −60 to 20) for Seed 1 vs. Seed 2]

SLIDE 25

Original steppingstone vs. generalized steppingstone:

[Figure: paired estimates (log BF, −60 to 20) for Seed 1 vs. Seed 2]

Figure from Fan et al., 2011

SLIDE 26

Steppingstone sampling when the tree is not known

  • generalized steppingstone assumes a fixed tree,
  • the original steppingstone can be very slow when the tree is not known.

SLIDE 27

Tree-Centered Independent-Split-Probability (TCISP) distribution

Input: a tree with probabilities for each split (from a pilot MCMC run).
Output: a probability distribution over all tree topologies.

SLIDE 28

[Figure: a focal tree on taxa A–L with split probabilities 0.9, 0.8, 0.6, 0.5, 0.4, 0.8, 0.3, 0.9]

Input: a focal tree with split probabilities to center the distribution

SLIDE 29

[Figure: focal tree with splits marked blue or red] The tree we will draw will display the blue splits and will not display the red splits.

SLIDE 30

[Figure: tree on taxa A–L]

SLIDE 31

[Figure: tree on taxa A–L] One of the many resolutions which avoid the red splits.

SLIDE 32

[Figure: two trees on taxa A–L, side by side]

SLIDE 33

Calculating a tree’s probability

[Figure: focal tree (split probabilities 0.9, 0.8, 0.6, 0.5, 0.4, 0.8, 0.3, 0.9) and the tree to score]

SLIDE 34

Calculating a tree’s probability

[Figure: focal tree and the tree to score]

SLIDE 35

Calculating a tree’s probability

[Figure: focal tree and the tree to score, with per-split factors 0.9, 0.2, 0.4, 0.5, 0.6, 0.2, 0.7, 0.9]

SLIDE 36

Calculating a tree’s probability

[Figure: focal tree and the tree with shared splits, with per-split factors 0.9, 0.2, 0.4, 0.5, 0.6, 0.2, 0.7, 0.9]

P = 0.2 × 0.9 × 0.4 × 0.5 × 0.6 × 0.2 × 0.7 × 0.9
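A sketch of the product just illustrated, with our own data structures and a deliberate simplification: each focal split contributes p if the scored tree displays it and 1 − p if not; the full TCISP distribution additionally divides the 1 − p mass among the alternative resolutions (which is why the tree-counting algorithm on the next slide matters). The split names s1–s8 are hypothetical labels for the slide’s splits.

```python
import math

def tcisp_log_score(focal_split_probs, displayed_splits):
    """Simplified TCISP-style log probability for a candidate tree.

    focal_split_probs: dict mapping each split of the focal tree (e.g. a
        frozenset of taxon names) to its pilot-run probability.
    displayed_splits: set of splits displayed by the tree being scored.
    """
    return sum(math.log(p if split in displayed_splits else 1.0 - p)
               for split, p in focal_split_probs.items())

# Hypothetical splits carrying the slide's probabilities; displaying
# s1, s4, and s8 reproduces the slide's product under our simplification.
probs = {"s1": 0.9, "s2": 0.8, "s3": 0.6, "s4": 0.5,
         "s5": 0.4, "s6": 0.8, "s7": 0.3, "s8": 0.9}
print(math.exp(tcisp_log_score(probs, {"s1", "s4", "s8"})))
```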

SLIDE 37

Counting trees at the max. distance from a tree

  • Bryant and Steel (2009) provide an O(n⁵) algorithm.
  • Bryant contributed an O(n²) algorithm.

SLIDE 38

Multitree steppingstone sampling

  • Has been validated on small data sets.
  • Minimum running time is unknown.
  • Appears to be a practical way to approximate a model’s marginal likelihood.
  • The TCISP distribution could be useful in other contexts.
  • The method assumes unrooted, fully resolved trees.
  • Implemented in Phycas (http://www.phycas.org).

Holder, Lewis, Swofford, Bryant (in prep)

SLIDE 39

Marginal likelihoods

  • The harmonic mean estimator is not reliable.
  • AIC is preferable to the harmonic mean estimator (Baele et al. MBE 2012).
  • Thermodynamic integration (aka path sampling) is an accurate method (Lartillot and Philippe, 2006; Rodrigue and Aris-Brosou).
  • Model-jumping approaches avoid the need to calculate marginal likelihoods.
  • Steppingstone and/or path sampling are available in MrBayes, BEAST, *BEAST, Migrate, and other Bayesian software for evolutionary analyses.

SLIDE 40

Thanks!

Thanks to the organizers, NSF, and to you (for listening to stats on a Saturday morning)

SLIDE 41

This slide shows the math demonstrating that we can use the harmonic mean to estimate the marginal likelihood, P(D). Consider a model that has a discrete parameter v that can take two values (r and b), and let h be the harmonic mean of the likelihoods for parameter values sampled from the posterior distribution:

h = 1 / [ P(v = r | D) × (1 / P(D | v = r)) + P(v = b | D) × (1 / P(D | v = b)) ]

Substituting P(v | D) = P(v) P(D | v) / P(D):

h = 1 / [ (P(v = r) P(D | v = r) / P(D)) × (1 / P(D | v = r)) + (P(v = b) P(D | v = b) / P(D)) × (1 / P(D | v = b)) ]

= 1 / [ P(v = r) / P(D) + P(v = b) / P(D) ]

= P(D) / [ P(v = r) + P(v = b) ] = P(D)

SLIDE 42

The harmonic mean as an importance sampling estimator

The marginal likelihood is the expected value of the likelihood over points drawn from the prior:

P(D) = Ep(θ)[P(D|θ)] = ∫ P(D|θ) P(θ) dθ

In importance sampling, we draw parameter values θi from a different density, g(θ), but multiply the points by a weight, wi:

Ep(θ)[P(D|θ)] ≈ (1/n) Σ_{i=1}^{n} P(D|θi) wi / b

where b is a normalization constant.

SLIDE 43

The appropriate weight is the ratio of the target density to our importance density:

wi = P(θi) / g(θi)

And it turns out that the normalization constant is simply the mean of the importance weights:

b = (1/n) Σ_{i=1}^{n} P(θi) / g(θi)

Ep(θ)[P(D|θ)] ≈ [ (1/n) Σ_{i=1}^{n} P(D|θi) P(θi) / g(θi) ] / [ (1/n) Σ_{i=1}^{n} P(θi) / g(θi) ]

SLIDE 44

Ep(θ)[P(D|θ)] ≈ [ (1/n) Σ_{i=1}^{n} P(D|θi) P(θi) / g(θi) ] / [ (1/n) Σ_{i=1}^{n} P(θi) / g(θi) ]

If the importance distribution, g(θ), is the prior, then the importance weights are all 1:

Ep(θ)[P(D|θ)] ≈ [ (1/n) Σ_{i=1}^{n} P(D|θi) P(θi) / P(θi) ] / [ (1/n) Σ_{i=1}^{n} P(θi) / P(θi) ]

= (1/n) Σ_{i=1}^{n} P(D|θi)

Recall that the points θi are drawn by sampling from the importance distribution, so this sum of likelihoods is already weighted by the prior.

SLIDE 45

Ep(θ)[P(D|θ)] ≈ [ (1/n) Σ_{i=1}^{n} P(D|θi) P(θi) / g(θi) ] / [ (1/n) Σ_{i=1}^{n} P(θi) / g(θi) ]

If the importance distribution, g(θ), is the posterior, then:

Ep(θ)[P(D|θ)] ≈ [ (1/n) Σ_{i=1}^{n} P(D|θi) P(θi) / (P(D|θi) P(θi)) ] / [ (1/n) Σ_{i=1}^{n} P(θi) / (P(D|θi) P(θi)) ]

= 1 / [ (1/n) Σ_{i=1}^{n} 1 / P(D|θi) ]

= n / Σ_{i=1}^{n} (1 / P(D|θi))

This is the justification of the harmonic mean estimator of the marginal likelihood.
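A toy check of this estimator (our construction) on a conjugate normal model where ln P(D) is known exactly; even here, occasional low-likelihood posterior draws dominate the sum of 1/P(D|θi), making the estimate unstable.

```python
import math
import random

def log_harmonic_mean(log_liks):
    """log of n / sum_i 1/L_i, computed in log space for stability."""
    neg = [-ll for ll in log_liks]
    m = max(neg)
    log_sum_inv = m + math.log(sum(math.exp(v - m) for v in neg))
    return math.log(len(log_liks)) - log_sum_inv

# Toy conjugate model: theta ~ N(0,1), one datum x | theta ~ N(theta,1).
x = 2.0
# The posterior is N(x/2, 1/2); sample it directly in place of MCMC.
thetas = [random.gauss(x / 2.0, math.sqrt(0.5)) for _ in range(100_000)]
log_liks = [-0.5 * math.log(2.0 * math.pi) - 0.5 * (x - t) ** 2 for t in thetas]
print("harmonic mean estimate of ln P(D):", log_harmonic_mean(log_liks))
# Exact answer: marginally x ~ N(0, 2).
print("true ln P(D):", -0.5 * math.log(4.0 * math.pi) - x * x / 4.0)
```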

SLIDE 46

Lartillot and Philippe’s thermodynamic integration

Like the steppingstone sampler, the thermodynamic integration method (or path sampling) uses power posterior densities with 0 ≤ β ≤ 1:

pβ(θ|D) = P(D|θ)^β P(θ) / cβ

with c0 = 1 and c1 = P(D). Lartillot and Philippe showed that

∂ ln cβ / ∂β = Epβ[ ∂ ln(P(D|θ)^β P(θ)) / ∂β ]

Note that by the definition of a definite integral:

∫₀¹ (∂ ln cβ / ∂β) dβ = ln c1 − ln c0 = ln P(D)

SLIDE 47

Thus,

ln P(D) = ∫₀¹ Epβ[ ∂ ln(P(D|θ)^β P(θ)) / ∂β ] dβ

By differentiating we see that:

∂ ln(P(D|θ)^β P(θ)) / ∂β = ∂[ln P(θ) + β ln P(D|θ)] / ∂β = ln P(D|θ)

The integration is not analytically tractable, but we can calculate Epβ[ln P(D|θ)] by conducting MCMC at the power posterior distribution pβ and taking an average of the log-likelihoods. Then we can use standard numerical integration techniques to estimate the integral.
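A sketch of that final numerical step: given the average log-likelihood from an MCMC run at each β on a grid, the trapezoid rule (one standard choice among the numerical integration techniques mentioned) approximates the integral.

```python
def log_marginal_by_path_sampling(betas, mean_log_liks):
    """Trapezoid-rule estimate of
        ln P(D) = integral from 0 to 1 of E_{p_beta}[ln P(D|theta)] d(beta).

    betas: increasing grid of beta values from 0.0 to 1.0.
    mean_log_liks: the average ln P(D|theta) over the MCMC samples
        drawn at each corresponding beta.
    """
    return sum(0.5 * (y0 + y1) * (b1 - b0)
               for b0, b1, y0, y1 in zip(betas, betas[1:],
                                         mean_log_liks, mean_log_liks[1:]))
```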
