Evidence estimation for Markov random fields: a triply intractable problem - PowerPoint PPT Presentation



SLIDE 1

Evidence estimation for Markov random fields: a triply intractable problem

Richard Everitt

University of Reading

January 7th, 2014

SLIDE 2

Interacting objects

Markov random fields (MRFs) are used for modelling (often large numbers of) interacting objects, usually with symmetric interactions.

Used widely in statistics, physics and computer science, e.g.

image analysis; ferromagnetism; geostatistics; point processes; social networks.

SLIDE 3

Image analysis

The log expression of 72 genes on a particular chromosome over 46 hours (from Friel et al. 2009).

SLIDE 4

Pairwise Markov random fields

SLIDE 5

Intractable normalising constants

Pairwise MRFs correspond to the factorisation

    f(Y|θ) ∝ γ(Y|θ) = ∏_{(i,j)∈Nei(Y)} φ(Y_i, Y_j|θ).

We also need to specify the normalising constant

    Z(θ) = ∫_Y ∏_{(i,j)∈Nei(Y)} φ(Y_i, Y_j|θ) dY.

In general we are interested in models that take the form

    f(Y|θ) = γ(Y|θ) / Z(θ).
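To make the intractability concrete, here is a small sketch (not from the slides) for a hypothetical binary pairwise MRF with Ising-type potentials φ(Y_i, Y_j|θ) = exp(θ·1[Y_i = Y_j]): Z(θ) is a sum over all 2^n configurations, so brute force works only for toy sizes.

```python
import itertools
import math

def gamma_ising(y, theta, edges):
    # Unnormalised density gamma(y|theta) = exp(theta * #{(i,j) in edges : y_i = y_j})
    same = sum(1 for i, j in edges if y[i] == y[j])
    return math.exp(theta * same)

def z_brute_force(n, theta, edges):
    # Z(theta) by summing gamma over all 2^n binary configurations.
    # Feasible only for tiny n, which is exactly why Z(theta) is
    # intractable for realistic MRFs.
    return sum(gamma_ising(y, theta, edges)
               for y in itertools.product([0, 1], repeat=n))

# 2x2 grid: 4 nodes, 4 nearest-neighbour edges (a hypothetical toy graph)
edges = [(0, 1), (2, 3), (0, 2), (1, 3)]
Z = z_brute_force(4, 0.5, edges)
f = gamma_ising((0, 0, 0, 0), 0.5, edges) / Z  # f(y|theta) for one configuration
```

For a 100x100 image the sum would have 2^10000 terms, which is the intractability the following slides address.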

SLIDE 6

Doubly intractable

Suppose we want to estimate parameters θ after observing Y = y. Use Bayesian inference to find p(θ|y) ∝ p(y|θ)p(θ). Could use MCMC, but the acceptance probability in MH is min

  • 1, q(θ|θ ∗)

q(θ ∗|θ) p(θ ∗) p(θ) γ(y|θ ∗) γ(y|θ) 1 Z(θ ∗) Z(θ) 1

  • .


SLIDE 8

ABC-MCMC

Approximate an intractable likelihood at θ with

    (1/R) Σ_{r=1}^{R} π_ε(S(x_r)|S(y)),

where the x_r ~ f(·|θ) are R simulations from f (originally in Ratmann et al. (2009)). Often R = 1 and π_ε(·|S(y)) = U(·|(S(y)−ε, S(y)+ε)). This is essentially a nonparametric kernel estimator of the conditional distribution of the summary statistics given θ, based on simulations from f. ABC-MCMC is an MCMC algorithm that targets this approximate posterior.
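The estimate above can be sketched in a few lines of Python (not from the slides; the simulator, summary statistic and Gaussian toy model are hypothetical stand-ins). With the uniform kernel, each simulation whose summary lands within ε of S(y) contributes density 1/(2ε).

```python
import random

def abc_likelihood(theta, s_obs, simulate, summary, R=100, eps=0.5):
    # Monte Carlo ABC likelihood estimate: (1/R) * sum_r pi_eps(S(x_r)|S(y)),
    # with the uniform kernel U(S(y)-eps, S(y)+eps), which has density 1/(2*eps).
    total = 0.0
    for _ in range(R):
        x = simulate(theta)
        if abs(summary(x) - s_obs) < eps:
            total += 1.0 / (2.0 * eps)
    return total / R

# Hypothetical illustration: 20 N(theta, 1) draws, summarised by the sample mean.
def simulate(theta):
    return [random.gauss(theta, 1.0) for _ in range(20)]

def summary(x):
    return sum(x) / len(x)

est = abc_likelihood(0.0, s_obs=0.1, simulate=simulate, summary=summary)
```

ABC-MCMC simply plugs an estimate like `est` into the MH acceptance ratio in place of the intractable likelihood.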


SLIDE 10

ABC on ERGMs

"True" ABC

SLIDE 11

Synthetic likelihood

An alternative approximation proposed in Wood (2010). Again take R simulations from f, x_r ~ f(·|θ), and take the summary statistics of each. But instead use a multivariate normal approximation to the distribution of the summary statistics given θ:

    L(S(y)|θ) = N(S(y) | μ_θ, Σ_θ),

where

    μ_θ = (1/R) Σ_{r=1}^{R} S(x_r),    Σ_θ = s sᵀ / (R − 1),

with s = (S(x_1) − μ_θ, ..., S(x_R) − μ_θ).
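A minimal sketch of this construction (not from the slides; the simulator and summaries are hypothetical stand-ins), fitting N(μ_θ, Σ_θ) to simulated summaries and evaluating its log-density at the observed summary:

```python
import numpy as np

def synthetic_loglik(s_obs, theta, simulate, summary, R=200, rng=None):
    # Fit a Gaussian N(mu_theta, Sigma_theta) to R simulated summary-statistic
    # vectors and evaluate its log-density at the observed summary S(y).
    rng = np.random.default_rng() if rng is None else rng
    S = np.array([summary(simulate(theta, rng)) for _ in range(R)])  # R x d
    mu = S.mean(axis=0)
    dev = S - mu
    Sigma = dev.T @ dev / (R - 1)          # s s^T / (R - 1), as on the slide
    resid = np.asarray(s_obs) - mu
    _, logdet = np.linalg.slogdet(Sigma)
    quad = resid @ np.linalg.solve(Sigma, resid)
    d = len(resid)
    return -0.5 * (d * np.log(2 * np.pi) + logdet + quad)

# Hypothetical illustration: 30 N(theta, 1) draws,
# summarised by (sample mean, sample s.d.).
def simulate(theta, rng):
    return rng.normal(theta, 1.0, size=30)

def summary(x):
    return (x.mean(), x.std())
```

The Gaussian fit replaces the kernel density of ABC, so no tolerance ε is needed, at the cost of the normality assumption discussed later.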

SLIDE 12

The single auxiliary variable method

Møller et al. (2006) augment the target distribution with an extra variable u and use

    p(θ, u|y) ∝ q_u(u|θ, y) f(y|θ) p(θ),

where q_u is some (normalised) arbitrary distribution and u is on the same space as y. As the MH proposal in (θ, u)-space they use (θ*, u*) ~ f(u*|θ*) q(θ*|θ). This gives an acceptance probability of

    min{ 1, [q(θ|θ*) / q(θ*|θ)] · [p(θ*) / p(θ)] · [γ(y|θ*) / γ(y|θ)] · [q_u(u*|θ*, y) / γ(u*|θ*)] · [γ(u|θ) / q_u(u|θ, y)] }.

SLIDE 13

Exact approximations

Note that

    q_u(u*|θ*, y) / γ(u*|θ*)

is an unbiased importance sampling estimator of 1/Z(θ*). Replacing the intractable quantity with an unbiased estimate still targets the correct distribution! This was first seen in the pseudo-marginal methods of Beaumont (2003) and Andrieu and Roberts (2009).

This relies on being able to simulate exactly from f(·|θ*), which is usually not possible or is computationally expensive. Girolami et al. (2013) introduce an approach that does not require exact simulation ("Russian Roulette").


SLIDE 16

Estimating the marginal likelihood

The marginal likelihood (also known as the evidence) is

    p(y) = ∫_θ p(θ) f(y|θ) dθ.

Used in Bayesian model comparison, p(M|y) ∝ p(M) p(y|M), most commonly seen in the Bayes factor p(y|M1) / p(y|M2) for comparing models. All commonly used methods require f(y|θ) to be tractable in θ, and the evidence usually can't be estimated from MCMC output:

"a triply intractable problem" - Friel (2013).

SLIDE 17

Using importance sampling (IS)

Importance sampling returns a weighted sample {(θ(p), w(p)) | 1 ≤ p ≤ P} from p(θ|y):

For p = 1 : P
    Simulate θ(p) ~ q(·)
    Weight w(p) = p(θ(p)) f(y|θ(p)) / q(θ(p)).

Then p̂(y) = (1/P) Σ_{p=1}^{P} w(p).
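The estimator above can be checked on a tractable toy problem (a hypothetical example, not from the slides): y|θ ~ N(θ, 1) with prior θ ~ N(0, 1), whose exact evidence is p(y) = N(y | 0, variance 2).

```python
import math
import random

def is_evidence(y, P=50000, rng=None):
    # Importance-sampling evidence estimate p_hat(y) = (1/P) * sum_p w(p)
    # for the conjugate toy model y|theta ~ N(theta,1), theta ~ N(0,1),
    # using the (hypothetical) proposal q = N(0, sd=2).
    rng = rng or random.Random()

    def npdf(x, m, sd):
        return math.exp(-0.5 * ((x - m) / sd) ** 2) / (sd * math.sqrt(2 * math.pi))

    total = 0.0
    for _ in range(P):
        theta = rng.gauss(0.0, 2.0)  # theta(p) ~ q
        w = npdf(theta, 0.0, 1.0) * npdf(y, theta, 1.0) / npdf(theta, 0.0, 2.0)
        total += w
    return total / P

# Exact answer for comparison: p(y) = N(y | 0, variance 2) = exp(-y^2/4)/sqrt(4*pi)
p_hat = is_evidence(0.7)
```

For the doubly intractable models of interest, f(y|θ) in the weight cannot be evaluated, which is what the following slides work around.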

SLIDE 18

Using ABC-IS

Didelot, Everitt, Johansen and Lawson (2011) investigate the use of the ABC approximation when using IS for marginal likelihoods. The weights are

    w(p) = p(θ(p)) · [ (1/R) Σ_{r=1}^{R} π_ε(S(x_r^{(p)})|S(y)) ] / q(θ(p)),

where x_r^{(p)} ~ f(·|θ(p)), r = 1, ..., R.

Note that this method estimates p(S(y)), not p(y). Didelot et al. (2011), Grelaud et al. (2009), Robert et al. (2011) and Marin et al. (2014) discuss the choice of summary statistics.

SLIDE 19

Exponential family models

Didelot et al. (2011): when comparing two exponential family models, if

    S1(y) is sufficient for the parameters in model 1, and
    S2(y) is sufficient for the parameters in model 2,

then using the vector S(y) = (S1(y), S2(y)) for both models gives

    p(y|M1) / p(y|M2) = p(S(y)|M1) / p(S(y)|M2).

Marin et al. (2014) give much more general guidance.

SLIDE 20

Synthetic likelihood IS

We could also use the SL approximation within IS. The weight is then

    w(p) = p(θ(p)) N(S(y) | μ_θ, Σ_θ) / q(θ(p)),

where μ_θ, Σ_θ are based on x_r^{(p)} ~ f(·|θ(p)), r = 1, ..., R.

This does not require choosing ε, but relies on the normality assumption.

SLIDE 21

Exact methods?

Importance sampling:

    p(y) = ∫_θ [f(y|θ) p(θ) / q(θ)] q(θ) dθ
         ≈ (1/P) Σ_{p=1}^{P} f(y|θ(p)) p(θ(p)) / q(θ(p))
         = (1/P) Σ_{p=1}^{P} [γ(y|θ(p)) p(θ(p)) / q(θ(p))] · [1 / Z(θ(p))].

Intractable...


SLIDE 23

SAV importance sampling

Consider the SAV target p(θ, u|y) ∝ q_u(u|θ, y) f(y|θ) p(θ), noting that it has the same marginal likelihood as p(θ|y). Suppose we do importance sampling on this SAV target, and choose the proposal to be q(θ, u) = f(u|θ) q(θ). We obtain

    p̂(y) = (1/P) Σ_{p=1}^{P} [q_u(u(p)|θ(p), y) γ(y|θ(p)) p(θ(p)) / (γ(u(p)|θ(p)) q(θ(p)))] · [Z(θ(p)) / Z(θ(p))]
          = (1/P) Σ_{p=1}^{P} [γ(y|θ(p)) p(θ(p)) / q(θ(p))] · [q_u(u(p)|θ(p), y) / γ(u(p)|θ(p))],

in which the intractable normalising constants cancel.


SLIDE 25

Exact approximations revisited

Using unbiased weight estimates within importance sampling: (IS)² (Tran et al., 2013); random weight particle filters (Fearnhead et al. 2010); (SMC)² (Chopin et al. 2011).

For each θ, we could use multiple u variables and use the estimate

    1/Ẑ(θ) = (1/M) Σ_{m=1}^{M} q_u(u(m)|θ, y) / γ(u(m)|θ).

For u the proposal is pre-determined, but we need to choose q_u(u|θ, y). Møller et al. (2006): one possible choice is q_u(u|θ, y) = γ(u|θ̃)/Z(θ̃), where θ̃ is an ML estimate (or some other appropriate estimate) of θ.
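With this choice of q_u, each term becomes γ(u(m)|θ̃)/γ(u(m)|θ), whose average over u(m) ~ f(·|θ) unbiasedly estimates Z(θ̃)/Z(θ). A sketch (not from the slides) on a hypothetical tiny Ising-type MRF, where enumeration stands in for perfect simulation and also gives the exact answer for comparison:

```python
import itertools
import math
import random

def gamma_ising(y, theta, edges):
    # Unnormalised density gamma(y|theta) = exp(theta * #{(i,j): y_i = y_j})
    return math.exp(theta * sum(1 for i, j in edges if y[i] == y[j]))

def exact_sampler(theta, edges, n, rng):
    # Exact draw from f(.|theta) by enumerating all 2^n configurations.
    # Only feasible for tiny models; stands in for exact/perfect simulation.
    configs = list(itertools.product([0, 1], repeat=n))
    weights = [gamma_ising(y, theta, edges) for y in configs]
    return rng.choices(configs, weights=weights)[0]

def z_ratio(theta, theta_tilde, edges, n, M, rng):
    # (1/M) * sum_m gamma(u_m|theta_tilde) / gamma(u_m|theta), u_m ~ f(.|theta):
    # an unbiased estimate of Z(theta_tilde)/Z(theta), i.e. of Z(theta_tilde) * (1/Z(theta)).
    total = 0.0
    for _ in range(M):
        u = exact_sampler(theta, edges, n, rng)
        total += gamma_ising(u, theta_tilde, edges) / gamma_ising(u, theta, edges)
    return total / M
```

Dividing this estimate by Z(θ̃), when it is known or estimated offline, gives the estimate of 1/Z(θ) used in the weights.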


SLIDE 28

SAVIS / MAVIS

Using the suggested q_u gives the following importance sampling estimate of 1/Z(θ):

    1/Ẑ(θ) = [1/Z(θ̃)] · (1/M) Σ_{m=1}^{M} γ(u(m)|θ̃) / γ(u(m)|θ).

Or, using annealed importance sampling (Neal, 2001) with the sequence of targets

    f_k(·|θ, θ̃, y) ∝ γ_k(·|θ, θ̃) = γ(·|θ)^{(K+1−k)/(K+1)} γ(·|θ̃)^{k/(K+1)},

we obtain

    1/Ẑ(θ) = [1/Z(θ̃)] · (1/M) Σ_{m=1}^{M} ∏_{k=0}^{K} γ_{k+1}(u_k^{(m)}|θ, θ̃, y) / γ_k(u_k^{(m)}|θ, θ̃, y).


SLIDE 30

Non-exact approximations...

MAVIS is exact only if:

    exact sampling from f(·|θ) is possible (this also applies to ABC and synthetic likelihood);
    1/Z(θ̃) is known.

In practice:

    use MCMC to simulate from f(·|θ);
    estimate 1/Z(θ̃) "offline" in advance of running the IS.

In the context of MCMC, one can show that these approximations do not introduce large errors; see the MCWM approach in Andrieu and Roberts (2009) (also Everitt (2012)), and Nial Friel's talk tomorrow ("Monte Carlo methods in network analysis" session).


SLIDE 33

Toy example: Poisson vs geometric

Consider i.i.d. observations {y_i}_{i=1}^{n} of a discrete random variable that takes values in ℕ. We find the Bayes factor for the models:

1. Y|θ ~ Poisson(θ), θ ~ Exp(1):

    f_1({y_i}_{i=1}^{n} | θ) = ∏_i θ^{y_i} exp(−θ) / y_i! = exp(−nθ) ∏_i θ^{y_i} / y_i!

2. Y|θ ~ Geometric(θ), θ ~ Unif(0, 1):

    f_2({y_i}_{i=1}^{n} | θ) = ∏_i θ (1−θ)^{y_i} = θ^n ∏_i (1−θ)^{y_i}.
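In this toy example both evidences are available in closed form (via conjugacy, with s = Σ y_i): p_1(y) = Γ(s+1)/(n+1)^{s+1} · ∏ 1/y_i! and p_2(y) = B(n+1, s+1). A sketch (not from the slides) against which the Monte Carlo estimators can be benchmarked:

```python
import math

def log_evidence_poisson(y):
    # Poisson(theta) likelihood, Exp(1) prior:
    # p1(y) = Gamma(s+1)/(n+1)^(s+1) * prod_i 1/y_i!,  s = sum(y)
    n, s = len(y), sum(y)
    return (math.lgamma(s + 1) - (s + 1) * math.log(n + 1)
            - sum(math.lgamma(yi + 1) for yi in y))

def log_evidence_geometric(y):
    # Geometric(theta) likelihood (support 0,1,2,...), Unif(0,1) prior:
    # p2(y) = Beta(n+1, s+1) = n! s! / (n+s+1)!
    n, s = len(y), sum(y)
    return math.lgamma(n + 1) + math.lgamma(s + 1) - math.lgamma(n + s + 2)

def log_bayes_factor(y):
    # log of p1(y)/p2(y); positive values favour the Poisson model
    return log_evidence_poisson(y) - log_evidence_geometric(y)
```

For example, a single observation y = [0] gives evidence 1/2 under both models, so the log Bayes factor is exactly zero.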

SLIDE 34

Results: box plots

SLIDE 35

Results: ABC-IS

SLIDE 36

Results: SL-IS

SLIDE 37

Results: MAVIS

SLIDE 38

Application to social networks

Compare the evidence for two alternative exponential random graph models, p(y|θ) ∝ exp(θᵀ S(y)):

    in model 1, S(y) = number of edges;
    in model 2, S(y) = (number of edges, number of two-stars), so now θ is 2-d.

Use prior p(θ) = N(0, 25I), as in Friel (2013).

SLIDE 39

Results: social network

Friel (2013) finds that the evidence for model 1 is 37.499× that for model 2. Using 1000 importance points (with 100 simulations from the likelihood for each point)...

ABC:
    ε = 0.1 gives p̂(y|M1)/p̂(y|M2) ≈ 4;
    ε = 0.05 gives p̂(y|M1)/p̂(y|M2) ≈ 20, but has only 5 points with non-zero weight!

Synthetic likelihood obtains p̂(y|M1)/p̂(y|M2) ≈ 40.

MAVIS finds log p̂(y|M1) = −69.62304 and log p̂(y|M2) = −73.33692, giving p̂(y|M1)/p̂(y|M2) ≈ 41.


SLIDE 44

Comparison of methods

ABC vs MAVIS:

    Both require the simulation of auxiliary variables, but in ABC/SL the use of summary statistics dramatically reduces the dimension of the space.
    However, MAVIS only requires the auxiliary variable to look like a good simulation from f(·|θ̃), not (the different requirement) that it is a good match to y.
    Plus the standard drawbacks of ABC remain: the choice of tolerance ε, and not being able to estimate the evidence, only Bayes factors.

SL vs ABC:

    SL fails when the Gaussian assumption is not appropriate...
    ... but it is surprisingly robust, and there is no need to choose an ε.


SLIDE 47

Summary

We can tackle doubly intractable problems using: ABC; synthetic likelihood; auxiliary variable methods; Russian Roulette.

Used within importance sampling, these methods also let us estimate marginal likelihoods and Bayes factors. For high-dimensional θ, SMC algorithms can be employed; in some cases the Bayes factor can be estimated directly.

Thanks to Nial Friel, Melina Evdemon-Hogan and Ellen Rowing.
