A fast sampler for data simulation from spatial, and other, Markov - PowerPoint PPT Presentation

A fast sampler for data simulation from spatial, and other, Markov random fields Andee Kaplan Iowa State University ajkaplan@iastate.edu June 22, 2017 Slides available at http://bit.ly/kaplan-phd Joint work with M. Kaiser, S. Lahiri, and D. Nordman Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 1 / 41

Overview Thesis: On advancing MCMC-based methods for Markovian data structures with applications to deep learning, simulation, and resampling Goal: Develop statistical inference via Markov chain Monte Carlo (MCMC) techniques in complex data problems related to statistical learning, the analysis of network/graph data, and spatial resampling Challenge: Develop model-based methodology, which is both statistically rigorous and computationally scalable , by exploiting conditional independence 1 Statistical properties of graph models used in deep machine learning and image classification (Ch. 2 & 3) 2 Fast methods for simulating spatial, network, and other data (Ch. 4 & 5) Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 2 / 41

This talk Markov random field models are popular for spatial or network data Rather than specifying a joint distribution directly, a model is specified through a set of full conditional distributions for each spatial location Conditional distributions are assumed to correspond to a valid joint (e.g., sufficient conditions in Kaiser and Cressie (2000)) Goal: A new, provably fast approach for simulating spatial/network data under a Markov model Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 3 / 41

Spatial Markov random field (MRF) models Notation Variables { Y ( s i ) : i = 1 , . . . , n } at locations { s i : i = 1 , . . . , n } Neighborhoods: N i specified according to some configuration Neighboring Values: y ( N i ) = { y ( s j ) : s j ∈ N i } Full Conditionals: { f i ( y ( s i ) | y ( N i ) , θ ) : i = 1 , . . . , n } f i ( y ( s i ) | y ( N i ) , θ ) is conditional pmf/pdf of Y ( s i ) given values for its neighbors y ( N i ) Often assume a common conditional cdf F i = F form ( f i = f ) for all i Formulation adaptable to non-spatial data letting s i be a marker for observation Y ( s i ) (e.g., random graphs: s i represents a potential edge and Y ( s i ) ∈ { 0 , 1 } ) Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 4 / 41

Common neighborhood structures 4-nearest neighborhood 8-nearest neighborhood Defined by locations in cardinal Also includes neighboring diagonals directions · ∗ · ∗ ∗ ∗ ∗ ∗ s i ∗ ∗ s i · ∗ · ∗ ∗ ∗ � � � N i = { s i ± (0 , 1) } { s i ± (1 , 0) } N i = { s i ± (0 , 1) } { s i ± (1 , 0) } � { s i ± (1 , − 1) } { s i ± (1 , 1) } Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 5 / 41

Exponential family examples 1 Conditional Gaussian (3 parameters): � � − [ y ( s i ) − µ ( s i )] 2 1 √ f i ( y ( s i ) | y ( N i ) , α, η, τ ) = 2 πτ exp 2 τ 2 Y ( s i ) given neighbors y ( N i ) is normal with variance τ 2 and mean � µ ( s i ) = α + η [ y ( s j ) − α ] s j ∈N i 2 Conditional Binary (2 parameters): Y ( s i ) given neighbors y ( N i ) is Bernoulli p ( s i , κ, η ) where � logit [ p ( s i , κ, η )] = logit ( κ ) + η [ y ( s j ) − κ ] s j ∈N i In both examples, η represents a dependence parameter. Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 6 / 41

Illustrative Example For context, illustrate some common simulation demands arising in inference about spatial Markov models Spatial dataset from Besag (1977) Binary observations located on a 14 × 179 indicating the presence or absence of footrot in endive plants 15 10 Disease present Row N Y 5 0 0 50 100 150 Column Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 7 / 41

Three spatial binary models Isotropic centered autologistic model (Caragea and Kaiser 2009; Besag 1 1972; Besag 1977) 2 Centered autologistic model with two dependence parameters 3 Centered autologistic model as in (2) but having large scale structure determined by regression on the horizontal coordinate u i of each spatial location s i = ( u i , v i ). Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 8 / 41

Three models (Cont’d) Conditional mass function of the form exp[ y ( s i ) A i { y ( N i ) } ] f i ( y ( s i ) | y ( N i ) , θ ) = 1 + exp[ y ( s i ) A i { y ( N i ) } , y ( s i ) = 0 , 1 , with Model Natural parameter function A i { y ( N i ) } = log � � + η � (1) { y ( s j ) − κ } κ 1 − κ s j ∈N i A i { y ( N i ) } = log � � � � (2) + η u { y ( s j ) − κ } + η v { y ( s j ) − κ } κ 1 − κ s j ∈ N u , i s j ∈ N v , i (3) κ i � � � � A i { y ( N i ) } = log + η u { y ( s j ) − κ i } + η v { y ( s j ) − κ i } , 1 − κ i s j ∈ N u , i s j ∈ N v , i κ i � � log = β 0 + β 1 u i 1 − κ i Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 9 / 41

Bootstrap percentile confidence intervals Fit three models of increasing complexity to these data via pseudo-likelihood (Besag 1975) Apply simulation (parametric bootstrap) to obtain reference distributions for statistics based on the resulting estimators This involves the Gibbs sampler (due to the conditional model specification), where computational demands arise Model (1) Model (2) Model (3) η κ η u η v κ η u η v β 0 β 1 2.5% 0.628 0.107 0.691 0.378 0.106 -0.225 -0.221 -1.822 -0.003 50% 0.816 0.126 0.958 0.660 0.125 0.000 0.004 -1.600 -0.001 97.5% 1.001 0.145 1.220 0.921 0.145 0.209 0.214 -1.391 0.001 Bootstrap percentile confidence intervals in all three autologistic models Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 10 / 41

Sampling distributions via bootstrap simulation 4 ^ = 0.8213 η 3 (1) 2 1 0 0.4 0.6 0.8 1.0 1.2 η u η v 3 ^ = 0.965 ^ = 0.6598 η η 2 (2) 1 0 0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 η u η v ^ = 0.001 ^ = − 0.0475 3 η η (3) 2 1 0 −0.50 −0.25 0.00 0.25 −0.50 −0.25 0.00 0.25 Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 11 / 41

Common Spatial Simulation Approach With common conditionally specified models for spatial lattice, standard MCMC simulation approach via Gibbs sampling is: Starting from some initial Y ( j ) ≡ { Y ( j ) ∗ ( s 1 ) , . . . , Y ( j ) ∗ ( s n ) } , ∗ 1 Moving row-wise, for i = 1 , . . . , n , individually simulate/update Y ( j +1) ( s i ) for each location s i from conditional cdf F given ∗ Y ( j +1) ( s 1 ) , . . . , Y ( j +1) Y ( j ) ∗ ( s i +1 ) , . . . , Y ( j ) ( s i − 1 ) , ∗ ( s n ) ∗ ∗ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✛ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ ✉ 2 n individual updates provide 1 full Gibbs iteration. Repeat 1-2 to obtain M resampled spatial data sets Y ( j ) ∗ , j = 1 , . . . , M 3 (e.g., can burn-in, thin, etc.) Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 12 / 41

Endive data timing Endive example dataset simulations performed with the proposed (conclique-based) Gibbs sampler to follow Reported results would have been virtually identical with the same number of iterations to the standard sequential Gibbs sampler By model, generation of the reference distribution using the standard sampler would have taken approximately 25 . 31 minutes longer 1 31 minutes longer 2 40 . 7 minutes longer 3 Conclique MRF sampler had running times 8 . 15 seconds 1 14 . 74 seconds 2 95 . 71 seconds 3 Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 13 / 41

Concliques Cliques – Hammersley and Clifford (1971) Singletons and sets of locations such that each location in the set is a neighbor of all other locations in the set Example: Four nearest neighbors gives cliques of sizes 1 and 2 The Converse of Cliques – Concliques (Kaiser, Lahiri, and Nordman 2012) Sets of locations such that no location in the set is a neighbor of any other location in the set Concliques 4 Nearest 8 Nearest Concliques 8 Nearest Neighbors Neighbors 4 Nearest Neighbors Neighbors · ∗ · ∗ ∗ ∗ 1 2 1 2 ∗ ∗ ∗ ∗ s s 1 2 1 2 3 4 3 4 · ∗ · ∗ ∗ ∗ 2 1 2 1 1 2 1 2 1 2 1 2 3 4 3 4 2 1 2 1 Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 14 / 41

Generalized spatial residuals (Kaiser, Lahiri, and Nordman 2012) Definition F ( y | y ( N i ) , θ ) is the conditional cdf of Y ( s i ) under the model Substitute random variables, Y ( s i ) and neighbors { Y ( s j ) : s j ∈ N i } , into (continuous) conditional cdf to define residuals: R ( s i ) = F ( Y ( s i ) |{ Y ( s j ) : s j ∈ N i } , θ ) . Key Property Let {C j : j = 1 , . . . , q } be a collection of concliques that partition the integer grid. Under the conditional model, spatial residuals within a conclique are iid Uniform (0 , 1) -distributed : { R ( s i ) : s i ∈ C j } iid ∼ Uniform(0 , 1) for j = 1 , . . . , q Andee Kaplan ( ajkaplan@iastate.edu ) Conclique-based Gibbs June 22, 2017 15 / 41

A fast sampler for data simulation from spatial, and other, Markov - PowerPoint PPT Presentation

A fast sampler for data simulation from spatial, and other, Markov random fields Andee Kaplan Iowa State University ajkaplan@iastate.edu June 22, 2017 Slides available at http://bit.ly/kaplan-phd Joint work with M. Kaiser, S. Lahiri, and D.

Sampling and Reporting for Sampler 1 and 2 Certification Sampling and Reporting for Sampler 1 and 2

PM Sampler Placement and Sampler Errors! Why should Regulatory and Agricultural Industries Care?

STAT 209 Spatial Data I April 30, 2018 Colin Reimer Dawson 1 / 26 Spatial Data Projections

CSE 527 Lecture 9 The Gibbs Sampler Talk Today Zasha Weinberg Combi HSB K-069, 1:30

Resource 1: What is spatial? presentation notes Section Section text Notes 1. Spatial

Broadening the Study of Spatial Intelligence Mary Hegarty University of California, Santa

A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial

Outline Narcisse Ngada DESY, MKK 1) What is simulation ? 14.05.2014 2) Why simulation ? 3)

2 Introduction Topics To Cover Review of Sampler Design How is sampling inaccurate

The Gibbs Sampler CSE 527 Lecture 9 Lawrence, et al. Detecting Subtle Sequence

Creating a Science of Spatial Learning Nora S. Newcombe Temple University PI, Spatial

Spatial Digitech Keep it s im ple Make it spatial About US Spatial Digitech is a provider of

UCSB is Spatial ! http://www.spatial.ucsb.edu Specialist Meeting on Spatial Thinking across the

Marine Spatial Data for Marine Spatial Planning Ocean Leadership 2010 Public Policy Forum March

Geostatistical data Barry Rowlingson Geostatistician DataCamp Spatial Statistics in R Data

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

A Coh{; hvou s of f i etol t tne and C<QebvaS ou?o;d tahgca frv {o, jv. eait'olls

Generalized diffeomorphisms acting on generalized metrics Roberto Rubio Research seminar on

Pre-ICANN69 GNSO Working Session 6 October 2020 | 1 Agenda 1 2 3 Review of All Rights

Computer Graphics - Splines - Philipp Slusallek Curves Curve descriptions Explicit

Optical Propagation, Detection, and Communication Jeffrey H. Shapiro Massachusetts Institute of

Modelling with EMPIRE nuclear reaction code and nuclear data evaluation Joint ICTP-IAEA Workshop

Polynomial time algorithms in Kolmogorov complexity theory Marius Zimand Towson University CCR

Introduction From 1975 till 2005: Computing Science at CERN (www.cern.ch) Developing HPC

A fast sampler for data simulation from spatial, and other, Markov - PowerPoint PPT Presentation

A fast sampler for data simulation from spatial, and other, Markov random fields Andee Kaplan Iowa State University ajkaplan@iastate.edu June 22, 2017 Slides available at http://bit.ly/kaplan-phd Joint work with M. Kaiser, S. Lahiri, and D.

Sampling and Reporting for Sampler 1 and 2 Certification Sampling and Reporting for Sampler 1 and 2

PM Sampler Placement and Sampler Errors! Why should Regulatory and Agricultural Industries Care?

STAT 209 Spatial Data I April 30, 2018 Colin Reimer Dawson 1 / 26 Spatial Data Projections

CSE 527 Lecture 9 The Gibbs Sampler Talk Today Zasha Weinberg Combi HSB K-069, 1:30

Resource 1: What is spatial? presentation notes Section Section text Notes 1. Spatial

Broadening the Study of Spatial Intelligence Mary Hegarty University of California, Santa

A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial Cloaking Framework A Spatial

Outline Narcisse Ngada DESY, MKK 1) What is simulation ? 14.05.2014 2) Why simulation ? 3)

2 Introduction Topics To Cover Review of Sampler Design How is sampling inaccurate

The Gibbs Sampler CSE 527 Lecture 9 Lawrence, et al. Detecting Subtle Sequence

Creating a Science of Spatial Learning Nora S. Newcombe Temple University PI, Spatial

Spatial Digitech Keep it s im ple Make it spatial About US Spatial Digitech is a provider of

UCSB is Spatial ! http://www.spatial.ucsb.edu Specialist Meeting on Spatial Thinking across the

Marine Spatial Data for Marine Spatial Planning Ocean Leadership 2010 Public Policy Forum March

Geostatistical data Barry Rowlingson Geostatistician DataCamp Spatial Statistics in R Data

Being a METS Startup Fast Failure; Fast Reward November 2016 Fast Failure; Fast Reward

A Coh{; hvou s of f i etol t tne and C*&lt;QebvaS ou?o;d tahgca* frv {o, jv. eait'olls

Generalized diffeomorphisms acting on generalized metrics Roberto Rubio Research seminar on

Pre-ICANN69 GNSO Working Session 6 October 2020 | 1 Agenda 1 2 3 Review of All Rights

Computer Graphics - Splines - Philipp Slusallek Curves Curve descriptions Explicit

Optical Propagation, Detection, and Communication Jeffrey H. Shapiro Massachusetts Institute of

Modelling with EMPIRE nuclear reaction code and nuclear data evaluation Joint ICTP-IAEA Workshop

Polynomial time algorithms in Kolmogorov complexity theory Marius Zimand Towson University CCR

Introduction From 1975 till 2005: Computing Science at CERN (www.cern.ch) Developing HPC

A Coh{; hvou s of f i etol t tne and C<QebvaS ou?o;d tahgca frv {o, jv. eait'olls