SLIDE 1

Partially Exchangeable Networks and Architectures for Learning Summary Statistics in Approximate Bayesian Computation

ICML 2019 Samuel Wiqvist

Centre for Mathematical Sciences, Lund University, Sweden

June 13, 2019 Joint work with Pierre-Alexandre Mattei (IT University Copenhagen), Umberto Picchini (Chalmers/University of Gothenburg), and Jes Frellsen (IT University Copenhagen)

Samuel Wiqvist (Lund University) Partially Exchangeable Networks and ABC June 13, 2019 1 / 11

SLIDE 2

ABC: Simulation-based inference

ABC only requires that we can simulate data from our model p(y|θ); thus ABC is very generic and can be applied to models where the likelihood is intractable. ABC in a nutshell:

1. Generate a parameter proposal θ⋆ from the prior p(θ);

2. Accept θ⋆ if the generated data y⋆ ∼ p(y|θ⋆) is similar to our observed data yobs;

3. Repeat Steps 1-2 a large number of times;

4. The accepted θ’s are samples from an approximation to the posterior p(θ|yobs).

Curse of dimensionality: instead of comparing y⋆ with yobs directly, we compare summary statistics S(y⋆) and S(yobs). The main focus of our work is how to automatically learn summary statistics S(·) that are informative for θ.
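Steps 1-4 can be sketched in a few lines of rejection ABC. The Gaussian toy model, the uniform prior, the sample-mean summary, and the tolerance 0.05 below are illustrative assumptions, not the talk's setup:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy model: y | theta ~ N(theta, 1), with 50 observations.
def simulate(theta, n=50):
    return rng.normal(theta, 1.0, size=n)

def summary(y):
    # S(y): here simply the sample mean
    return y.mean()

y_obs = simulate(1.5)  # stand-in for the observed data

# Rejection ABC, following steps 1-4 above
accepted = []
for _ in range(20000):
    theta_star = rng.uniform(-5.0, 5.0)                   # 1. propose theta* from the prior
    y_star = simulate(theta_star)                         #    simulate y* ~ p(y | theta*)
    if abs(summary(y_star) - summary(y_obs)) < 0.05:      # 2. accept if summaries are close
        accepted.append(theta_star)

# 4. the accepted theta's approximate p(theta | y_obs)
posterior_mean = np.mean(accepted)
```

The choice of S(·) and the tolerance govern how good the posterior approximation is, which is exactly why learning informative summary statistics matters.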




SLIDE 9

How to select/learn summary statistics

The problem of selecting informative summary statistics is the main challenge when applying ABC in practice; usually, summary statistics are ad hoc and “handpicked” using subject-domain expertise. Fearnhead and Prangle1 show that the best summary statistic (in terms of quadratic loss for θ) is the posterior mean E(θ|y); deep learning methods that learn the posterior mean as a summary statistic for ABC have already been considered2.

1 Paul Fearnhead and Dennis Prangle. “Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation”. In: Journal of the Royal Statistical Society: Series B (Statistical Methodology) 74.3 (2012), pp. 419–474.

2 Bai Jiang et al. “Learning summary statistic for approximate Bayesian computation via deep neural network”. In: Statistica Sinica (2017), pp. 1595–1618.
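The posterior-mean idea can be illustrated with a minimal regression sketch: regress θ on features of simulated data, then use the fitted function as S(·). The toy Gaussian model and the linear least-squares fit are assumptions for illustration; the cited works use richer regressors and deep networks:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy setup (illustrative): theta ~ U(-3, 3), y | theta ~ N(theta, 1), 25 observations.
n_train, n_obs = 5000, 25
theta = rng.uniform(-3.0, 3.0, size=n_train)                  # draws from the prior
Y = rng.normal(theta[:, None], 1.0, size=(n_train, n_obs))    # simulated datasets

# Regress theta on simple features of y; under quadratic loss the fitted
# function approximates the posterior mean E(theta | y).
X = np.column_stack([np.ones(n_train), Y.mean(axis=1), Y.var(axis=1)])
beta, *_ = np.linalg.lstsq(X, theta, rcond=None)

def S(y):
    """Learned summary statistic: the fitted posterior-mean regression."""
    return np.array([1.0, y.mean(), y.var()]) @ beta
```

The fitted S(y) is then plugged into the rejection scheme in place of a handpicked statistic.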



SLIDE 13

Designing the PEN architecture

We build on the earlier ideas and want to target time series models; thus, we construct a regression function y → E(θ|y) that is d-block-switch invariant, yielding the following regression problem:

θ^i = E(θ | y^i) + ξ^i = ρ_{β_ρ}( y^i_{1:d}, Σ_{l=1}^{M−d} φ_{β_φ}(y^i_{l:l+d}) ) + ξ^i,   (PEN-d)

where the inner network φ_{β_φ} acts on sliding windows of length d + 1 and the outer network ρ_{β_ρ} combines the first d values with the summed φ outputs. We have a universal approximation theorem for this architecture; DeepSets3 is a special case of PEN.

3 Manzil Zaheer et al. “Deep sets”. In: Advances in Neural Information Processing Systems. 2017, pp. 3391–3401.
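A forward pass through the PEN-d map can be sketched with single-layer networks and random, untrained weights; the layer sizes and the tanh nonlinearity are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(2)

d, hidden = 2, 16
W_phi = rng.normal(size=(d + 1, hidden)) * 0.1   # phi_{beta_phi}: R^{d+1} -> R^hidden
W_rho = rng.normal(size=(d + hidden, 1)) * 0.1   # rho_{beta_rho}: R^{d+hidden} -> R

def pen_forward(y):
    """PEN-d: rho( y_{1:d}, sum_l phi(y_{l:l+d}) ), here with random weights."""
    M = len(y)
    # all length-(d+1) sliding windows y_{l:l+d}, l = 1, ..., M-d
    windows = np.stack([y[l:l + d + 1] for l in range(M - d)])
    # summing the phi outputs is what makes the network d-block-switch invariant
    phi_sum = np.tanh(windows @ W_phi).sum(axis=0)
    head = np.concatenate([y[:d], phi_sum])      # (y_{1:d}, summed phi outputs)
    return float(head @ W_rho)
```

With d = 0 the first argument vanishes and the sum runs over single observations, which recovers the DeepSets form.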


SLIDE 14

AR(2) model

An autoregressive time series model of order two (AR(2)) is given by: yl = θ1yl−1 + θ2yl−2 + zl, zl ∼ N(0, 1). The AR(2) model is a Markov model of order 2, so the requirement for PEN-d is fulfilled with d = 2; we use a PEN-2 network (and compare with several other methods).
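The AR(2) simulator is only a few lines; the parameter values below are illustrative draws inside the stationarity region, not those used in the experiments:

```python
import numpy as np

rng = np.random.default_rng(3)

def simulate_ar2(theta1, theta2, M=100):
    """Simulate y_l = theta1*y_{l-1} + theta2*y_{l-2} + z_l with z_l ~ N(0, 1)."""
    y = np.zeros(M)
    for l in range(2, M):
        y[l] = theta1 * y[l - 1] + theta2 * y[l - 2] + rng.normal()
    return y

y = simulate_ar2(0.2, -0.1)
```

Because y_l depends only on the two previous values, sliding windows of length d + 1 = 3 contain all the local dependence a PEN-2 network needs.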



SLIDE 17

AR(2) model: Inference results with 10⁶ training data points

Figure: Green line: prior distribution; contour plot: exact posterior; blue dots: 100 samples from each of the ABC posteriors.


SLIDE 18

AR(2) model: Inference results with 10⁵ training data points

Figure: Green line: prior distribution; contour plot: exact posterior; blue dots: 100 samples from each of the ABC posteriors.


SLIDE 19

AR(2) model: Inference results with 10⁴ training data points

Figure: Green line: prior distribution; contour plot: exact posterior; blue dots: 100 samples from each of the ABC posteriors.


SLIDE 20

AR(2) model: Inference results with 10³ training data points

Figure: Green line: prior distribution; contour plot: exact posterior; blue dots: 100 samples from each of the ABC posteriors.


SLIDE 21

Conclusions

PEN is more data-efficient than the other methods; does PEN work for time-series models that are not Markovian? Check out the paper/poster to find out! Learning summary statistics for ABC is only one possible application of PEN.



SLIDE 24

The end

Thank you for listening!

Find the paper at: tinyurl.com/pen-and-abc

Poster (today at 6:30 PM): Pacific Ballroom #87
