Joint work with
Matthew Holden (Maxwell Institute), Kostas Zygalakis (Edinburgh University & Maxwell Institute), Andrés Almansa (CNRS, University of Paris)
Outline
- Introduction
- Proposed method
- Experiments
Forward problem
True scene → Imaging device → Observed image
Inverse problem
Observed image → Imaging method → Estimated scene
Problem statement
- We are interested in recovering an unknown image x ∈ ℝⁿ.
- We measure y, related to x by some mathematical model.
- For example, many imaging problems involve models of the form
  y = Ax + w,
  for some linear operator A and some perturbation or "noise" w (a sketch is given below).
- The recovery of x from y is often ill-posed or ill-conditioned, so we regularize it.
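As a rough illustration of this kind of forward model, here is a minimal sketch; the Gaussian-blur operator, kernel width and noise level are illustrative placeholders, not the settings used in the experiments later in the talk.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def forward_model(x, sigma_blur=1.0, sigma_noise=0.1, rng=None):
    """Simulate y = A x + w with A a Gaussian-blur operator and w Gaussian noise."""
    rng = np.random.default_rng() if rng is None else rng
    Ax = gaussian_filter(x, sigma=sigma_blur)         # the linear operator A (blur)
    w = sigma_noise * rng.standard_normal(x.shape)    # the perturbation / noise w
    return Ax + w

# Example: degrade a random 28x28 "image"
x = np.random.rand(28, 28)
y = forward_model(x)
```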
Bayesian statistics
- We formulate the estimation problem in the Bayesian statistical framework, a
probabilistic framework in which we represent x as a random quantity and use probability distributions to model expected properties.
- To derive inferences about x from y we postulate a joint statistical model p(x, y),
typically specified via the decomposition p(x, y) = p(y|x) p(x).
- The Bayesian framework is equipped with a powerful decision theory to derive
solutions and inform decisions and conclusions in a rigorous and defensible way.
Bayesian statistics
- The decomposition p(x, y) = p(y|x) p(x) has two ingredients:
- The likelihood: the conditional distribution p(y|x) that models the data observation process (the forward model).
- The prior: the marginal distribution p(x) = ∫ p(x, y) dy that models expected properties of the solutions.
- In imaging, p(y|x) usually has significant identifiability issues and we rely strongly on p(x) to regularize the estimation problem and deliver meaningful solutions.
Bayesian statistics
- We base our inferences on the posterior distribution
  p(x|y) = p(x, y) / p(y) = p(y|x) p(x) / p(y),
  where p(y) = ∫ p(x, y) dx provides an indication of the goodness of fit.
- The conditional distribution p(x|y) models our knowledge about the solution x after observing the data y, in a manner that is clear, modular and elegant.
- Inferences are then derived by using Bayesian decision theory.
Bayesian statistics
There are three main challenges in deploying Bayesian approaches in imaging sciences:
1. Bayesian computation: calculating probabilities and expectations w.r.t. p(x|y) is computationally expensive, although algorithms are improving rapidly.
2. Bayesian analysis: we do not usually know what questions to ask of p(x|y); imaging sciences are a field in transition and the concept of solution is evolving.
3. Bayesian modelling: while it is true that all models are wrong but some are useful, image models are often too simple to reliably support advanced inferences.
Outline
- Introduction
- Proposed method
- Experiments
In this talk
- Instead of specifying an analytic form for p(x), we consider the situation where the prior knowledge about x is available as a set of examples {x'_i}_{i=1}^M, i.i.d. w.r.t. x.
- We aim to combine this prior knowledge with a likelihood p(y|x) specified analytically to derive a posterior distribution p(x | y, {x'_i}_{i=1}^M).
- The goal is to construct p(x | y, {x'_i}_{i=1}^M) in a way that preserves the modularity and interpretability of analytic Bayesian models, and enables efficient computation.
Bayesian model
- Following the manifold hypothesis, we assume that x takes values close to an unknown d-dimensional submanifold of ℝⁿ.
- To estimate this submanifold from {x'_i}_{i=1}^M, we introduce a latent representation z ∈ ℝᵈ with d ≪ n, and a mapping f : ℝᵈ → ℝⁿ, such that the pushforward measure under f of z ~ N(0, I_d) is close to the empirical distribution of {x'_i}_{i=1}^M.
- Given f, the likelihood is p(y|z) = p_{y|x}(y | f(z)). We can then easily derive the posterior p(z|y) ∝ p(y|z) p(z) and benefit from greatly reduced dimensionality (a sketch is given below).
- The posterior p(x|y) is simply the pushforward measure of z|y under f.
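A minimal sketch of the resulting unnormalised latent posterior, assuming a Gaussian likelihood y = A f(z) + w with w ~ N(0, σ²I); the callables `f` and `A` and the noise level `sigma` are placeholders for the decoder and forward operator.

```python
import numpy as np

def log_posterior_z(z, y, f, A, sigma):
    """Unnormalised log p(z|y) = log p(y|z) + log p(z), with
    p(y|z) = N(y; A f(z), sigma^2 I) and p(z) = N(0, I_d)."""
    residual = y - A(f(z))                               # data misfit through the decoder
    log_lik = -0.5 * np.sum(residual ** 2) / sigma ** 2  # Gaussian log-likelihood (up to a constant)
    log_prior = -0.5 * np.sum(z ** 2)                    # standard Gaussian prior on z
    return log_lik + log_prior
```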
Estimating f
- There are different learning approaches to estimate f, e.g., variational auto-encoders (VAEs) and generative adversarial networks (GANs).
- We use a VAE, i.e., we assume x is generated from the latent variable z as follows:
  z ~ N(0, I_d),   x ~ p(x|z).
- As p(x|z) is unknown, we approximate it by a parameterized distribution p_θ(x|z) defined by a neural network (the decoder). This typically has the form N(μ_θ(z), σ_θ²(z) I).
- The objective is to set θ to maximize the marginal likelihood p_θ(x'_1, …, x'_M). This is usually computationally intractable, so we maximize a lower bound instead.
See Kingma D.P. et al., "Auto-encoding variational Bayes" (2013), arXiv:1312.6114.
Variational Auto-Encoders
- The variational lower bound on the log-likelihood is given by
  log p_θ(x) ≥ E_{q_φ(z|x)}[ log p_θ(x|z) ] − KL( q_φ(z|x) || p(z) ).
- q_φ(z|x) is an approximation of p_θ(z|x), parameterised by a
neural network (the encoder). Typically N(μ_φ(x), σ_φ²(x)).
- In maximising the variational lower bound, the encoder and
decoder are trained simultaneously.
- We use the decoder mean to define f, i.e., f(z) = μ_θ(z) (a sketch is given below).
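A minimal PyTorch sketch of a VAE of the kind described above, assuming flattened 28x28 images, a Gaussian decoder with fixed variance, and an illustrative latent dimension d = 12; layer sizes and hyperparameters are illustrative choices, not the talk's actual architecture.

```python
import torch
import torch.nn as nn

class VAE(nn.Module):
    def __init__(self, n=784, d=12, hidden=512):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(n, hidden), nn.ReLU(), nn.Linear(hidden, 2 * d))
        self.dec = nn.Sequential(nn.Linear(d, hidden), nn.ReLU(), nn.Linear(hidden, n))

    def encode(self, x):
        mu, log_var = self.enc(x).chunk(2, dim=-1)   # q_phi(z|x) = N(mu_phi(x), sigma_phi^2(x))
        return mu, log_var

    def decode(self, z):
        return self.dec(z)                           # decoder mean mu_theta(z), used as f(z)

    def elbo(self, x, sigma_x=0.1):
        mu, log_var = self.encode(x)
        z = mu + torch.exp(0.5 * log_var) * torch.randn_like(mu)   # reparameterisation trick
        recon = self.decode(z)
        # E_q[log p_theta(x|z)] for a Gaussian decoder with fixed variance sigma_x^2 (up to constants)
        log_lik = -0.5 * ((x - recon) ** 2).sum(dim=-1) / sigma_x ** 2
        # KL(q_phi(z|x) || N(0, I)) in closed form for diagonal Gaussians
        kl = 0.5 * (mu ** 2 + log_var.exp() - 1.0 - log_var).sum(dim=-1)
        return (log_lik - kl).mean()

# One gradient step maximising the ELBO (encoder and decoder trained jointly)
vae = VAE()
opt = torch.optim.Adam(vae.parameters(), lr=1e-3)
x_batch = torch.rand(64, 784)          # stand-in for a batch of flattened MNIST images
opt.zero_grad()
loss = -vae.elbo(x_batch)
loss.backward()
opt.step()
```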
Bayesian computation
- To compute probabilities and expectations for z|y we use a preconditioned Crank-Nicolson
(pCN) algorithm, which is a slow but robust Metropolized MCMC algorithm (a sketch is given below).
- For additional robustness w.r.t. multimodality, we run M+1 parallel Markov chains
targeting p(z), p^{1/M}(z|y), p^{2/M}(z|y), …, p(z|y), and perform randomized chain swaps.
- Probabilities and expectations for x|y are directly available by pushing the samples through f.
- We are developing fast gradient-based stochastic algorithms. Naïve off-the-shelf
implementations are not robust and have poor theoretical guarantees in this setting.
Cotter, Simon L., et al. "MCMC methods for functions: modifying old algorithms to make them faster." Statistical Science (2013): 424-446.
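A minimal sketch of one pCN update targeting p(z|y) under the N(0, I_d) latent prior; `log_lik` (the map z ↦ log p(y|z)) and the step size `beta` are placeholders. Because the pCN proposal leaves the Gaussian prior invariant, the acceptance ratio only involves the likelihood term.

```python
import numpy as np

def pcn_step(z, log_lik, beta=0.2, rng=None):
    """One preconditioned Crank-Nicolson update for p(z|y) ∝ exp(log_lik(z)) N(z; 0, I).

    Proposal: z' = sqrt(1 - beta^2) z + beta * xi, xi ~ N(0, I)."""
    rng = np.random.default_rng() if rng is None else rng
    z_prop = np.sqrt(1.0 - beta ** 2) * z + beta * rng.standard_normal(z.shape)
    log_accept = log_lik(z_prop) - log_lik(z)     # prior terms cancel for pCN
    if np.log(rng.uniform()) < log_accept:
        return z_prop, True
    return z, False

# Example usage with a placeholder Gaussian likelihood in d = 12 dimensions
log_lik = lambda z: -0.5 * np.sum((z - 1.0) ** 2)
z = np.zeros(12)
for _ in range(1000):
    z, _ = pcn_step(z, log_lik)
```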
Previous works
- Our work is closely related to the Joint MAP method of M. González et al. (2019)
arXiv:1911.06379, which considers a similar setup but seeks to compute the maximiser of p(x, z|y) by alternating optimization.
- It is also related to works that seek to learn p(x | y, {x'_i}_{i=1}^M) by using a GAN, e.g., Adler J et al. (2018) arXiv:1811.05910 and Zhang C et al. (2019) arXiv:1908.01010.
- More generally, it is part of a literature on data-driven regularization schemes; see
Arridge S, Maass P, Öktem O, and Schönlieb CB (2019) Acta Numerica, 28:1-174.
- The underlying vision of Bayesian imaging methodology was set out in the seminal paper
Besag J, Green P, Higdon D, Mengersen K (1995) Statist. Sci., 10(1): 3-41.
Outline
- Introduction
- Proposed method
- Experiments
Experiments
- We illustrate the proposed approach with three imaging problems: denoising,
deblurring (Gaussian blur 6x6 pixels), and inpainting (75% of missing pixels).
- For simplicity, we used the MNIST dataset (training set 60,000 images, test set
10,000 images, images of size 28x28 pixels). In our experiments we use approx. 10^5 iterations and 10 parallel chains, with computing times of the order of 5 minutes.
- We report comparisons with the J-MAP method of González et al. (2019) and the plug-and-play
ADMM of Venkatakrishnan et al. (2013), using a deep denoiser specialised for MNIST.
S.V. Venkatakrishnan, C.A. Bouman, and B. Wohlberg, Plug-and-Play Priors for Model Based Reconstruction, GlobalSIP, 2013.
Dimension of the latent space
- The dimension of the latent space plays an important role in the regularization of the
inverse problem and strongly impacts the quality of the model.
- We can easily identify suitable dimensions by looking at the empirical marginal p(z)
obtained from encoded training examples, e.g., we look at the trace of cov(z) (a sketch is given below).
d = 12
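A minimal sketch of one way to compute this diagnostic; `z_encoded` stands for the encoder outputs on the training examples (a hypothetical array, not part of the talk's code). The heuristic assumed here is that, if the encoded examples match the N(0, I_d) prior, trace(cov(z)) should be close to d.

```python
import numpy as np

def latent_covariance_trace(z_encoded):
    """z_encoded: array of shape (M, d) holding the encoded training examples.
    If the encoded examples match the N(0, I_d) prior, trace(cov(z)) is close to d."""
    return float(np.trace(np.cov(z_encoded, rowvar=False)))

# Example with d = 12: compare the trace against the nominal value 12
z_encoded = np.random.randn(10000, 12)     # placeholder for real encoder outputs
print(latent_covariance_trace(z_encoded))  # ~12 here; large deviations suggest a poor choice of d
```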
Image denoising
[Results figure: PSNR 0.5 dB, 20 dB, -8 dB]
Image deblurring
[Results figure: PSNR 20 dB, 32 dB, 12 dB]
Image inpainting
[Results figure: PSNR 25 dB, 37 dB, 18 dB]
Uncertainty visualization
- Inverse problems that are ill-conditioned or ill-posed typically have high levels of
intrinsic uncertainty, which are not captured by point estimators.
- As a way of visualizing this uncertainty, we compute an eigenvalue decomposition of
the (latent) posterior covariance matrix to identify its two leading eigenvectors.
- We then produce a grid of solutions across this two-dimensional subspace.
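A minimal sketch of this construction, assuming `z_samples` is an array of MCMC samples from p(z|y) (shape (N, d)) and `f` is the decoder mean; the grid size and the ±2 standard-deviation range are illustrative choices.

```python
import numpy as np

def uncertainty_grid(z_samples, f, n_grid=5, scale=2.0):
    """Decode a grid of solutions spanning the two leading eigenvector directions of cov(z|y)."""
    z_mean = z_samples.mean(axis=0)
    cov = np.cov(z_samples, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)               # eigenvalues in ascending order
    v1, v2 = eigvecs[:, -1], eigvecs[:, -2]              # two leading eigenvectors
    s1, s2 = np.sqrt(eigvals[-1]), np.sqrt(eigvals[-2])  # corresponding standard deviations
    ticks = np.linspace(-scale, scale, n_grid)
    return [[f(z_mean + a * s1 * v1 + b * s2 * v2) for b in ticks] for a in ticks]
```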
Visualizing uncertainty
[Figure: truth, noisy observation, f(E[z|y])]
Visualizing uncertainty
[Figure: truth, blurred & noisy observation, f(E[z|y])]
Warning of severe model misspecification
- Data-driven priors strongly concentrate probability mass in specific regions of the
solution space.
- When used appropriately, they can deliver impressive results.
- However, data-driven priors easily override the likelihood and can lead to severe
model misspecification when the truth differs significantly from the training examples.
Model misspecification testing
- When using data-driven priors it is important to perform model misspecification
diagnostic tests.
- In the spirit of the Neyman-Pearson Lemma, we construct a statistical test based on
the marginal likelihood p(y) that we estimate from the chains.
- We compute this statistic for synthetic observations generated from the training
dataset to establish the null distribution.
- This then allows misspecification testing and reporting p-values for observed data.
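A minimal sketch of how such a p-value could be computed, assuming `log_marginal_null` collects the (log) marginal-likelihood estimates for synthetic observations generated from the training set and `log_marginal_obs` is the estimate for the observed data; the one-sided convention (a small marginal likelihood signals misspecification) is an assumption, not a detail given in the talk.

```python
import numpy as np

def misspecification_p_value(log_marginal_obs, log_marginal_null):
    """Fraction of null-distribution statistics that are at least as extreme (as small)
    as the statistic of the observed data; a small p-value indicates misspecification."""
    log_marginal_null = np.asarray(log_marginal_null)
    return float(np.mean(log_marginal_null <= log_marginal_obs))

# Example: null statistics from synthetic observations generated from training images
null_stats = np.random.normal(loc=-100.0, scale=5.0, size=500)   # placeholder values
print(misspecification_p_value(-120.0, null_stats))               # low evidence -> small p-value
```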
Model misspecification test
Denoising experiment (s = 0.1). Reject null hypothesis (MNIST) with 99% confidence, and average power of 99.6% for the NotMNIST dataset.
Model misspecification test
Deblurring experiment (s = 0.01). Reject null hypothesis (MNIST) with 99% confidence, and average power of 99.8% for the NotMNIST dataset.
Model misspecification test
Inpainting experiment (s = 0.01). Reject null hypothesis (MNIST) with 99% confidence, and average power of 88.5% for the NotMNIST dataset.
Frequentist coverage of Bayesian probabilities
- Are the Bayesian probabilities reported by our models accurate in a
frequentist sense, i.e., are they in agreement with empirical averages from repeated experiments?
- We explore this question by repeating experiments with 1,000 test
images and measuring the empirical probability that the truth lies within the (1-α) highest posterior density credible region (a sketch of the check is given below).
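A minimal sketch of the coverage check, under the common sample-based approximation in which the truth is declared inside the (1-α) HPD region when its unnormalised log-posterior density exceeds the α-quantile of the log-posterior values of the MCMC samples; the function and argument names are placeholders.

```python
import numpy as np

def truth_in_hpd(log_post_truth, log_post_samples, alpha=0.1):
    """Approximate test that the truth lies in the (1-alpha) highest posterior density region."""
    return log_post_truth >= np.quantile(log_post_samples, alpha)

def empirical_coverage(log_post_truths, log_post_samples_list, alpha=0.1):
    """Fraction of repeated experiments in which the truth falls inside the (1-alpha) HPD region."""
    hits = [truth_in_hpd(t, s, alpha) for t, s in zip(log_post_truths, log_post_samples_list)]
    return float(np.mean(hits))
```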
Frequentist coverage of Bayesian probabilities
Coverage properties for denoising (left) and inpainting (right) for different noise levels (pixel dynamic range [0,1]).
Frequentist coverage of Bayesian probabilities
To the best of our knowledge, this is the first example of a Bayesian model with accurate frequentist coverage properties in an imaging setting, albeit with a very simple image dataset!