SLIDE 1

Bayesian inference and mathematical imaging. Part III: probability & convex optimisation.

  • Dr. Marcelo Pereyra

http://www.macs.hw.ac.uk/~mp71/

Maxwell Institute for Mathematical Sciences, Heriot-Watt University

January 2019, CIRM, Marseille.

SLIDE 2

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 3

Imaging inverse problems

We are interested in an unknown image x ∈ Rd. We measure y, related to x by a statistical model p(y∣x). The recovery of x from y is ill-posed or ill-conditioned, resulting in significant uncertainty about x. For example, in many imaging problems y = Ax + w, for some operator A that is rank-deficient, and additive noise w.

SLIDE 4

The Bayesian framework

We use priors to reduce uncertainty and deliver accurate results. Given the prior p(x), the posterior distribution of x given y,

p(x∣y) = p(y∣x)p(x)/p(y),

models our knowledge about x after observing y. In this talk we consider that p(x∣y) is log-concave; i.e., p(x∣y) = exp{−φ(x)}/Z, where φ(x) is a convex function and Z = ∫ exp{−φ(x)}dx.

SLIDE 5

Maximum-a-posteriori (MAP) estimation

The predominant Bayesian approach in imaging is MAP estimation,

x̂MAP = argmax_{x∈Rd} p(x∣y) = argmin_{x∈Rd} φ(x),   (1)

computed efficiently, even in very high dimensions, by (proximal) convex optimisation (Green et al., 2015; Chambolle and Pock, 2016); a minimal solver sketch is given after the list below.

However, MAP estimation has some limitations, e.g.,
1. it provides little information about p(x∣y),
2. it is not theoretically well understood (yet),
3. it struggles with unknown/partially unknown models.
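To make this concrete, here is a minimal forward-backward (ISTA) sketch of the proximal optimisation route for (1), assuming a Gaussian likelihood gy(x) = ∥y − Ax∥²/2σ² and an ℓ1 prior; the function names and the toy data are illustrative assumptions, not material from the talk.

```python
import numpy as np

def soft_threshold(v, t):
    # Proximal operator of t*||.||_1 (component-wise soft-thresholding).
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def map_ista(y, A, sigma2, theta, n_iter=500):
    """Forward-backward (ISTA) iteration for
    x_MAP = argmin_x ||y - A x||^2 / (2*sigma2) + theta*||x||_1."""
    x = np.zeros(A.shape[1])
    lip = np.linalg.norm(A, 2) ** 2 / sigma2   # Lipschitz constant of the smooth term's gradient
    step = 1.0 / lip
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y) / sigma2      # gradient of the data-fidelity term
        x = soft_threshold(x - step * grad, step * theta)
    return x

# toy usage: recover a sparse signal from noisy random projections
rng = np.random.default_rng(0)
x_true = np.zeros(200)
x_true[rng.choice(200, 10, replace=False)] = 1.0
A = rng.standard_normal((80, 200)) / np.sqrt(80)
y = A @ x_true + 0.05 * rng.standard_normal(80)
x_map = map_ista(y, A, sigma2=0.05 ** 2, theta=0.1)
```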

SLIDE 6

Illustrative example: astronomical image reconstruction

Recover x ∈ Rd from a low-dimensional degraded observation y = MFx + w, where F is the continuous Fourier transform, M ∈ Cm×d is a measurement mask operator, and w is Gaussian noise. We use the model

p(x∣y) ∝ exp(−∥y − MFx∥²/2σ² − θ∥Ψx∥1) 1_{Rd+}(x).   (2)

Figure: Radio-interferometric image reconstruction of the W28 supernova (panels: ∣y∣ and x̂MAP).

SLIDE 7

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 8

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 9

Posterior credible regions

Where does the posterior probability mass of x lie? Recall that Cα is a (1 − α) posterior credible region if P[x ∈ Cα∣y] = 1 − α, and the decision-theoretically optimal choice is the HPD region (Robert, 2001)

C*α = {x ∶ φ(x) ≤ γα},

with γα ∈ R chosen such that ∫_{C*α} p(x∣y) dx = 1 − α holds.

We could estimate C*α by MCMC sampling, but in high-dimensional log-concave models this is not necessary because something beautiful happens...

SLIDE 10

A concentration phenomenon!

Figure: Convergence to the "typical" set {x ∶ log p(x∣y) ≈ E[log p(x∣y)]}.

SLIDE 11

Proposed approximation of C*α

Theorem 2.1 (Pereyra (2016))

Suppose that the posterior p(x∣y) = exp{−φ(x)}/Z is log-concave on Rd. Then, for any α ∈ (4 exp(−d/3), 1), the HPD region C*α is contained in

C̃α = {x ∶ φ(x) ≤ φ(x̂MAP) + √d τα + d},

with universal positive constant τα = √(16 log(3/α)), independent of p(x∣y).

Remark 1: C̃α is a conservative approximation of C*α, i.e., x ∉ C̃α ⇒ x ∉ C*α.

Remark 2: C̃α is available as a by-product in any convex inference problem that is solved by MAP estimation!
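In practice, Theorem 2.1 gives a one-line recipe once x̂MAP is available; a minimal sketch (the helper names are ours, not from the slides):

```python
import numpy as np

def hpd_threshold(phi_map, d, alpha):
    """Conservative HPD threshold of Theorem 2.1:
    gamma_tilde = phi(x_MAP) + sqrt(d)*tau_alpha + d, with tau_alpha = sqrt(16*log(3/alpha))."""
    tau_alpha = np.sqrt(16.0 * np.log(3.0 / alpha))
    return phi_map + np.sqrt(d) * tau_alpha + d

def maybe_in_hpd(phi, x, gamma_tilde):
    # By Remark 1, phi(x) > gamma_tilde rules x out of the exact HPD region C*_alpha.
    return phi(x) <= gamma_tilde
```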

SLIDE 12

Approximation error bounds

Is C̃α a reliable approximation of C*α?

Theorem 2.2 (Finite-dimensional error bound (Pereyra, 2016))

Let γ̃α = φ(x̂MAP) + √d τα + d. If p(x∣y) is log-concave on Rd, then

0 ≤ (γ̃α − γα)/d ≤ 1 + ηα d^(−1/2),

with universal positive constant ηα = √(16 log(3/α)) + √(1/α).

Remark 3: C̃α is stable (as d becomes large, the error (γ̃α − γα)/d ⪅ 1).

Remark 4: The lower and upper bounds are asymptotically tight w.r.t. d.

SLIDE 13

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 14

Example: Uncertainty visualisation in astro-imaging

Radio-interferometry with redundant wavelet frame (Cai et al., 2017).

Figure: M31 radio galaxy (size 256 × 256 pixels, computation time 1.8 s). Panels: dirty image, x̂penML(y), x̂MAP, approximate credible intervals (at scale 10 × 10 pixels), and "exact" intervals (MCMC, computed in minutes).

SLIDE 15

Example: Uncertainty visualisation in astro-imaging

Radio-interferometry with redundant wavelet frame (Cai et al., 2017).

Figure: 3C288 radio galaxy (size 256 × 256 pixels, computation time 1.8 s). Panels: dirty image, x̂penML(y), x̂MAP, approximate credible intervals (at scale 10 × 10 pixels), and "exact" intervals (MCMC, computed in minutes).

SLIDE 16

Example: uncertainty visualisation in microscopy

Live-cell microscopy sparse super-resolution (Zhu et al., 2012):

y = Ax + w   (A is a blur operator)

x̂MAP = argmin_{x∈Rd} ∥y − Ax∥²/2σ² + λ∥x∥1   (displayed in log-scale)

Consider the molecular structure in the highlighted region: are we confident about this structure (its presence, position, etc.)? Idea: use C̃α to explore and quantify the uncertainty about this structure.

SLIDE 17

Example: uncertainty visualisation in microscopy

Position uncertainty quantification: find the maximum molecule displacement within C̃0.05:

x̂MAP = argmin_{x∈Rd} ∥y − Ax∥²/2σ² + λ∥x∥1.   Molecule position uncertainty: ±93 nm × ±140 nm.

Note: this uncertainty analysis (±93 nm × ±140 nm) is in agreement with MCMC estimates (±78 nm × ±125 nm, approximation error of the order of 1 pixel), and with the experimental results (average precision ±80 nm) of Zhu et al. (2012).
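A rough sketch of the displacement test described above, under simplifying assumptions of our own (the structure is cut out of the 2-D image and pasted back with an integer horizontal shift; `phi` and `gamma_tilde` are the negative log-posterior and the C̃0.05 threshold from the previous slides):

```python
import numpy as np

def max_right_shift(x_map, region, phi, gamma_tilde, max_shift=20):
    """Largest shift s (in pixels) such that moving the structure in `region`
    (a boolean mask) s pixels to the right keeps the image inside C~_alpha."""
    best = 0
    for s in range(1, max_shift + 1):
        x_test = x_map.copy()
        x_test[region] = 0.0                                # remove the structure...
        x_test[np.roll(region, s, axis=1)] = x_map[region]  # ...and paste it shifted
        if phi(x_test) <= gamma_tilde:
            best = s
        else:
            break
    return best
```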

SLIDE 18

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 19

Hypothesis testing

Bayesian hypothesis test for specific image structures (e.g., lesions):

H0: the structure of interest is ABSENT from the true image
H1: the structure of interest is PRESENT in the true image

The null hypothesis H0 is rejected with significance α if P(H0∣y) ≤ α.

Theorem (Repetti et al., 2018)
Let S denote the region of Rd associated with H0, containing all images without the structure of interest. Then

S ∩ C̃α = ∅ ⇒ P(H0∣y) ≤ α.

If in addition S is convex, then checking whether S ∩ C̃α = ∅ is a convex problem:

min_{x̄, x ∈ Rd} ∥x̄ − x∥²  s.t.  x̄ ∈ C̃α, x ∈ S.
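The feasibility check above can be sketched with a simple alternating-projection (POCS) scheme; `proj_C` and `proj_S` are assumed to be available projection operators onto C̃α and S (our notation, not from the slides), and the scheme is only one possible way to address the convex problem:

```python
import numpy as np

def reject_H0(proj_C, proj_S, x0, n_iter=200, tol=1e-3):
    """Alternating projections between the convex credible region C~_alpha and the
    convex null set S: if the limiting distance stays positive, the two sets are
    disjoint, so P(H0|y) <= alpha and H0 is rejected."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iter):
        x_bar = proj_C(x)   # projection onto C~_alpha
        x = proj_S(x_bar)   # projection onto S
    return np.linalg.norm(x_bar - x) > tol
```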

SLIDE 20

Uncertainty quantification in MRI imaging

Figure panels: x̂MAP, x̄ ∈ C̃0.01, x ∈ S, and a zoom of each.

MRI experiment: the test images satisfy x̄ = x, hence we fail to reject H0 and conclude that there is little evidence to support the observed structure.

SLIDE 21

Uncertainty quantification in MRI imaging

Figure panels: x̂MAP, x̄ ∈ C̃0.01, x ∈ S, and a zoom of each.

MRI experiment: the test images satisfy x̄ ≠ x, hence we reject H0 and conclude that there is significant evidence in favour of the observed structure.

SLIDE 22

Uncertainty quantification in radio-interferometric imaging

Quantification of the minimum energy of different structures that is preserved at level (1 − α) = 0.99, as the number of measurements T = dim(y)/2 increases.

Figure: Analysis of 3 structures in the W28 supernova RI image (panels: ∣y∣, x̂MAP for T = 200, and the preserved energy ratio ρα at α = 0.01). Note: the energy ratio is calculated as ρα = ∥x̄ − x∥2 / ∥x̂MAP − x̃MAP∥2, where x̄, x are computed with α = 0.01, and x̃MAP is a modified version of x̂MAP in which the structure of interest has been carefully removed from the image.

SLIDE 23

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 24

Bayesian point estimators

Bayesian point estimators arise from the decision "what point x̂ ∈ Rd summarises x∣y best?". The optimal decision under uncertainty is

x̂L = argmin_{u∈Rd} E{L(u,x)∣y} = argmin_{u∈Rd} ∫ L(u,x) p(x∣y) dx,

where the loss L(u,x) measures the "dissimilarity" between u and x (a Monte Carlo sketch of this minimisation is given after the list below). Example: in the Euclidean setting L(u,x) = ∥u − x∥² and x̂L = x̂MMSE = E{x∣y}.

General desiderata:
1. L(u,x) ≥ 0, ∀u, x ∈ Rd,
2. L(u,x) = 0 ⟺ u = x,
3. L strictly convex w.r.t. its first argument (for estimator uniqueness).
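As a small illustration of the definition above (our own sketch, not part of the talk), the abstract minimisation over u can be approximated from posterior samples by gradient descent on the Monte Carlo average of the loss:

```python
import numpy as np

def bayes_estimator(samples, grad_loss, n_iter=500, step=0.05):
    """Monte Carlo sketch of x_hat_L = argmin_u E{L(u,x)|y}.
    `samples` is an (N, d) array of posterior samples and grad_loss(u, x)
    returns the gradient of L(u, x) in its first argument (assumed convex)."""
    u = samples.mean(axis=0)                 # initialise at the posterior mean
    for _ in range(n_iter):
        g = np.mean([grad_loss(u, x) for x in samples], axis=0)
        u = u - step * g
    return u

# with the Euclidean loss L(u,x) = ||u - x||^2 this returns (up to numerical error)
# the posterior mean, i.e. the MMSE estimator:
# u_hat = bayes_estimator(samples, lambda u, x: 2.0 * (u - x))
```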

SLIDE 25

Bayesian point estimators

Does the convex geometry of p(x∣y) define an interesting loss L(u,x)? We use differential geometry to relate the convexity of p(x∣y), the geometry of the parameter space, and the loss L used to perform estimation.

SLIDE 26

Differential geometry

A Riemannian manifold M = (Rd, g), with metric g ∶ Rd → Sd++ and global coordinate system x, is a vector space that is locally Euclidean. For any point x ∈ Rd we have a Euclidean tangent space TxRd with inner product ⟨u,x⟩ = u⊺g(x)x and norm ∥x∥ = √(x⊺g(x)x). This geometry is local and may vary smoothly from TxRd to Tx′Rd following the affine connection Γ ∈ Rd×d×d, given by Γij,k(x) = ∂k gi,j(x).

SLIDE 27

Divergence functions

Similarly to Euclidean spaces, the manifold (Rd,g) supports divergences:

Definition 1 (Divergence functions)

A function D ∶ Rd × Rd → R is a divergence function on Rd if the following conditions hold for any u, x ∈ Rd:
1. D(u,x) ≥ 0, ∀u, x ∈ Rd,
2. D(u,x) = 0 ⟺ u = x,
3. D(u,x) is strongly convex w.r.t. u, and C2 w.r.t. u and x.

SLIDE 28

Canonical divergence

We focus on the canonical divergence on (Rd,g), a generalisation of the Euclidean squared distance to this kind of manifold:

Definition 2 (Canonical divergence (Ay and Amari, 2015))

For any (u,x) ∈ Rd × Rd, the canonical divergence on (Rd, g) is given by

D(u,x) = ∫₀¹ t γ̇t⊺ g(γt) γ̇t dt,   (3)

where γt is the Γ-geodesic from u to x and γ̇t = d/dt γt.

1. D fully specifies (Rd, g), and vice-versa.
2. D(x + dx, x) = ∥dx∥²/2 + o(∥dx∥²), where ∥ ⋅ ∥ is the norm on TxRd.
3. For a Euclidean space with ⟨u,x⟩ = u⊺gx, D(u,x) = (1/2)(u − x)⊺g(u − x).

SLIDE 29

Differential-geometric derivation of x̂MAP and x̂MMSE

Theorem 3 (Canonical Bayesian estimators - Part 1 (Pereyra, 2016))

Suppose that φ(x) = −log p(x∣y) is strongly convex, continuous, and C3 on Rd. Let (Rd, g) be the manifold induced by φ, i.e., gi,j(x) = ∂i∂jφ(x). Then the canonical divergence on (Rd, g) is the φ-Bregman divergence

Dφ(u,x) = φ(u) − φ(x) − ∇φ(x)⊺(u − x).

Remark: because φ is strongly convex, φ(u) > φ(x) + ∇φ(x)⊺(u − x) for any u ≠ x. The divergence Dφ(u,x) quantifies this gap, which is related to the length of the affine geodesic from u to x on the space induced by p(x∣y).
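For reference, the φ-Bregman divergence of Theorem 3 is straightforward to evaluate numerically; a minimal sketch (the function names are ours; u and x are NumPy vectors):

```python
def bregman_divergence(phi, grad_phi, u, x):
    """phi-Bregman divergence of Theorem 3:
    D_phi(u, x) = phi(u) - phi(x) - <grad_phi(x), u - x>."""
    return phi(u) - phi(x) - grad_phi(x) @ (u - x)

# sanity check: for the quadratic phi(v) = 0.5*||v||^2 this reduces to the
# half squared Euclidean distance 0.5*||u - x||^2:
# bregman_divergence(lambda v: 0.5 * v @ v, lambda v: v, u, x)
```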

SLIDE 30

Differential-geometric derivation of x̂MAP and x̂MMSE

Theorem 4 (Canonical Bayesian estimators - Part 2 (Pereyra, 2016))

The Bayesian estimator associated with Dφ(u,x) is unique and is given by

x̂Dφ ≜ argmin_{u∈Rd} Ex∣y[Dφ(u,x)] = argmin_{x∈Rd} φ(x) = x̂MAP.

Remark 2: x̂MAP stems from Bayesian decision theory, and hence it stands on the same theoretical footing as the core Bayesian methodologies.

Remark 3: the definition of the MAP estimator as the maximiser x̂MAP = argmax_{x∈Rd} p(x∣y) is mainly algorithmic for these models.

SLIDE 31

Differential-geometric derivation of x̂MAP and x̂MMSE

Theorem 5 (Canonical Bayesian estimators - Part 3 (Pereyra, 2016))

Moreover, the Bayesian estimator associated with the dual canonical divergence D*φ(u,x) = Dφ(x,u) is also unique and is given by

x̂D*φ ≜ argmin_{u∈Rd} Ex∣y[D*φ(u,x)] = ∫Rd x p(x∣y) dx = x̂MMSE.

Remark 4: x̂MAP and x̂MMSE exhibit a surprising duality, arising from the asymmetry of the canonical divergence that p(x∣y) induces on Rd.

Remark 5: these results carry over partially to models that are not strongly convex, not smooth, or that involve constraints on the parameter space.

SLIDE 32

Expected estimation error bound

Are x̂MAP and x̂MMSE "good" estimators of x∣y?

Proposition 3.1 (Expected canonical error bound)

Suppose that φ(x) = −log p(x∣y) is convex on Rd and C1. Then

Ex∣y[D*φ(x̂MMSE, x)/d] ≤ Ex∣y[D*φ(x̂MAP, x)/d] ≤ 1.

Proposition 3.2 (Expected error w.r.t. the regularisation function)

Also assume that the regularisation h(x) = −log p(x) is convex. Then

Ex∣y[D*h(x̂MMSE, x)/d] ≤ Ex∣y[D*h(x̂MAP, x)/d] ≤ 1.

Remark 6: these are high-dimensional stability results for x̂MAP and x̂MMSE; the estimation error cannot grow faster than the number of pixels.

SLIDE 33

Example 1: denoising with wavelet shrinkage prior

Consider a linear problem of the form y = Ax + w and a shrinkage prior on the wavelet coefficients z = Wx. We consider the smoothed Laplace prior

p(z) ∝ exp{−λ ∑_{i=1}^d √(z_i² + γ²)},

where λ ∈ R+ and γ ∈ R+ are scale and shape parameters. The likelihood is

p(y∣z) ∝ exp{−∥y − AW⊺z∥²/2σ²},

and hence

p(z∣y) ∝ exp{−∥y − AW⊺z∥²/2σ² − λ ∑_{i=1}^d √(z_i² + γ²)}.

This model is C∞ and strongly log-concave, and hence the theory applies.

SLIDE 34

Example 1: denoising with wavelet shrinkage prior

To analyse the geometry induced by φ(z) = −log p(z∣y) we suppose that A = I and W⊺W = I, and obtain Dφ(u,z) = ∑_{i=1}^d Dψ(u_i, z_i) with

Dψ(u_i, z_i) = (u_i − z_i)²/2σ² + λ (√(z_i² + γ²) √(u_i² + γ²) − z_i u_i − γ²) / √(z_i² + γ²).

The non-quadratic term introduces additional shrinkage and leads to the differences between x̂MMSE and x̂MAP. To develop an intuition for this behaviour we analyse the regimes z_i ≪ γ and z_i ≫ γ.
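Under the same A = I, W⊺W = I assumption, each coefficient can be denoised independently; the following small numerical sketch (ours, with illustrative parameter values) computes the two scalar estimators and can be used to trace the denoising curves compared on the next slides:

```python
import numpy as np
from scipy.optimize import minimize_scalar
from scipy.integrate import quad

def scalar_map_mmse(y_i, sigma=0.08, lam=12.0, gamma=0.01, zmax=5.0):
    """MAP and MMSE estimates for the scalar posterior
    p(z|y_i) ∝ exp(-(y_i - z)^2/(2*sigma^2) - lam*sqrt(z^2 + gamma^2))."""
    nlp = lambda z: (y_i - z) ** 2 / (2 * sigma ** 2) + lam * np.sqrt(z ** 2 + gamma ** 2)
    z_map = minimize_scalar(nlp, bounds=(-zmax, zmax), method="bounded").x
    w = lambda z: np.exp(nlp(z_map) - nlp(z))   # unnormalised posterior, rescaled for stability
    norm = quad(w, -zmax, zmax)[0]
    z_mmse = quad(lambda z: z * w(z), -zmax, zmax)[0] / norm
    return z_map, z_mmse

# evaluating scalar_map_mmse over a grid of y_i values traces the MAP and MMSE
# denoising (shrinkage) curves that are compared on the following slides
```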

SLIDE 35

Example 1: denoising with wavelet shrinkage prior

When z_i ≫ γ the non-quadratic term vanishes, hence

Dψ(u_i, z_i) ≈ (u_i − z_i)²/2σ².

Hence, when p(z_i∣y) has most of its mass at large values of z_i, the MAP estimate of z_i will agree with the MMSE estimate E(z_i∣y). In this case there is no additional shrinkage from the estimator.

SLIDE 36

Example 1: denoising with wavelet shrinkage prior

Conversely, when z_i ≪ γ we have

Dψ(u_i, z_i) ≈ (u_i − z_i)²/2σ² + λ∣u_i∣ ≈ u_i²/2σ² + λ∣u_i∣,   for u_i ≫ γ,

and for u_i ≪ γ we obtain

Dψ(u_i, z_i) ≈ (u_i − z_i)²/2σ² + λ[u_i²/2γ + z_i²/2γ − z_i u_i/γ] = (1/2σ² + λ/2γ)(u_i − z_i)².

In these two cases Dψ boosts the effect of the shrinkage prior by promoting u_i values that are close to zero (explicitly via the penalty λ∣u_i∣, or by amplifying the constant of the quadratic loss from 1/2σ² to 1/2σ² + λ/2γ).

SLIDE 37

Example 1: denoising with wavelet shrinkage prior

Illustration with the Flintstones image (σ = 0.08, λ = 12 and γ = 0.01).

Figure panels: noisy image y (SNR 17.6 dB), x̂MAP (SNR 19.8 dB), x̂MMSE (SNR 17.7 dB), and the denoising functions for ẑMAP and ẑMMSE.

Illustrative example of a model where the shrinkage prior acts predominantly via Dψ (note: setting γ = 0 leads to x̂MAP with SNR 18.8 dB).

SLIDE 38

Illustrative example: astronomical image reconstruction

Generalisation warning: shrinkage priors can also act predominantly via the model (not Dψ), producing similar x̂MAP and x̂MMSE results; e.g.,

Figure panels: dirty image, x̂penML(y), x̂MAP, x̂MMSE.

Radio-interferometric imaging of the Cygnus A galaxy.

SLIDE 39

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 40

Problem statement

We consider the class of priors of the form p(x∣θ) = exp{−θh(x)}/C(θ), where h ∶ Rd → [0,∞] promotes expected properties of x, and θ ∈ R+ is a "regularisation" parameter controlling the strength of the prior.

When θ is fixed and the posterior p(x∣y,θ) is log-concave,

x̂(θ)MAP = argmin_{x∈Rd} gy(x) + θh(x)

is a convex optimisation problem that can often be solved efficiently.

Here we consider the infamous problem where the value of θ is not specified.

SLIDE 41

Hierarchical Bayesian treatment of unknown θ

Hierarchical Bayesian inference allows estimating x without specifying θ. We incorporate θ into the model by assigning it a hyper-prior p(θ). The extended model is

p(x,θ∣y) = p(y∣x)p(x∣θ)p(θ)/p(y) ∝ exp{−gy(x) − θh(x)} p(θ)/C(θ),   (4)

but C(θ) = ∫Rd exp{−θh(x)}dx is typically intractable! If we had access to C(θ) we could either estimate x and θ jointly, or alternatively marginalise θ and then perform inference on x.

SLIDE 42

Idea: Use Prox-MCMC to estimate E[h(x)∣θ] over a θ-grid, and then approximate log C(θ) by using the identity

d/dθ log C(θ) = −E[h(x)∣θ].

Figure: Monte Carlo approximations of E[h(x)∣θ] for 4 widely used prior distributions and for θ ∈ [10⁻³, 10²]. Surprise: they all coincide!
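A minimal sketch of the grid step (the function name is ours): given Monte Carlo estimates of E[h(x)∣θ] on an increasing θ-grid, log C(θ) follows, up to an additive constant, by numerically integrating the identity above.

```python
import numpy as np

def log_C_from_grid(thetas, expected_h):
    """Trapezoidal integration of d/dtheta log C(theta) = -E[h(x)|theta], where
    expected_h[i] is a Monte Carlo estimate of E[h(x)|thetas[i]]; returns
    log C(theta) - log C(thetas[0]) on the grid."""
    thetas = np.asarray(thetas, dtype=float)
    expected_h = np.asarray(expected_h, dtype=float)
    increments = -0.5 * (expected_h[1:] + expected_h[:-1]) * np.diff(thetas)
    return np.concatenate(([0.0], np.cumsum(increments)))
```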

SLIDE 43

Main theoretical result

Definition 4.1 (k-homogeneity)

The regulariser h is a k-homogeneous function if ∃ k ∈ R+ such that

h(ηx) = η^k h(x),   ∀x ∈ Rd, ∀η > 0.   (5)

Theorem 4.1 (Pereyra et al. (2015))

Suppose that h, the sufficient statistic of p(x∣θ), is k-homogeneous. Then the normalisation factor has the form C(θ) = D θ^(−d/k), with (generally intractable) constant D = C(1) independent of θ.

Note: this result holds for all norms (e.g., ℓ1, ℓ2, total-variation, nuclear, etc.), composite norms (e.g., ℓ1 − ℓ2), and compositions of norms with linear operators (e.g., analysis terms of the form ∥Ψx∥1)!
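For intuition, the result follows from a one-line change of variables (a sketch of the argument, not reproduced from the slides): by k-homogeneity, θ h(θ^(−1/k) u) = h(u), so substituting x = θ^(−1/k) u, dx = θ^(−d/k) du, gives

C(θ) = ∫Rd exp{−θh(x)} dx = θ^(−d/k) ∫Rd exp{−h(u)} du = C(1) θ^(−d/k).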

SLIDE 44

Marginal maximum-a-posteriori estimation of x

Knowledge of C(θ) enables (for example) marginal MAP estimation of x:

x̂†MAP = argmax_{x∈Rd} ∫ p(x,θ∣y) dθ = argmin_{x∈Rd} gy(x) + (d/k + α) log{h(x) + β},   (6)

where we have used the hyper-prior θ ∼ Gamma(α,β).

We can compute x̂†MAP efficiently by majorisation-minimisation optimisation,

x(t) = argmin_{x∈Rd} gy(x) + θ(t−1) h(x),   θ(t) = (d/k + α)/(h(x(t)) + β),   (7)

which is also an expectation-maximisation algorithm.
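A compact sketch of iteration (7) (our wrapper; `solve_weighted_map` stands for any convex solver of the inner problem, e.g. a proximal method such as the ISTA sketch given earlier):

```python
def marginal_map(solve_weighted_map, h, d, k=1.0, alpha=1.0, beta=1.0,
                 theta0=1.0, n_iter=20):
    """Majorisation-minimisation / EM scheme (7): alternate a MAP problem in x
    (with theta fixed) and the closed-form update of theta.
    `solve_weighted_map(theta)` must return argmin_x g_y(x) + theta*h(x)."""
    theta = theta0
    for _ in range(n_iter):
        x = solve_weighted_map(theta)             # x^(t)
        theta = (d / k + alpha) / (h(x) + beta)   # theta^(t)
    return x, theta
```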

SLIDE 45

Compressive sensing with ℓ1-wavelet analysis prior

Recover an original image x ∈ Rd of size d = 512 × 512 from a compressed and noisy measurement

y = Φx + w

of size p = d/2, where Φ ∈ Rp×d is a compressive sensing random matrix and w ∼ N(0, σ²Ip) is Gaussian noise with σ² = 10.

We use the analysis prior p(x∣θ) = exp{−θ∥Ψx∥1}/C(θ), where Ψ is a Daubechies 4 wavelet frame. Note: ∥Ψx∥1 is k-homogeneous with k = 1.

SLIDE 46

Experiment with the Boat and Mandrill test images

Figure: Compressive sensing experiment with the Boat (θ∗ = 56.4, PSNR = 33.4) and Mandrill (θ∗ = 2.04, PSNR = 25.3) test images, with automatic selection of θ by marginalisation; see (7).

SLIDE 47

Comparison with SUGAR (Deledalle et al., 2014a) and with the MSE oracle.

Table: Boat.
Method           θ      PSNR   SSIM   time [sec]
Marginal MAP     56.4   33.4   0.96   299
SUGAR            1.10   18.4   0.55   1137
MSE Oracle       38.2   33.5   0.96   n/a
Least-squares    n/a    17.7   0.52   0.04

Table: Mandrill.
Method           θ      PSNR   SSIM   time [sec]
Marginal MAP     2.04   25.3   0.87   229
SUGAR            0.95   22.9   0.80   984
MSE Oracle       4.65   26.0   0.90   n/a
Least-squares    n/a    18.6   0.22   0.04

SLIDE 48

Figure: [Left] Estimation PSNR as a function of θ. [Right] Evolution of the iterates θ(t) for the Bayesian method (left axis) and for SUGAR (right axis).

SLIDE 49

Illustrative example - Image deblurring with TV prior

In a manner akin to Fernandez-Vidal and Pereyra (2018), we also apply the method to the Bayesian image deblurring model p(x∣y,θ) ∝ exp(−∥y − Ax∥²/2σ² − θ∥∇dx∥1−2), and compute θ̂ = argmax_{θ∈R+} p(y∣θ). We obtain the following results:

Method            SNR = 20 dB                SNR = 30 dB                SNR = 40 dB
                  Avg. MSE       Avg. Time   Avg. MSE       Avg. Time   Avg. MSE       Avg. Time
θ∗ (Oracle)       22.95 ± 3.10   –           21.05 ± 3.19   –           18.76 ± 3.19   –
Marginalization   24.67 ± 3.08   17.27       22.39 ± 3.07   6.31        19.44 ± 3.26   6.77
Empirical Bayes   23.24 ± 3.23   43.01       21.16 ± 3.24   41.50       18.90 ± 3.39   42.85
SUGAR             24.14 ± 3.19   15.74       23.96 ± 3.26   20.87       23.94 ± 3.27   20.59

Table: Comparison with the empirical Bayesian method (Fernandez-Vidal and Pereyra, 2018), the SUGAR method (Deledalle et al., 2014b), and an oracle that knows the optimal value of θ. Average values over 10 test images of size 512 × 512 pixels.

An exhaustive evaluation comparing different methods on a range of imaging problems will be reported soon.

SLIDE 50

Outline

1. Bayesian inference in imaging inverse problems
2. MAP estimation with Bayesian confidence regions
   • Posterior credible regions
   • Uncertainty visualisation
   • Hypothesis testing
3. A decision-theoretic derivation of MAP estimation
4. Hierarchical MAP estimation with unknown regularisation parameters
5. Conclusion

SLIDE 51

Conclusion

The challenges facing modern imaging sciences require a methodological paradigm shift to go beyond point estimation. In Part I we discussed how the Bayesian framework can support this paradigm shift, provided we significantly accelerate computations. In Part II we considered efficiency improvements by integrating modern stochastic and variational computation approaches. In Part III we explored methods based on convex optimisation and probability, and developed theory for MAP estimation.

SLIDE 52

Conclusion

In the next lecture... We will explore Bayesian models and stochastic computation algorithms for problems that are significantly more difficult, and where deterministic approaches fail.

Thank you!

SLIDE 53

Bibliography:

Ay, N. and Amari, S.-I. (2015). A novel approach to canonical divergences within information geometry. Entropy, 17(12):7866.

Cai, X., Pereyra, M., and McEwen, J. D. (2017). Uncertainty quantification for radio interferometric imaging II: MAP estimation. ArXiv e-prints.

Chambolle, A. and Pock, T. (2016). An introduction to continuous optimization for imaging. Acta Numerica, 25:161–319.

Deledalle, C.-A., Vaiter, S., Peyré, G., and Fadili, J. (2014a). Stein Unbiased GrAdient estimator of the Risk (SUGAR) for multiple parameter selection. SIAM J. Imaging Sci., 7(4):2448–2487.

Deledalle, C.-A., Vaiter, S., Fadili, J., and Peyré, G. (2014b). Stein unbiased gradient estimator of the risk (SUGAR) for multiple parameter selection. SIAM J. Imaging Sci., 7(4):2448–2487.

Fernandez-Vidal, A. and Pereyra, M. (2018). Maximum likelihood estimation of regularisation parameters. In Proc. IEEE ICIP 2018.

Green, P. J., Latuszyński, K., Pereyra, M., and Robert, C. P. (2015). Bayesian computation: a summary of the current state, and samples backwards and forwards. Statistics and Computing, 25(4):835–862.

Pereyra, M. (2016). Maximum-a-posteriori estimation with Bayesian confidence regions. SIAM J. Imaging Sci., 6(3):1665–1688.

SLIDE 54

Pereyra, M. (2016). Revisiting maximum-a-posteriori estimation in log-concave models: from differential geometry to decision theory. ArXiv e-prints.

Pereyra, M., Bioucas-Dias, J., and Figueiredo, M. (2015). Maximum-a-posteriori estimation with unknown regularisation parameters. In Proc. Europ. Signal Process. Conf. (EUSIPCO) 2015.

Repetti, A., Pereyra, M., and Wiaux, Y. (2018). Scalable Bayesian uncertainty quantification in imaging inverse problems via convex optimisation. ArXiv e-prints.

Robert, C. P. (2001). The Bayesian Choice (second edition). Springer Verlag, New York.

Zhu, L., Zhang, W., Elnatan, D., and Huang, B. (2012). Faster STORM using compressed sensing. Nat. Meth., 9(7):721–723.
