Plug-and-Play Methods Provably Converge with Properly Trained - PowerPoint PPT Presentation

Plug-and-Play Methods Provably Converge with Properly Trained Denoisers Ernest K. Ryu 1 Sicheng Wang 2 Jialin Liu 1 Xiaohan Chen 2 Zhangyang Wang 2 Wotao Yin 1 2019 International Conference on Machine Learning 1 UCLA Mathematics 2 Texas A&M Computer Science and Engineering

Image processing via optimization Consider recovering or denoising an image through the optimization minimize f ( x ) + γg ( x ) , x ∈ R d ◮ x is image ◮ f ( x ) is data fidelity (a posteriori knowledge) ◮ g ( x ) is noisiness of the image (a priori knowledge) ◮ γ ≥ 0 is relative importance between f and g 2

Image processing via ADMM We often use first-order methods, such as ADMM x k +1 = argmin σ 2 g ( x ) + (1 / 2) � x − ( y k − u k ) � 2 � � x ∈ R d y k +1 = argmin αf ( y ) + (1 / 2) � y − ( x k +1 + u k ) � 2 � � y ∈ R d u k +1 = u k + x k +1 − y k +1 with σ 2 = αγ . 3

Image processing via ADMM More concise notation x k +1 = Prox σ 2 g ( y k − u k ) y k +1 = Prox αf ( x k +1 + u k ) u k +1 = u k + x k +1 − y k +1 . The proximal operator of h is � αh ( x ) + (1 / 2) � x − z � 2 � Prox αh ( z ) = argmin . x ∈ R d (Well-defined if h is proper, closed, and convex.) 4

Interpretations of ADMM subroutines The subroutine Prox σ 2 g : R d → R d is a denoiser, i.e., Prox σ 2 g : noisy image �→ less noisy image Prox αf : R d → R d enforces consistency with measured data, i.e., Prox αf : less consistent �→ more consistent with data 5

Other denoisers However, some state-of-the-art image denoisers do not originate from optimization problems. (E.g. NLM, BM3D, and CNN.) Nevertheless, such a denoiser H σ : R d → R d still has the interpretation H σ : noisy image �→ less noisy image where σ ≥ 0 is a noise parameter. It is possible to integrate such denoisers with existing algorithms such as ADMM or proximal gradient? 6

Plug and play! To address this question, Venkatakrishnan et al. 3 proposed Plug-and-Play ADMM (PnP-ADMM), which simply replaces the proximal operator Prox σ 2 g with the denoiser H σ : x k +1 = H σ ( y k − u k ) y k +1 = Prox αf ( x k +1 + u k ) u k +1 = u k + x k +1 − y k +1 . Surprisingly and remarkably, this ad-hoc method exhibited great empirical success, and spurred much follow-up work. 3 Venkatakrishnan, Bouman, and Wohlberg, Plug-and-play priors for model based reconstruction, IEEE GlobalSIP, 2013. 7

Plug and play! By integrating modern denoising priors into ADMM or other proximal algorithms, PnP combines the advantages of data-driven operators and classic optimization. In image denoising, PnP replaces total variation regularization with an explicit denoiser such as BM3D or deep learning-based denoisers. PnP is suitable when end-to-end training is impossible (e.g. due to insufficient data or time). 8

Example: Poisson denoising Corrupted image Other method PnP-ADMM with BM3D Rond, Giryes, and Elad, J. Vis. Commun. Image R. 2016.

Example: Inpainting Original image 5% random sampling Sreehari et al., IEEE Trans. Comput. Imag., 2016.

Example: Inpainting Other method PnP-ADMM with NLM Sreehari et al., IEEE Trans. Comput. Imag., 2016.

Example: Super resolution Low resolution input Other method Other method Other method Other method Other method PnP-ADMM with BM3D Chan, Wang, Elgendy, IEEE Trans. Comput. Imag., 2017.

Example: Single photon imaging Corrupted image other method other method PnP-ADMM with BM3D Chan, Wang, Elgendy, IEEE Trans. Comput. Imag., 2017.

Contribution of this work The empirical success of Plug-and-Play (PnP) naturally leads us to ask theoretical questions: When does PnP converge and what denoisers can we use? ◮ We prove convergence of PnP methods under a certain Lipschitz condition. ◮ We propose real spectral normalization, a technique for constraining deep learning-based denoisers in their training to enforce the proposed Lipschitz condition. ◮ We present experimental results validating our theory. 4 4 Code available at: https://github.com/uclaopt/Provable_Plug_and_Play/ 9

Outline PNP-FBS/ADMM and their fixed points Convergence via contraction Real spectral normalization: Enforcing Assumption (A) Experimental validation PNP-FBS/ADMM and their fixed points 10

PnP FBS Plug-and-play forward-backward splitting: x k +1 = H σ ( I − α ∇ f )( x k ) (PNP-FBS) where α > 0 . PNP-FBS/ADMM and their fixed points 11

PnP FBS PNP-FBS is a fixed-point iteration, and x ⋆ is a fixed point if x ⋆ = H σ ( I − α ∇ f )( x ⋆ ) . Interpretation of fixed points: A compromise between making the image agree with measurements and making the image less noisy. PNP-FBS/ADMM and their fixed points 12

PnP ADMM Plug-and-play alternating directions method of multipliers: x k +1 = H σ ( y k − u k ) y k +1 = Prox αf ( x k +1 + u k ) (PNP-ADMM) u k +1 = u k + x k +1 − y k +1 where α > 0 . PNP-FBS/ADMM and their fixed points 13

PnP ADMM PNP-ADMM is a fixed-point iteration, and ( x ⋆ , u ⋆ ) is a fixed point if x ⋆ = H σ ( x ⋆ − u ⋆ ) x ⋆ = Prox αf ( x ⋆ + u ⋆ ) . PNP-FBS/ADMM and their fixed points 14

PnP DRS Plug-and-play Douglas–Rachford splitting: x k +1 / 2 = Prox αf ( z k ) x k +1 = H σ (2 x k +1 / 2 − z k ) (PNP-DRS) z k +1 = z k + x k +1 − x k +1 / 2 where α > 0 . We can write PNP-DRS as z k +1 = T ( z k ) with T = 1 2 I + 1 2(2 H σ − I )(2Prox αf − I ) . PNP-ADMM and PNP-DRS are equivalent. We analyze convergence of PNP-DRS and translate the result to PNP-ADMM. PNP-FBS/ADMM and their fixed points 15

PnP DRS PNP-DRS is a fixed-point iteration, and z ⋆ is a fixed point if x ⋆ = Prox αf ( z ⋆ ) x ⋆ = H σ (2 x ⋆ − z ⋆ ) . PNP-FBS/ADMM and their fixed points 16

Outline PNP-FBS/ADMM and their fixed points Convergence via contraction Real spectral normalization: Enforcing Assumption (A) Experimental validation Convergence via contraction 17

What we do not assume If we assume 2 H σ − I is nonexpansive, standard tools of monotone operator theory tell us that PnP-ADMM converges. However, this assumption is unrealistic 5 so we do not assume it. We do not assume H σ is continuously differentiable. 5 Chan, Wang, and Elgendy, Plug-and-Play ADMM for Image Restoration: Fixed-Point Convergence and Applications, IEEE TCI, 2017. Convergence via contraction 18

Main assumption Rather, we assume H σ : R d → R d satisfies � ( H σ − I )( x ) − ( H σ − I )( y ) � ≤ ε � x − y � (A) for all x, y ∈ R d for some ε ≥ 0 . Since σ controls the strength of the denoising, we can expect H σ to be close to identity for small σ . If so , Assumption (A) is reasonable. Convergence via contraction 19

Contractive operators Under (A), we show PNP-FBS and PNP-DRS are contractive iterations in the sense that we can express the iterations as x k +1 = T ( x k ) , where T : R d → R d satisfies � T ( x ) − T ( y ) � ≤ δ � x − y � for all x, y ∈ R d for some δ < 1 . If x ⋆ satisfies T ( x ⋆ ) = x ⋆ , i.e., x ⋆ is a fixed point, then x k → x ⋆ geometrically by the classical Banach contraction principle. Convergence via contraction 20

Convergence of PNP-FBS Theorem Assume H σ satisfies assumption (A) for some ε ≥ 0 . Assume f is µ -strongly convex, f is differentiable, and ∇ f is L -Lipschitz. Then T = H σ ( I − α ∇ f ) satisfies � T ( x ) − T ( y ) � ≤ max {| 1 − αµ | , | 1 − αL |} (1 + ε ) � x − y � for all x, y ∈ R d . The coefficient is less than 1 if µ (1 + 1 /ε ) < α < 2 1 1 L − L (1 + 1 /ε ) . Such an α exists if ε < 2 µ/ ( L − µ ) . Convergence via contraction 21

Convergence of PNP-DRS Theorem Assume H σ satisfies assumption (A) for some ε ≥ 0 . Assume f is µ -strongly convex and differentiable. Then T = 1 2 I + 1 2(2 H σ − I )(2Prox αf − I ) satisfies � T ( x ) − T ( y ) � ≤ 1 + ε + εαµ + 2 ε 2 αµ � x − y � 1 + αµ + 2 εαµ for all x, y ∈ R d . The coefficient is less than 1 if ε (1 + ε − 2 ε 2 ) µ < α, ε < 1 . Convergence via contraction 22

Convergence of PNP-ADMM Corollary Assume H σ satisfies assumption (A) for some ε ∈ [0 , 1) . Assume f is µ -strongly convex. Then PNP-ADMM converges for ε (1 + ε − 2 ε 2 ) µ < α. Convergence via contraction 23

PnP-FBS vs. PnP-ADMM PNP-FBS and PNP-ADMM share the same fixed points 6 7 . They are distinct methods for finding the same set of fixed points. PNP-FBS is easier to implement as it requires ∇ f rather than Prox αf . PNP-ADMM has better convergence properties as demonstrated by Theorems 1 and 2 and our experiments. 6 Meinhardt, Moeller, Hazirbas, and Cremers, Learning proximal operators: Using denoising networks for regularizing inverse imaging problems. ICCV, 2017. 7 Sun, Wohlberg, and Kamilov, An online plug-and-play algorithm for regularized image reconstruction. IEEE TCI, 2019. Convergence via contraction 24

Plug-and-Play Methods Provably Converge with Properly Trained - PowerPoint PPT Presentation

Plug-and-Play Methods Provably Converge with Properly Trained Denoisers Ernest K. Ryu 1 Sicheng Wang 2 Jialin Liu 1 Xiaohan Chen 2 Zhangyang Wang 2 Wotao Yin 1 2019 International Conference on Machine Learning 1 UCLA Mathematics 2 Texas A&M

Introducing OSGi Eclipse Plug-ins 1 Plug-in State Information Plug-in Structure

WALL PLUG PLUG-IN SWITCH & POWER METERING MANY FEATURES IN A SINGLE DEVICE The FIBARO Wall

PDR900 Controller PDR900 Controller Plug and play readout for Plug and play readout for 900

Plug-In Folly Part 4 by Pat Murphy, Plan Curtail PART 4A: The Plug-In Hybrid Car Inventing the

Does God play dice with the cell? Does God play dice with the cell? Does God play dice with the

Provably secure hash functions - do we care? Krystian Matusiewicz Technical University of Denmark

Second Generation Model-based Testing Provably Strong Testing Methods for the Certification of

PLUG PLUG Presentation Layer Universal Generator Presentation Layer Universal Generator

Plug-In Folly Part 5 by Pat Murphy, Plan Curtail Part 5A: Conclusion Plug-In Vehicles the

Universal Plug and Play Eventing Vulnerabilities Joeri Blokhuis February 4, 2009 Joeri Blokhuis

FIBARO Wall Plug The smallest in the world Home intelligence Most sophisticated device FIBARO

How to Properly Maintain How to Properly Maintain Your Water Well and Avoid Your Water Well and

How to Properly Prepare a Certificate of 09/19/2018 Indepndent Review MPBF) How to Properly

Converge Cornerstone Fund The Impact of COVID-19 and Ministry Lending S T A R T . S T R E N G T

Welcome to the CONVERGE Virtual Forum COVID-19 Working Groups for Public Health and Social

A Set that is Streamless and Not Provably Noetherian Marc Bezem Department of Informatics

Lecture 7.6: Rings of fractions Matthew Macauley Department of Mathematical Sciences Clemson

A Network Coding Approach A Network Coding Approach to IP Traceback Pegah Sattari, Minas Gjokas,

Deep Approximation via Deep Learning Zuowei Shen Department of Mathematics National University

CS 6316 Machine Learning Dimensionality Reduction Yangfeng Ji Department of Computer Science

On the Quantitative Hardness of CVP Huck Bennett, Alexander Golovnev, Noah Stephens-Davidowitz

PROPOSED CAPITAL IMPROVEMENT PROGRAM FY2020 FY2024 CAPITAL IMPROVEMENT PROGRAM (CIP)

The Slide Master and Sections for Organization: Inserting, Deleting, and Moving Around Slides and

EDUCATION 1 50-930C, 5/20/0 How To Use GoToWebinar 2 employer_education@strsoh.org 1 STATE

Plug-and-Play Methods Provably Converge with Properly Trained - PowerPoint PPT Presentation

Plug-and-Play Methods Provably Converge with Properly Trained Denoisers Ernest K. Ryu 1 Sicheng Wang 2 Jialin Liu 1 Xiaohan Chen 2 Zhangyang Wang 2 Wotao Yin 1 2019 International Conference on Machine Learning 1 UCLA Mathematics 2 Texas A&M

Introducing OSGi Eclipse Plug-ins 1 Plug-in State Information Plug-in Structure

WALL PLUG PLUG-IN SWITCH &amp; POWER METERING MANY FEATURES IN A SINGLE DEVICE The FIBARO Wall

PDR900 Controller PDR900 Controller Plug and play readout for Plug and play readout for 900

Plug-In Folly Part 4 by Pat Murphy, Plan Curtail PART 4A: The Plug-In Hybrid Car Inventing the

Does God play dice with the cell? Does God play dice with the cell? Does God play dice with the

Provably secure hash functions - do we care? Krystian Matusiewicz Technical University of Denmark

Second Generation Model-based Testing Provably Strong Testing Methods for the Certification of

PLUG PLUG Presentation Layer Universal Generator Presentation Layer Universal Generator

Plug-In Folly Part 5 by Pat Murphy, Plan Curtail Part 5A: Conclusion Plug-In Vehicles the

Universal Plug and Play Eventing Vulnerabilities Joeri Blokhuis February 4, 2009 Joeri Blokhuis

FIBARO Wall Plug The smallest in the world Home intelligence Most sophisticated device FIBARO

How to Properly Maintain How to Properly Maintain Your Water Well and Avoid Your Water Well and

How to Properly Prepare a Certificate of 09/19/2018 Indepndent Review MPBF) How to Properly

Converge Cornerstone Fund The Impact of COVID-19 and Ministry Lending S T A R T . S T R E N G T

Welcome to the CONVERGE Virtual Forum COVID-19 Working Groups for Public Health and Social

A Set that is Streamless and Not Provably Noetherian Marc Bezem Department of Informatics

Lecture 7.6: Rings of fractions Matthew Macauley Department of Mathematical Sciences Clemson

A Network Coding Approach A Network Coding Approach to IP Traceback Pegah Sattari, Minas Gjokas,

Deep Approximation via Deep Learning Zuowei Shen Department of Mathematics National University

CS 6316 Machine Learning Dimensionality Reduction Yangfeng Ji Department of Computer Science

On the Quantitative Hardness of CVP Huck Bennett, Alexander Golovnev, Noah Stephens-Davidowitz

PROPOSED CAPITAL IMPROVEMENT PROGRAM FY2020 FY2024 CAPITAL IMPROVEMENT PROGRAM (CIP)

The Slide Master and Sections for Organization: Inserting, Deleting, and Moving Around Slides and

EDUCATION 1 50-930C, 5/20/0 How To Use GoToWebinar 2 employer_education@strsoh.org 1 STATE

WALL PLUG PLUG-IN SWITCH & POWER METERING MANY FEATURES IN A SINGLE DEVICE The FIBARO Wall