Some applications of proximal methods


SLIDE 1

GENERAL CONTEXT PROXIMAL TOOLS APPLICATIONS CONCLUSION

Some applications of proximal methods

Caroline CHAUX

Joint work with P. L. Combettes, L. Duval, J.-C. Pesquet and N. Pustelnik

LATP - UMR CNRS 7353, Aix-Marseille Université, France

OSL 2013, Les Houches, 7-11 Jan. 2013

1 / 34

SLIDES 2-7

Direct problem: z = D_α(L y)

• z: observations (e.g. a 2D signal of size M = M1 × M2);
• y: original signal (unknown, of size N);
• L: linear operator (matrix of size M × N);
• D_α: perturbation of parameter α.

Objective (inverse problem): find an estimate ŷ of y from the observations z.
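The direct model z = D_α(L y) can be sketched numerically. Everything concrete below is an illustrative assumption, not the talk's actual setup: a 16×16 signal, a 3×3 moving-average blur standing in for L, and additive Gaussian noise of standard deviation α standing in for the perturbation D_α.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: a small 2D signal y of size N = 16 x 16, observed
# through a linear operator L (a 3x3 moving-average blur written as an
# explicit M x N matrix) and a perturbation D_alpha (additive Gaussian
# noise of standard deviation alpha).
n1 = n2 = 16
N = n1 * n2
y = rng.random(N)                        # unknown original signal

def blur_matrix(n1, n2):
    """Explicit matrix of a 3x3 moving-average blur (zero padding)."""
    L = np.zeros((n1 * n2, n1 * n2))
    for i in range(n1):
        for j in range(n2):
            row = i * n2 + j
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii, jj = i + di, j + dj
                    if 0 <= ii < n1 and 0 <= jj < n2:
                        L[row, ii * n2 + jj] = 1.0 / 9.0
    return L

L = blur_matrix(n1, n2)                  # M x N (here M = N)
alpha = 0.01
z = L @ y + alpha * rng.standard_normal(N)   # observations z = D_alpha(L y)
```

Recovering y from z is the inverse problem addressed in the rest of the slides.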

SLIDE 8

FRAME REPRESENTATION

Frame coefficients (x) ↔ Original (y = F^* x)

• x ∈ R^K: frame coefficients of the original image y ∈ R^N;
• F^*: R^K → R^N: frame synthesis operator such that

∃(ν, ν̄) ∈ ]0, +∞[², ν Id ≤ F^* ∘ F ≤ ν̄ Id (tight frame when ν = ν̄),

with y = F^* x.

[L. Jacques et al., 2011]
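The frame condition ν Id ≤ F^* ∘ F ≤ ν̄ Id can be checked numerically on a toy tight frame. The construction below (the union of two orthonormal bases of R^N, one of them an arbitrary orthonormal matrix Q) is an illustrative assumption; for it, F^* ∘ F = 2 Id, i.e. a tight frame with ν = ν̄ = 2.

```python
import numpy as np

rng = np.random.default_rng(1)
N = 8

# Toy tight frame: analysis operator F stacking two orthonormal bases
# of R^N (the canonical basis and an arbitrary orthonormal Q).
Q, _ = np.linalg.qr(rng.standard_normal((N, N)))
F = np.vstack([np.eye(N), Q.T])      # F: R^N -> R^K with K = 2N

G = F.T @ F                          # F* o F, an N x N matrix
eigs = np.linalg.eigvalsh(G)
nu_lower, nu_upper = eigs.min(), eigs.max()
print(nu_lower, nu_upper)            # both close to 2 (tight frame)
```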

SLIDES 9-15

VARIATIONAL APPROACH

minimize over x ∈ H: Σ_{j=1}^J f_j(L_j x)

where (f_j)_{1≤j≤J} are functions in the class Γ0(G_j) (the class of l.s.c. proper convex functions on G_j taking their values in ]−∞, +∞]) and where, for every j ∈ {1, ..., J}, L_j: H → G_j is a bounded linear operator, the (G_j)_{1≤j≤J} denoting Hilbert spaces. This criterion can be nondifferentiable.

• f_j can be related to the noise (e.g. a quadratic term when the noise is Gaussian);
• f_j can be related to some a priori on the target solution (e.g. an a priori on the wavelet-coefficient distribution);
• f_j can be related to a constraint (e.g. a support constraint);
• L_j can model a blur operator;
• L_j can model a gradient operator (e.g. total variation);
• L_j can model a frame operator.
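A composite criterion of this form can be assembled as a list of (f_j, L_j) pairs. The sketch below is a toy instance under stated assumptions (a stand-in "blur" matrix A, a quadratic data-fidelity term for Gaussian noise, and an ℓ1 sparsity term with L_2 = Id); it only evaluates the criterion, it does not minimize it.

```python
import numpy as np

# Toy composite criterion sum_j f_j(L_j x):
#   f_1 = 0.5 ||. - z||^2 with L_1 = A (data fidelity, Gaussian noise),
#   f_2 = lam * ||.||_1  with L_2 = Id (sparsity prior).
# Sizes and values are illustrative.
rng = np.random.default_rng(2)
N = 32
A = np.eye(N) + 0.1 * rng.standard_normal((N, N))   # stand-in "blur"
z = rng.standard_normal(N)
lam = 0.5

terms = [
    (lambda v: 0.5 * np.sum((v - z) ** 2), A),        # (f_1, L_1)
    (lambda v: lam * np.sum(np.abs(v)), np.eye(N)),   # (f_2, L_2)
]

def criterion(x):
    """Evaluate sum_j f_j(L_j x)."""
    return sum(f(L @ x) for f, L in terms)

print(criterion(np.zeros(N)))   # at x = 0 this equals 0.5 * ||z||^2
```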

SLIDES 16-19

ANALYSIS VS. SYNTHESIS APPROACH

When frame decompositions are considered, the problem can be formulated in a:

Synthesis Form (SF): minimize over x ∈ R^K: Σ_{r=1}^R f_r(L_r F^* x) + Σ_{s=1}^S g_s(x)

Analysis Form (AF): minimize over y ∈ R^N: Σ_{r=1}^R f_r(L_r y) + Σ_{s=1}^S g_s(F y)

Inclusion: AF is a particular case of SF [Chaâri et al., 2009].

Equivalence: the two forms are equivalent when F is an orthonormal transform.

SLIDES 20-22

PROXIMAL APPROACHES

The proximity operator of φ ∈ Γ0(H) is defined as

prox_φ: H → H: u ↦ argmin_{v∈H} (1/2)‖v − u‖² + φ(v).

Remark: if C is a nonempty closed convex subset of H and ι_C denotes the indicator function of C, i.e. (∀u ∈ H) ι_C(u) = 0 if u ∈ C and +∞ otherwise, then prox_{ι_C} reduces to the projection Π_C onto C.

Let φ ∈ Γ0(G) and L: H → G be a bounded linear operator. Suppose L L^* = χ Id for some χ ∈ ]0, +∞[. Then

prox_{φ∘L} = Id + χ^{−1} L^*(prox_{χφ} − Id) L.
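Two of the closed-form proximity operators used throughout the talk can be sketched directly: the prox of γ‖·‖₁ (componentwise soft thresholding) and the prox of the indicator of a box, which is just the projection onto it. The box bounds and test vector below are illustrative.

```python
import numpy as np

def prox_l1(u, gamma):
    """prox of gamma * ||.||_1: componentwise soft thresholding."""
    return np.sign(u) * np.maximum(np.abs(u) - gamma, 0.0)

def proj_box(u, lo, hi):
    """prox of the indicator of C = [lo, hi]^N, i.e. the projection onto C."""
    return np.clip(u, lo, hi)

u = np.array([-2.0, -0.3, 0.0, 0.7, 3.0])
v = prox_l1(u, 1.0)        # shrinks each entry toward 0 by 1
w = proj_box(u, 0.0, 1.0)  # clips each entry into [0, 1]
```

By definition, v minimizes (1/2)‖·−u‖² + ‖·‖₁, which can be verified by perturbing each coordinate.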

SLIDE 23

Minimize over x: Σ_{j=1}^J f_j(x)

• When J = 2: Forward-Backward algorithm [Figueiredo and Nowak, 2003], [Bect et al., 2004], [Daubechies et al., 2004], [Combettes and Wajs, 2005], [Chaux et al., 2007], [Beck and Teboulle, 2009]; Douglas-Rachford algorithm [Lions and Mercier, 1979], [Combettes and Pesquet, 2007].
• When J > 2: Parallel ProXimal Algorithm (PPXA) [Combettes and Pesquet, 2008].
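For J = 2 with one smooth term, the forward-backward iteration alternates a gradient step on the smooth term and a prox step on the other. Below is a minimal sketch on a toy lasso-type problem (f1 = 0.5‖Ax − z‖² smooth, f2 = λ‖x‖₁ proximable); the data, λ, and iteration count are illustrative assumptions.

```python
import numpy as np

# Minimal forward-backward (ISTA-style) sketch for J = 2:
#   minimize 0.5 ||A x - z||^2 + lam ||x||_1.
rng = np.random.default_rng(3)
M, N = 20, 10
A = rng.standard_normal((M, N))
z = rng.standard_normal(M)
lam = 0.1

def soft(u, t):
    return np.sign(u) * np.maximum(np.abs(u) - t, 0.0)

gamma = 1.0 / np.linalg.norm(A.T @ A, 2)   # step in ]0, 2/L[, L = ||A^T A||
x = np.zeros(N)
for _ in range(5000):
    grad = A.T @ (A @ x - z)                 # forward (gradient) step on f_1
    x = soft(x - gamma * grad, gamma * lam)  # backward (prox) step on f_2
```

At convergence x satisfies the lasso optimality conditions, which gives a direct numerical check.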

SLIDES 24-25

PPXA+: minimize over u ∈ H: Σ_{j=1}^J f_j(L_j u)

Initialization:
• (ǫ_j)_{1≤j≤J} ∈ [0, 1[^J, (ω_j)_{1≤j≤J} ∈ ]0, +∞[^J, (λ_n)_{n∈N} a sequence of reals;
• (z_j^[0])_{1≤j≤J} and (p_j^[−1])_{1≤j≤J} in G_1 × · · · × G_J;
• for every j ∈ {1, ..., J}, (a_j^[n])_{n∈N} a sequence of error terms;
• u^[0] = argmin_{u∈H} Σ_{j=1}^J ω_j ‖L_j u − z_j^[0]‖².

For n = 0, 1, ...
  For j = 1, ..., J:
    p_j^[n] = prox_{(1−ǫ_j) f_j / ω_j}((1 − ǫ_j) z_j^[n] + ǫ_j p_j^[n−1]) + a_j^[n]
  c^[n] = argmin_{u∈H} Σ_{j=1}^J ω_j ‖L_j u − p_j^[n]‖²
  For j = 1, ..., J:
    z_j^[n+1] = z_j^[n] + λ_n (L_j(2 c^[n] − u^[n]) − p_j^[n])
  u^[n+1] = u^[n] + λ_n (c^[n] − u^[n])
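The structure of the iteration can be sketched in its simplest special case, PPXA (ǫ_j = 0 and L_j = Id, see the next slides), for two proximable functions. The toy problem below (two quadratics whose minimizer is (a + b)/2), the weights, and the relaxation parameter are illustrative assumptions.

```python
import numpy as np

# Minimal PPXA sketch (PPXA+ with eps_j = 0, L_j = Id):
#   minimize f1(x) + f2(x), f1 = 0.5 (x - a)^2, f2 = 0.5 (x - b)^2,
# whose minimizer is (a + b) / 2.
a, b = 0.0, 1.0
w = np.array([0.5, 0.5])     # weights omega_j, summing to 1
lam = 1.0                    # relaxation lambda_n

def prox_quad(v, c, gamma):
    """prox of gamma * 0.5 (. - c)^2: (v + gamma c) / (1 + gamma)."""
    return (v + gamma * c) / (1.0 + gamma)

z = np.zeros(2)              # auxiliary variables z_j
u = w @ z                    # u = sum_j omega_j z_j
for _ in range(200):
    # p_j = prox_{f_j / omega_j}(z_j); here f_j / omega_j = 2 f_j
    p = np.array([prox_quad(z[0], a, 2.0), prox_quad(z[1], b, 2.0)])
    c = w @ p                # weighted average of the p_j
    z = z + lam * (2.0 * c - u - p)
    u = u + lam * (c - u)
print(u)                     # close to 0.5
```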

SLIDE 26

PPXA+ CONVERGENCE

Proposition [Pesquet and Pustelnik, 2012]. The weak convergence of the sequence (u^[n])_{n∈N} to a minimizer of Σ_{j=1}^J f_j ∘ L_j is established under the following assumptions:

1. 0 ∈ sri {(L_1 v − w_1, ..., L_J v − w_J) | v ∈ H, w_1 ∈ dom f_1, ..., w_J ∈ dom f_J};
2. there exists λ ∈ ]0, 2[ such that (∀n ∈ N) λ ≤ λ_{n+1} ≤ λ_n;
3. for every j ∈ {1, ..., J}, (a_j^[n])_{n∈N} is an absolutely summable sequence;
4. Σ_{j=1}^J ω_j L_j^* L_j is an isomorphism (the PPXA+ iterations can be slightly modified to avoid this assumption).

SLIDE 27

PPXA+: A GENERAL FRAMEWORK

1. PPXA [Combettes and Pesquet, 2008, Algorithm 3.1] is the special case of PPXA+ in which ǫ_1 = · · · = ǫ_J = 0, G_1 = · · · = G_J = H, and L_1 = · · · = L_J = Id.
2. The SDMM algorithm derived from Douglas-Rachford in [Setzer et al., 2010] is the special case in which ǫ_1 = · · · = ǫ_J = 0, ω_1 = · · · = ω_J, λ_n ≡ 1 and (a_j^[n])_{1≤j≤J} ≡ (0, ..., 0).
3. The algorithm introduced in [Attouch and Soueycatt, 2009] is the special case in which ǫ_1 = · · · = ǫ_J = α/(1 + α) and (a_j^[n])_{1≤j≤J} ≡ (0, ..., 0).

SLIDE 28

OTHER PROXIMAL APPROACHES: minimize over x: Σ_{j=1}^J f_j(L_j x)

• Parallel ProXimal Algorithm+ (PPXA+) [Pesquet and Pustelnik, 2012]: in the same spirit as PPXA, requires computing each prox_{f_j}; quadratic minimizations must be performed in the initialization step and in the computation of one intermediate variable, i.e. a large-size linear operator must be inverted.
• Generalized Forward-Backward [Raguet et al., 2012].
• Primal-dual approaches:
  • M+SFBF [Briceño-Arias and Combettes, 2011]: requires computing each prox_{f_j}; the algorithm stepsize depends on the L_j.
  • M+LFBF [Combettes and Pesquet, 2011]: one function f_{j0} may have a Lipschitz gradient; requires computing the gradient of f_{j0} and each prox_{f_j} for j ≠ j0; the algorithm stepsize depends on the L_j.
• FB-based algorithms [Chambolle and Pock, 2011], [Vũ, 2013], [Condat, 2013].

SLIDE 29

CONSTRAINED FORMULATION

Minimize over x ∈ H: Σ_{r=1}^R g_r(T_r x)  subject to  H_1 x ∈ C_1, ..., H_S x ∈ C_S,

where
• H: real Hilbert space;
• Γ0(H): class of proper, l.s.c., convex functions from H to ]−∞, +∞];
• (∀s ∈ {1, ..., S}) H_s: H → R^{Q_s} is a bounded linear operator;
• (∀s ∈ {1, ..., S}) C_s is a nonempty closed convex subset of R^{Q_s};
• (∀r ∈ {1, ..., R}) T_r: H → R^{N_r} is a bounded linear operator;
• (∀r ∈ {1, ..., R}) g_r ∈ Γ0(R^{N_r}).

SLIDE 30

CONSTRAINED FORMULATION

Under technical assumptions, the sequence (x^[n])_{n∈N} generated by M+SFBF [Briceño-Arias and Combettes, 2011] converges to x̂:

For n = 0, 1, ...
  x^[n] = Σ_{r=1}^R ω_r u_r^[n] + Σ_{s=1}^S ω_s u_s^[n]
  For r = 1, ..., R:
    w_{1,r}^[n] = u_r^[n] − γ_n T_r^* v_r^[n]
    w_{2,r}^[n] = v_r^[n] + γ_n T_r u_r^[n]
  For s = 1, ..., S:
    w_{1,s}^[n] = u_s^[n] − γ_n H_s^* v_s^[n]
    w_{2,s}^[n] = v_s^[n] + γ_n H_s u_s^[n]
  p_1^[n] = Σ_{r=1}^R ω_r w_{1,r}^[n] + Σ_{s=1}^S ω_s w_{1,s}^[n]
  For r = 1, ..., R:
    p_{2,r}^[n] = w_{2,r}^[n] − (γ_n/ω_r) prox_{(ω_r/γ_n) g_r}((ω_r/γ_n) w_{2,r}^[n])   ← proximity-operator computation
    q_{1,r}^[n] = p_1^[n] − γ_n T_r^* p_{2,r}^[n]
    q_{2,r}^[n] = p_{2,r}^[n] + γ_n T_r p_1^[n]
    Update u_r^[n+1] and v_r^[n+1]
  For s = 1, ..., S:
    p_{2,s}^[n] = w_{2,s}^[n] − (γ_n/ω_s) Π_{C_s}((ω_s/γ_n) w_{2,s}^[n])   ← projection computation
    q_{1,s}^[n] = p_1^[n] − γ_n H_s^* p_{2,s}^[n]
    q_{2,s}^[n] = p_{2,s}^[n] + γ_n H_s p_1^[n]
    Update u_s^[n+1] and v_s^[n+1]

SLIDES 31-35

CONSTRAINED FORMULATION

(∀x ∈ H) H_s x ∈ C_s ⇔ h_s(H_s x) ≤ η_s

Dropping the index s: (∀u ∈ R^Q) u ∈ C ⇔ h(u) ≤ η.

Splitting u into L blocks, u = [(u^(1))^⊤, ..., (u^(L))^⊤]^⊤ ∈ R^Q with block u^(ℓ) of size Q^(ℓ):

u ∈ C ⇔ Σ_{ℓ=1}^L h^(ℓ)(u^(ℓ)) ≤ η

→ Any closed convex subset C can be expressed in this way by setting η = 0, L = 1 and h = d_C.

Question: what can we do if Π_C does not have a closed form?

SLIDE 36

EPIGRAPHICAL PROJECTION

For every u = [(u^(1))^⊤, ..., (u^(L))^⊤]^⊤ ∈ R^Q (block u^(ℓ) of size Q^(ℓ)),

u ∈ C ⇔ Σ_{ℓ=1}^L h^(ℓ)(u^(ℓ)) ≤ η.

By introducing the auxiliary vector ζ = (ζ^(ℓ))_{1≤ℓ≤L} ∈ R^L:

u ∈ C ⇔ { Σ_{ℓ=1}^L ζ^(ℓ) ≤ η  and  (∀ℓ ∈ {1, ..., L}) h^(ℓ)(u^(ℓ)) ≤ ζ^(ℓ) }.

SLIDES 37-38

EPIGRAPHICAL PROJECTION

u ∈ C ⇔ { ζ ∈ V and (u, ζ) ∈ E }, where

• V denotes the closed half-space V = {ζ ∈ R^L | 1_L^⊤ ζ ≤ η}
  → Π_V has a closed form: projection onto a half-space;
• E is the closed convex set associated with the epigraphical constraint,
  E = {(u, ζ) ∈ R^Q × R^L | (∀ℓ ∈ {1, ..., L}) (u^(ℓ), ζ^(ℓ)) ∈ epi h^(ℓ)}
  → Π_E has a closed form for specific choices of h^(ℓ).

SLIDES 39-40

EPIGRAPHICAL PROJECTION

• Euclidean norm functions, defined for every ℓ ∈ {1, ..., L} and every u^(ℓ) ∈ R^{Q^(ℓ)} as

  h^(ℓ)(u^(ℓ)) = τ^(ℓ) ‖u^(ℓ)‖,  where τ^(ℓ) ∈ ]0, +∞[.

• Epigraphical projection: for every (u^(ℓ), ζ^(ℓ)) ∈ R^{Q^(ℓ)} × R,

  Π_{epi h^(ℓ)}(u^(ℓ), ζ^(ℓ)) =
    (u^(ℓ), ζ^(ℓ)),                 if τ^(ℓ) ‖u^(ℓ)‖ ≤ ζ^(ℓ),
    (0, 0),                         if ‖u^(ℓ)‖ < −τ^(ℓ) ζ^(ℓ),
    α^(ℓ) (u^(ℓ), τ^(ℓ) ‖u^(ℓ)‖),  otherwise,

  where α^(ℓ) = (1 + τ^(ℓ) ζ^(ℓ)/‖u^(ℓ)‖) / (1 + (τ^(ℓ))²).
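The closed form above translates directly into a few lines; the sketch below implements the three-branch formula for h(u) = τ‖u‖₂ (the test point is illustrative).

```python
import numpy as np

def proj_epi_norm(u, zeta, tau):
    """Projection of (u, zeta) onto epi h, h(u) = tau * ||u||_2."""
    nu = np.linalg.norm(u)
    if tau * nu <= zeta:              # already in the epigraph
        return u.copy(), zeta
    if nu < -tau * zeta:              # projects onto the origin
        return np.zeros_like(u), 0.0
    alpha = (1.0 + tau * zeta / nu) / (1.0 + tau ** 2)
    return alpha * u, alpha * tau * nu

u, zeta = np.array([3.0, 4.0]), 0.0   # ||u|| = 5, with tau = 1
pu, pz = proj_epi_norm(u, zeta, 1.0)
print(pu, pz)                         # [1.5 2. ] 2.5, on the cone boundary
```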

SLIDE 41

EPIGRAPHICAL PROJECTION

• Infinity-norm functions, defined for every ℓ ∈ {1, ..., L} and every u^(ℓ) = (u^(ℓ,m))_{1≤m≤Q^(ℓ)} ∈ R^{Q^(ℓ)} as

  h^(ℓ)(u^(ℓ)) = max { |u^(ℓ,m)| / τ^(ℓ,m) | 1 ≤ m ≤ Q^(ℓ) },

  where (τ^(ℓ,m))_{1≤m≤Q^(ℓ)} ∈ ]0, +∞[^{Q^(ℓ)}.

Π_{epi h^(ℓ)}(u^(ℓ), ζ^(ℓ)) has a closed form [G. Chierchia et al., 2012].

SLIDE 42

RECONSTRUCTION PROBLEM: PET

• High level of noise
• Large amount of data

SLIDES 43-46

RECONSTRUCTION PROBLEM

z = P_α(A y), where
• P_α: Poisson noise of scale parameter α,
• A: projection matrix.

Our objective is:

min over x ∈ R^K: Σ_{t=1}^T [ D_KL(A F_t^* x, z) + κ tv(F_t^* x) ] + ι_C(x) + ϑ ‖x‖_{ℓ1},  with y = F^* x = (F_t^* x)_{1≤t≤T},

where κ > 0, ϑ > 0 and
• D_KL is the Kullback-Leibler divergence ⇒ split into several proximable functions;
• tv represents a total-variation term ⇒ closed form in [Combettes and Pesquet, 2008];
• ι_C is the indicator function of a closed convex set C ⇒ projection onto C;
• ‖x‖_{ℓ1} denotes the ℓ1-norm ⇒ soft thresholding [Chaux et al., 2007].
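The Poisson data term behind the Kullback-Leibler divergence admits a standard componentwise closed-form prox. The sketch below uses the commonly cited formula for φ(v) = v − z log v (an assumption about the exact form used in the talk; valid for z > 0), and checks it against the first-order optimality condition.

```python
import numpy as np

# Componentwise prox of gamma * phi, phi(v) = v - z log(v), the Poisson
# negative log-likelihood term behind the KL data fidelity (z > 0).
# Solving (v - u) + gamma * (1 - z / v) = 0 for v > 0 gives:
def prox_kl(u, z, gamma):
    return 0.5 * (u - gamma + np.sqrt((u - gamma) ** 2 + 4.0 * gamma * z))

u = np.array([0.2, 1.0, 5.0])
z = np.array([1.0, 2.0, 3.0])
v = prox_kl(u, z, 0.5)
print((v - u) + 0.5 * (1.0 - z / v))   # optimality residual, close to 0
```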

SLIDES 47-50

PET RECONSTRUCTION RESULTS

(Figures: slice-by-slice reconstruction results, comparing the original image with the SIEVES and PPXA reconstructions.)

SLIDE 51

IMAGE RESTORATION WITH MISSING SAMPLES

Original: y ∈ R^N; degraded: z ∈ R^M, with z = A y + b, where
• y: original image in [0, 255]^N, assumed to be sparse after some appropriate transform;
• A ∈ R^{M×N}: randomly decimated convolution;
• b ∈ R^M: realization of a zero-mean white Gaussian noise;
• z: degraded image of size M.
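The degradation model z = A y + b can be sketched in 1D: a banded convolution matrix followed by the selection of a random subset of its rows ("random decimation"). The kernel, sizes, and noise level are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(4)

# Randomly decimated convolution A: convolution matrix H, then keep a
# random subset of rows. Sizes and kernel are illustrative.
N, kernel = 64, np.array([0.25, 0.5, 0.25])
H = np.zeros((N, N))
for i in range(N):
    for k, c in enumerate(kernel):
        j = i + k - 1                     # centered kernel, zero padding
        if 0 <= j < N:
            H[i, j] = c

keep = np.sort(rng.choice(N, size=N // 2, replace=False))
A = H[keep]                               # M x N with M = N // 2

y = rng.uniform(0.0, 255.0, size=N)       # original signal in [0, 255]^N
b = 2.0 * rng.standard_normal(len(keep))  # zero-mean white Gaussian noise
z = A @ y + b                             # degraded observations
```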

SLIDE 52

IMAGE RESTORATION WITH MISSING SAMPLES

ŷ ∈ Argmin_{y ∈ [0,255]^N} ‖A y − z‖²  subject to  Σ_{ℓ=1}^N ‖Y^(ℓ)‖_p ≤ η,

where
• Y^(ℓ) = (ω_{ℓ,n}(y^(ℓ) − y^(n)))_{n ∈ N_ℓ};
• p ≥ 1 and η > 0.

Particular cases:
• ℓ2-TV: p = 2, ω_{ℓ,n} = 1, and N_ℓ the horizontal and vertical neighbours;
• ℓ∞-TV: p = ∞, ω_{ℓ,n} = 1, and N_ℓ the horizontal and vertical neighbours;
• ℓ2-NLTV: p = 2, ω_{ℓ,n} as in [Foi and Boracchi, 2012] and N_ℓ as in [Gilboa and Osher, 2007];
• ℓ∞-NLTV: p = ∞, ω_{ℓ,n} as in [Foi and Boracchi, 2012] and N_ℓ as in [Gilboa and Osher, 2007].

SLIDE 53

IMAGE RESTORATION WITH MISSING SAMPLES

Argmin over y: ‖A y − z‖² s.t. Σ_{ℓ=1}^N ‖Y^(ℓ)‖_p ≤ η and y ∈ [0, 255]^N

⇕ (epigraphical reformulation)

Argmin over (y, ζ): ‖A y − z‖² s.t. (∀ℓ ∈ {1, ..., N}) ‖Y^(ℓ)‖_p ≤ ζ^(ℓ), Σ_{ℓ=1}^N ζ^(ℓ) ≤ η, and y ∈ [0, 255]^N

SLIDE 54

IMAGE RESTORATION WITH MISSING SAMPLES

Figure: comparison between the epigraphical method (solid line) and the direct method (dashed line): ‖y^[n] − y^[∞]‖/‖y^[∞]‖ in dB vs. time, for ℓ2-TV, ℓ∞-TV, ℓ2-NLTV and ℓ∞-NLTV.

SLIDES 55-56

IMAGE RESTORATION WITH MISSING SAMPLES

Culicoidae image (degraded zoom), output SNR:
• first experiment: GPSR 17.03 dB; ℓ2-TV 20.80 dB; ℓ∞-TV 20.25 dB; ℓ2-NLTV 22.62 dB; ℓ∞-NLTV 22.38 dB;
• second experiment: GPSR 20.26 dB; ℓ2-TV 23.18 dB; ℓ∞-TV 22.77 dB; ℓ2-NLTV 24.18 dB; ℓ∞-NLTV 24.14 dB.

SLIDE 57

SEISMIC DATA ACQUISITION

Figure: principles of seismic wave propagation, with reflections on different layers, and data acquisition. Solid blue: primary; dashed red: multiple.

SLIDE 58

OBSERVATION MODEL

z(n) = s(n) + y(n), where
• n ∈ {0, ..., N−1}: the time index;
• z = (z(n))_{0≤n<N}: the observed data, combining
  1. the primary y = (y(n))_{0≤n<N} (signal of interest, unknown),
  2. the multiples (s(n))_{0≤n<N} (sum of undesired reflected signals).

We assume that a template (r(n))_{0≤n<N} for the disturbance signal is available and that

s(n) = Σ_{p=p'}^{p'+P−1} h^(n)(p) r(n−p).

The problem can be rewritten as z = R h + y.
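The operator R collecting the time-varying filters can be built explicitly: row n of R applies the filter h^(n) to shifted samples of the template r, so that (Rh)(n) = Σ_p h^(n)(p) r(n−p). The sizes N, P and the offset p' (written p0) below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(5)
N, P, p0 = 16, 3, 0                 # p0 stands in for p'
r = rng.standard_normal(N)          # template for the disturbance

# Row n of R holds r(n - p) at the positions of the stacked filter
# coefficients h^(n)(p), so that (R h)(n) = sum_p h^(n)(p) r(n - p).
R = np.zeros((N, N * P))
for n in range(N):
    for p in range(p0, p0 + P):
        if 0 <= n - p < N:
            R[n, n * P + (p - p0)] = r[n - p]

h = rng.standard_normal(N * P)      # stacked time-varying filters h^(n)
s = R @ h                           # multiples generated by the model
```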

SLIDE 59

MAP ESTIMATION - FILTERS h

Assumptions:
1. x = F y (where F ∈ R^{N×N} denotes the analysis operator) is a realization of a random vector whose probability density function (pdf) is given by (∀x ∈ R^N) f_X(x) ∝ exp(−ϕ(x));
2. h is a realization of a random vector whose pdf is expressed as (∀h ∈ R^{NP}) f_H(h) ∝ exp(−ρ(h)), and which is independent of x.

MAP estimation of h:

minimize over h ∈ R^{NP}: ϕ(F(z − R h)) + ρ(h).

• ϕ: data-fidelity term taking into account the statistical properties of the basis coefficients;
• ρ: prior information available on h.

SLIDES 60-61

CONVEX CONSTRAINTS ON THE FILTERS

Assumption: the filters vary slowly along the time index n:

(∀(n, p)) |h^(n+1)(p) − h^(n)(p)| ≤ ε_p.

The associated closed convex set is defined as

C = { h ∈ R^{NP} | (∀(n, p)) |h^(n+1)(p) − h^(n)(p)| ≤ ε_p }.

Minimization problem to be solved:

minimize over h ∈ R^{NP}: ϕ(F(z − R h)) + ρ(h) + ι_{C1}(h) + ι_{C2}(h).

PPXA+ is used to perform the minimization.

SLIDE 62

RESULTS: CONTEXT

• N = 2048; filter length: P = 14 (noise-free case), P = 10 (noisy case);
• PPXA+ parameters: λ_i ≡ 1.5, ω_1 = 10000/N, ω_2 = ω_1/P, ω_3 = ω_4 = 10 ω_2;
• iteration number: 10000 (stopping criterion at iteration i if ‖h^[i+1] − h^[i]‖ < 10^{−5});
• choice of functions: ϕ_k ≡ |·| and ρ = µ ‖·‖², with µ = 0.01;
• choice of basis: Symlet wavelets of length 8 over 3 resolution levels.

SLIDES 63-64

RESULTS: NOISE-FREE CASE

Figure: observed signal z, original signal y, model r, original multiple s, estimated signal ŷ and estimated multiples ŝ (time samples 850 to 1350).

NOISY CASE

Figure: reference signal y and estimated signal ŷ; multiples s and estimated multiples ŝ (time samples 1020 to 1200).

SLIDES 65-66

CONCLUSION

• Proximity operators and proximal methods prove to be very flexible tools for solving the variational problems encountered in inverse problems.
• The convex criterion can be composed of various terms modelling data fidelity (often linked to the noise statistics) as well as prior information, possibly formulated as convex (hard) constraints.
• Frames can be used to introduce prior information.
• Many other applications have been investigated (pMRI, compressive sensing, satellite imaging, stereovision, microscopy imaging, ...).

Future work:
• Use of these methods in statistical learning.
• Extension to the nonconvex case.

Thank you!

SLIDE 67

SOME REFERENCES

• L. Condat, "A primal-dual splitting method for convex optimization involving Lipschitzian, proximable and linear composite terms," J. Optimization Theory and Applications, to appear, 2013.
• G. Chierchia, N. Pustelnik, J.-C. Pesquet, B. Pesquet-Popescu, "Epigraphical projection and proximal tools for solving constrained convex optimization problems: Part I," arXiv:1210.5844, 2012.
• F. Bach, R. Jenatton, J. Mairal, and G. Obozinski, "Optimization with sparsity-inducing penalties," Foundations and Trends in Machine Learning, vol. 4, no. 1, pp. 1-106, 2012.
• P. L. Combettes and J.-C. Pesquet, "Primal-dual splitting algorithm for solving inclusions with mixtures of composite, Lipschitzian, and parallel-sum type monotone operators," Set-Valued and Variational Analysis, vol. 20, no. 2, pp. 307-330, Jun. 2012.
• J.-C. Pesquet and N. Pustelnik, "A parallel inertial proximal optimization method," Pacific Journal of Optimization, vol. 8, no. 2, pp. 273-305, Apr. 2012.
• N. Pustelnik, J.-C. Pesquet, C. Chaux, "Relaxing tight frame condition in parallel proximal methods for signal restoration," IEEE Trans. on Sig. Proc., vol. 60, no. 2, pp. 968-973, Feb. 2012.
• H. H. Bauschke and P. L. Combettes, Convex Analysis and Monotone Operator Theory in Hilbert Spaces, Springer-Verlag, New York, 2011.
• H. Raguet, J. Fadili, G. Peyré, "Generalized forward-backward splitting," preprint, arXiv:1108.4404, 2011.
• B. C. Vũ, "A splitting algorithm for dual monotone inclusions involving cocoercive operators," Advances in Computational Mathematics, Nov. 2011.
• L. M. Briceño-Arias and P. L. Combettes, "A monotone+skew splitting model for composite monotone inclusions in duality," SIAM Journal on Optimization, vol. 21, no. 4, pp. 1230-1250, Oct. 2011.
• P. L. Combettes and J.-C. Pesquet, "Proximal splitting methods in signal processing," in: Fixed-Point Algorithms for Inverse Problems in Science and Engineering (H. H. Bauschke, R. S. Burachik, P. L. Combettes, V. Elser, D. R. Luke, and H. Wolkowicz, eds.), pp. 185-212, Springer-Verlag, New York, 2011.
• A. Chambolle, T. Pock, "A first-order primal-dual algorithm for convex problems with applications to imaging," Journal of Mathematical Imaging and Vision, vol. 40, no. 1, pp. 120-145, May 2011.
• P. L. Combettes and J.-C. Pesquet, "A proximal decomposition method for solving convex variational inverse problems," Inverse Problems, vol. 24, no. 6, article ID 065014, 27 pp., Dec. 2008.
• P. L. Combettes and J.-C. Pesquet, "A Douglas-Rachford splitting approach to nonsmooth convex variational signal recovery," IEEE Journal of Selected Topics in Signal Processing, vol. 1, no. 4, pp. 564-574, Dec. 2007.
• C. Chaux, P. L. Combettes, J.-C. Pesquet, and V. R. Wajs, "A variational formulation for frame-based inverse problems," Inverse Problems, vol. 23, no. 4, pp. 1495-1518, Aug. 2007.

34 / 34