Non Convex Minimization using Convex Relaxation Some Hints to - PowerPoint PPT Presentation

Non Convex Minimization using Convex Relaxation Some Hints to Formulate Equivalent Convex Energies Mila Nikolova ( CMLA, ENS Cachan, CNRS, France ) SIAM Imaging Conference (IS14) Hong Kong Minitutorial: May 13, 2014

Outline 1. Energy minimization methods 2. Simple Convex Binary Labeling / Restoration 3. MS for two phase segmentation: The Chan-Vese (CV) model 4. Nonconvex data Fidelity with convex regularization 5. Minimal Partitions 6. References 2

1. Energy minimization methods u : Ω → R k is defined by In many imaging problems the sought-after image � u = arg min � E ( u ) for E ( u ) := Ψ( u, f ) + λ Φ( u ) + ı S ( u ) λ > 0 u f given image, Ψ data fidelity, Φ regularization, S set of constraints, ı indicator function ( i S ( u ) = 0 if u ∈ S and i S ( u ) = + ∞ otherwise) • Often u �→ E ( u ) is nonconvex Algorithms easily get trapped in local minima How to find a global minimizer? Many algorithms, usually suboptimal. 3

Some famous nonconvex problems for labeling and segmentation Potts model [Potts 52] ( ℓ 0 semi-norm applied to differences):   ∑ 0 if t = 0 E ( u ) = Ψ( u, f ) + λ ϕ ( u [ i ] − u [ j ]) ϕ ( t ) :=  1 if t ̸ = 0 i,j u, ˆ u,ℓ F ( u, ℓ ) Line process in Markov random field priors [Geman, Geman 84]: ( � ℓ ) = arg min ( ∑ ) ∑ ∑ F ( u, ℓ ) = ∥ A ( u ) − f ∥ 2 2 + λ φ ( u [ i ] − u [ j ])(1 − ℓ i,j ) + V( ℓ i,j , ℓ k,n ) j ∈N i ( k,n ) ∈N i,j i [ ] [ ] ℓ i,j = 0 ⇔ no edge ⇔ , ℓ i,j = 1 edge between i and j i N i ❝ ❝ ❝ ❝ i i ❝ ❝ ❝ s ❝ ❝ s ❝ ❝ ❝ ❝ ❝ ❝ M.-S. functional [Mumford, Shah 89]: (∫ ) ∫ ( u − v ) 2 dx + λ ∥∇ u ∥ 2 dx + α | L | F ( u, L ) = | L | = length ( L ) Ω \ L Ω 4

Image credits: S. Geman and D. Geman 1984. Restoration with 5 labels using Gibbs sampler “We make an analogy between images and statistical mechanics systems. Pixel gray levels and the presence and orientation of edges are viewed as states of atoms or molecules in a lattice-like physical system. The assignment of an energy function in the physical system determines its Gibbs distribution. Because of the Gibbs distribution, Markov random field (MRF) equivalence, this assignment also determines an MRF image model.” [S. Geman, D. Geman 84] 5

A perfect bypass: Find another functional F : Ω → R , easy to minimize, such that arg min F ( u ) ⊆ arg min E ( u ) u u e.g., F is convex and coercive. • Subtle and case-dependent. • We are in the inception phase... 6

Finding a globally optimal solution to a hard problem by conceiving another problem having the same set of optimal solutions and easy to solve has haunted researchers for a long time. • The Weiszfeld algorithm: E. Weiszfeld, Sur le point pour lequel la somme des distances de n points donn´ ees est minimum,” Tˆ ohoku Mathematical Journal, vol. 43, pp. 355–386, 1937. The word algorithm was unknown to most mathematicians by 1937. The Weiszfeld algorithm has extensively been used (e.g., in economics) when computers were available. • G. Dantzig, R. Fulkerson and S. Johnson, “Solution of a large-scale traveling-salesman problem”, Operations Research, vol. 2, pp. 393–410, 1954 • R. E. Gomory, “Outline of an algorithm for integer solutions to linear programs” Bull. Amer. Math. Soc., 64(5), pp. 217–301, 1958. (Tight) convex relaxation is only one somehow “secured” way to tackle hard minimization problems. This talk focuses on convex relaxations for imaging applications. − Discrete setting – MRF – geometry of images may be difficult to handle. − Continuous setting – in general more accurate approximations can be derived. Experimental comparison of discrete and continuous shape optimization – [Klodt et al, 2008] Applications in imaging: image restoration, image segmentation, disparity estimation of stereo images, depth map estimation, optical flow estimation, (multi) labeling problems, among many others. 7

Loose convex relaxation Often in practice No way to get � How to get � u ? u • In practice arg min E ( u ) ⊆ arg min F ( u ) u u • Convex relaxation is tight in each of the cases – arg min E ( u ) ⊇ arg min F ( u ) u u – we know how to reach � u ∈ arg min E ( u ) from � u ∈ arg min F ( u ) u u We will explain how several successful convex relaxations have been obtained. We will exhibit some limits of the approach. 8

Notation • Image domain and derivatives ◦ Ω ⊂ R 2 continuous setting, Du is the (distributional) derivative of u ; ◦ Ω = h { 1 , · · · , M } × h { 1 , · · · , N } grid with step h , Du is a set difference operators x = ( x 1 , x 2 ) ∈ Ω • { u > t } := { x ∈ Ω : u ( x ) > t } the super-levels of u • Σ ⊂ Ω (in general non connected) ∂ Σ is its boundary in Ω and Per(Σ) its perimeter   1 if x ∈ Σ • 1 l Σ ( x ) = the characteristic function of Σ  0 otherwise   0 if x ∈ Σ • ı Σ ( x ) = the indicator function of Σ  + ∞ otherwise } • supp ( u ) := { x ∈ Ω : u ( x ) ̸ = 0 • BV (Ω) – the set of all functions of bounded variation defined on Ω 9

Useful formulas ⋄ u ∈ BV (Ω) ∫ ∫ + ∞ • Coarea formula ∥ Du ∥ dx = Per( { x : u ( x ) > t } ) dt TV( u ) = (coa) −∞ Per(Σ) = TV(1 l Σ ) (per) • Layer-cake formulas ∫ + ∞ ◦ u ( x ) = 1 l { x : u ( x ) >t } ( x ) dt (cake) −∞ ∫ + ∞ � � � { x : u ( x ) > t } △ { x : f ( x ) > t } � dt ◦ ∥ u − f ∥ 1 = (cake1) −∞ △ symmetric difference [T.Chan, Esedoglu 05], [T. Chan, Esedoglu, Nikolova 06] is a normed vector space, V ∗ its dual and F : V → R is proper ⋄ V { } is F ∗ ( v ) := sup v ∈ V ∗ • The convex conjugate of F ⟨ u, v ⟩ − F ( u ) (cc) u ∈ V 10

2. Simple Convex Binary Labeling / Restoration [T. Chan, Esedoglu, Nikolova 06] Given a binary input image f = 1 l Σ , we are looking for a binary � u ( x ) = 1 l � Σ ( x ) Constraint : u ( x ) = 1 l Σ ( x ) [Vese, Osher 02] l Σ ∥ 2 E ( u ) = ∥ u − 1 2 + λ TV( u )+ ı S ( u ) l E : E ⊂ R 2 , S := { u = 1 E bounded } (the binary images) E is nonconvex because of the constraint S ⇒ Nonconvex (intuitive) minimization: E = { x ∈ R 2 : φ ( x ) > 0 } ∂E = { x ∈ R 2 : φ ( x ) = 0 } • Level set method [Osher, Sethian 88] ⇒ Then E is equivalent to ∫ l Σ ∥ 2 E 1 ( φ ) = ∥ H ( φ ) − 1 2 + λ R 2 |∇ H ( φ ( x )) | dx   1 if t ≥ 0 H : R → R the Heaviside function H ( t ) =  0 if t < 0 Computation gets stuck in local minima 11

l Σ ( x ) , Σ ⊂ R 2 bounded L 1 − T V energy: F ( u ) = ∥ u − f ∥ 1 + λ TV( u ) f ( x )=1 F is coercive and non-strictly convex ⇒ arg min F is nonempty, closed and convex By (coa) and (cake1) ∫ + ∞ ∫ + ∞ � � � � ( ) ( ) � { u > t } △ { f > t } � + λ Per � { u > t } △ Σ � + λ Per F ( u ) = { u > t } { u > t } dt = dt −∞ −∞ E ⊂ R 2 bounded l Σ ∥ 2 ⇒ ∥ 1 l E − 1 2 = ∥ 1 l E − 1 l Σ ∥ 1 ⇒ E (1 l E ) = F (1 l E ) Geometrical nonconvex problem: E 1 ( E ) = | E △ Σ | + λ Per( E ) ≡ E (1 l E ) (geo) � There exists Σ ∈ arg min E ⊂ R 2 E 1 ( E ) u ∈ R 2 F ( u ) set � For � u ∈ arg min Σ( γ ) = { � u > γ } for a.e. γ ∈ [0 , 1] F (1 l � Σ( γ ) ) ≥ E (1 l � Σ ) = F ( � u ) ⇒ u := 1 l � Σ ∈ arg min u F ( u ) � Further, F (1 l � Σ( γ ) ) = F ( � u ) for a.e. γ ∈ [0 , 1] . Therefore (i) � u = 1 l � Σ is a global minimizer of E ⇒ u ∈ arg min � u ∈ R 2 F ( u ) ; u ∈ S E ( u ) , � (ii) � u ∈ arg min u ∈ R 2 F ( u ) ⇒ u := 1 � l � Σ ∈ arg min Σ := { � u > γ } for a.e. γ ∈ [0 , 1] . For a.e. λ > 0 , F has a unique minimizer � u which is binary by (i) [T. Chan, Esedoglu 05] 12

• In practice one finds a binary minimizer of F • If f = 1 l Σ is noisy, the noise is in the shape ∂ Σ Restoring � u = denoising = 0-1 segmentation = shape optimization • The crux: L 1 data fidelity [Alliney 92], [Nikolova 02], [T. Chan, Esedoglu 05] ⇒ (cake1) Data Restored 13

3. MS for two phase segmentation: The Chan-Vese (CV) model [T. Chan, Vese 2001] ∫ ∫ ( c 1 − f ) 2 dx + ( c 2 − f ) 2 dx + λ Per(Σ; Ω) Ω ⊂ R 2 MS(Σ , c 1 , c 2 ) = for bounded Σ Ω \ Σ f : Ω → R 2 . One should solve c 1 ,c 2 ∈ R , Σ ⊂ Ω MS(Σ , c 1 , c 2 ) min for l Σ this amounts to E 1 ( E ) in (geo) For c 1 = 1 , c 2 = 0 and f = 1 ∫ ∫ For the optimal � 1 1 Σ one has ˆ c 1 = Σ fdx and ˆ c 2 = Σ fdx � Ω \ � | � | Ω \ � Σ | Σ | Two-step iterative algorithms to approximate the solution [T. Chan, Vese 2001] ∫ Ω H ( φ )( c 1 − f ) 2 + (1 − H ( φ ))( c 2 − f ) 2 + λ ∥ DH ( φ ) ∥ (a) Solve min ϕ (b) Update c 1 and c 2 Step (a) solves for c 1 and c 2 fixed the nonconvex problem ∫ ∫ ( c 1 − f ) 2 dx + ( c 2 − f ) 2 dx + λ Per(Σ; Ω) E (Σ) = Σ Ω \ Σ Alternative for step (a): Variational approximation + Γ convergence [Modica, Mortola 77] ∫ ( ) R 2 u 2 ( c 1 − f ) 2 + (1 − u ) 2 ( c 2 − f ) 2 + λ ε ∥ Du ∥ 2 + 1 E ε ( u ) = ε W ( u ) dx W double-well potential, W (0) = W (1) = 0 , W ( u ) > 0 else. E.g., W ( u ) = u 2 (1 − u 2 ) W forces � u to be a characteristic function when ε ↘ 0 . 14

Non Convex Minimization using Convex Relaxation Some Hints to - PowerPoint PPT Presentation

Non Convex Minimization using Convex Relaxation Some Hints to Formulate Equivalent Convex Energies Mila Nikolova ( CMLA, ENS Cachan, CNRS, France ) SIAM Imaging Conference (IS14) Hong Kong Minitutorial: May 13, 2014 Outline 1. Energy

Convex Hell 362 dnc CS 16: Convex Hull Whoops, I mean... Convex Hull Whats a Convex Hull?

Convex hull 1 - 1 Convex hull 1 - 2 Convex hull 1 - 3 Convex hull Definition, extremal

CS133 Computational Geometry Convex Hull 1 Convex Hull Given a set of n points, find the

constrained convex optimization virgil pavlu 1 convex set a set X in a vector space is convex if

Graph cut, convex relaxation and continuous max-flow problem Egil Bae (UCLA) and Xue-Cheng Tai

A convex relaxation for weakly supervised classifiers Armand Joulin and Francis Bach SIERRA

Introductory Course on Non-smooth Optimisation Lecture 09 - Non-convex optimisation Jingwei Liang

Some Recent Advances in Non-convex Optimization Purushottam Kar IIT KANPUR Outline of the Talk

Minimization Satoru Iwata (University of Tokyo) Submodular Function Minimization ( )

Optimizing Convex Functions over Non-Convex Domains Dan Bienstock and Alex Michalka

Di ff erentially Private Empirical Risk Minimization with Non-convex Loss Functions Di Wang ,

Minimization Using Descent Information we will consider the minimization of unconstrained

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets Instructor: Shaddin Dughmi

Convex hull: basic facts Convex hull: basic facts CG Lecture 1 CG Lecture 1 Problem : give a set

Convex hulls of spheres and convex hulls of convex polytopes lying on parallel hyperplanes

CS675: Convex and Combinatorial Optimization Fall 2019 Convex Functions Instructor: Shaddin

Securing Materialized Views: a Rewriting-Based Approach Sarah Nait Bahloul, Emmanuel Coquery and

Classification Department Biosysteme Karsten Borgwardt Data Mining Course Basel Fall Semester

FPGA Implementation and Comparison of Protections Against SCAs for RLWE Timo Zijlstra 1 Karim

BBR Congestion Control: IETF 100 Update: BBR in shallow buffers Neal Cardwell, Yuchung Cheng, C.

Introduction to Seaborn DATA VIS UALIZ ATION W ITH S EABORN Chris Moftt Instructor Python

Highlighting data IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON Nick Stra y er Instr u

Be Be a Hawk not a Tu Turkey How a Birds Eye View of your Data Can Streamline Data Analysis

Finding structure in the dark: Coupled Dark Energy Models Mark Trodden University of