Non-Smooth Convex Optimization in Data Sciences
Jalal Fadili
Normandie Université-ENSICAEN, GREYC
Mathematical coffees 2018
Outline: Introduction. Non-smooth convex optimization. Elements of convex analysis. Elements of duality. Optimality.
[Figure: the inverse problem setup — an unknown object x_0 ∈ R^n is observed through a forward model (sensing, measurement/degradation); recovery combines the forward model with prior knowledge (regularization, constraints), since x_0 typically lives in a low-dimensional manifold.]
The inverse problem is solved variationally: min_{x∈H} F(x), where F combines a data-fidelity term and a regularization/constraint term.
Notations
H is a finite-dimensional Hilbert space (typically the real vector space R^N) endowed with the inner product ⟨·, ·⟩ and associated norm ‖·‖.
I is the identity operator on H.
The operator (spectral) norm of A : H_1 → H_2 is denoted ‖A‖ = sup_{x≠0} ‖Ax‖/‖x‖.
‖·‖_p, p ≥ 1, is the ℓ_p-norm, with the usual adaptation for the case p = +∞. B_p^ρ is the (convex compact) ℓ_p-ball, p ≥ 1, centered at the origin 0 and of radius ρ > 0; x + B_p^ρ is the same ball centered at x.
Definition (Convex set) A set C ⊆ H is said to be convex if:
∀x, y ∈ C, 0 ≤ ρ ≤ 1 ⇒ ρx + (1 − ρ)y ∈ C.
Definition (Cone) A cone C is a set such that the "open" half-line {tx : t > 0} is entirely contained in C whenever x ∈ C. In the usual geometrical representation, a cone has an apex; here at 0.
Property (Convex cone) A cone C is convex ⇔ C + C ⊆ C.
Proposition (Convexity-preserving operations)
Convexity is stable under intersection: if C_i, i ∈ I, are convex, then ∩_{i∈I} C_i is convex.
Convexity is stable under Cartesian product, and conversely: C_i, i ∈ I, are convex ⇔ C_1 × · · · × C_{|I|} is convex.
Convexity is stable under affine mappings: the image of a convex set under an affine map A is also convex (e.g. reflection, Minkowski sum).
If a set is convex, so are its interior and its closure.
Definition (Affine hull) An affine combination of x_1, · · · , x_n ∈ H is an element ∑_{i=1}^n a_i x_i with ∑_{i=1}^n a_i = 1. All such affine combinations form an affine manifold of H. The affine hull of C is
aff(C) = { x ∈ H : x = ∑_{i=1}^n a_i y_i, y_i ∈ C and ∑_{i=1}^n a_i = 1 }.
The interior of a convex set is empty unless the set is full-dimensional. Let C be a sheet of paper: its interior is empty in the surrounding space R³, . . . but not in the plane R² of the table it is lying on. The concept of relative interior removes this ambiguity by defining the interior for a different topology: the one that equips the affine hull (which becomes a topological space in its own right). In convex analysis and optimization, the topology of the whole space is of moderate interest; the notions relative to the affine hull are much richer.
Definition (Relative interior) The relative interior ri(C) of a convex set C ⊆ H is the interior of C for the topology relative to its affine hull, i.e. x ∈ ri(C) if and only if:
x ∈ aff(C) and ∃ρ > 0 s.t. aff(C) ∩ B_H^ρ(x) ⊆ C.
Examples (C, aff(C), dim(C), ri(C)):
C = {x}: aff(C) = {x}, dim(C) = 0, ri(C) = {x}.
C = [x, x′]: aff(C) = affine line generated by x and x′, dim(C) = 1, ri(C) = (x, x′).
C = simplex S_N in R^N: aff(C) = affine manifold of equation ∑_{i=1}^N x_i = 1, dim(C) = N − 1, ri(C) = {x ∈ S_N : x[i] > 0}.
C = B_2^ρ ⊆ R^N: aff(C) = R^N, dim(C) = N, ri(C) = int(B_2^ρ).
Proposition (Properties of the relative interior)
ri(C) ⊆ C, ri(C) is convex and dim(ri(C)) = dim(C).
Let x ∈ cl(C) and x′ ∈ ri(C); then (x, x′] ⊆ ri(C). Consequently, the convex sets C, ri(C) and cl(C) have the same affine hull, the same relative interior and the same closure.
The relative topology fits well with convexity-preserving operations. Let C_i, i = 1, · · · , n, be convex sets:
If ∩_i ri(C_i) ≠ ∅, then ∩_i ri(C_i) = ri(∩_i C_i).
ri(C_1) × · · · × ri(C_n) = ri(C_1 × · · · × C_n).
Let A be an affine map; then ri(AC) = A(ri(C)).
0 ∈ ri(C_1 − C_2) ⇔ ri(C_1) ∩ ri(C_2) ≠ ∅.
Definition (Domain of a function) The domain dom(F) of a function F : H → R ∪ {+∞} is dom(F) = {x ∈ H : F(x) < +∞}.
Definition (Proper function) A function F is proper if dom(F) ≠ ∅.
Definition (Epigraph, level set, sublevel sets) The epigraph epi(F) of a function F : H → R ∪ {+∞} is epi(F) = {(x, t) ∈ H × R : F(x) ≤ t}. The level set of F at t_0 is lev_{t_0}(F) = {x ∈ H : F(x) = t_0}. The sublevel set at t_0 is ∪_{t ≤ t_0} lev_t(F).
Definition (Coercivity) F is (weakly- or 0-)coercive if lim_{‖x‖→∞} F(x) = +∞.
Definition (Convex function) A function F : H → R ∪ {+∞} is convex if
∀x, y ∈ H, 0 < ρ < 1, F(ρx + (1 − ρ)y) ≤ ρF(x) + (1 − ρ)F(y).
It is strictly convex if the inequality is strict for x ≠ y.
Definition (Lower semicontinuity) A real-valued function f is lower semicontinuous (lsc) at x_0 if lim inf_{x→x_0} f(x) ≥ f(x_0). It is lsc on C ⊆ H if it is lsc at each of its points.
Proposition Let F : H → R ∪ {+∞}. F is lsc ⇔ its epigraph is closed ⇔ its sublevel sets at t are closed for all t ∈ R.
Lower semicontinuity is weaker than continuity, and plays an important role for the existence of solutions of minimization problems over a compact set (by closedness of the epigraph).
Notation Γ_0(H) is the class of all proper lsc convex functions from H to R ∪ {+∞}.
Proposition (Properties of closed convex functions)
A function F ∈ Γ_0(H) is (strictly) convex if and only if its epigraph is a (strictly) convex set.
F is strongly convex with modulus c if F − (c/2)‖·‖² is convex.
Any F ∈ Γ_0(H) is minorized by some affine function: F(y) ≥ F(x) + ⟨u, y − x⟩, ∀x ∈ ri(dom(F)), ∀y ∈ H.
Convexity and closedness of functions in Γ_0(H) are preserved under: positive combinations; pointwise supremum; (Legendre-Fenchel) conjugacy (see hereafter); pre-composition by an affine mapping A such that Im(A) ∩ dom(F) ≠ ∅; post-composition G ∘ F with an increasing convex function G ∈ Γ_0(R) if ∃x ∈ H s.t. F(x) ∈ dom(G), with the convention G(+∞) := +∞.
Theorem (Continuity properties) Let F be a convex function on R^N.
F is continuous on ri(dom(F)), and moreover locally Lipschitz-continuous there; in particular, it is Lipschitz on any compact subset C of ri(dom(F)).
If F is (uniformly) Lipschitz on a nonempty convex subset C, it has a convex Lipschitz extension (with the same Lipschitz constant) on the whole space that coincides with it on C.
Convex functions converging pointwise to some function F converge uniformly on compact sets.
Theorem (First-order properties) Let F be a convex function on R^N.
F is differentiable almost everywhere, i.e. the subset of int(dom(F)) where F is not differentiable is of zero Lebesgue measure.
F is differentiable on an open convex set O ⇔ F ∈ C¹(O).
Theorem (Second-order properties [A.D. Alexandrov]) Let F be a convex function. For all x ∈ int(dom(F)), except on a set of zero Lebesgue measure, F is differentiable at x and there exists a symmetric positive semi-definite operator D²F(x) such that, for all d ∈ R^N,
F(x + d) = F(x) + ⟨∇F(x), d⟩ + ½⟨D²F(x)d, d⟩ + o(‖d‖²).
Definition (Indicator function) Let C be a nonempty subset of H. The indicator function ı_C of C is
ı_C(x) = 0 if x ∈ C, +∞ otherwise,
with dom(ı_C) = C and epi(ı_C) = C × R_+.
Definition (Support function) Let C be a nonempty subset of H. Its support function is σ_C(u) = sup{⟨u, x⟩ : x ∈ C}, ∀u ∈ H; i.e. the pointwise supremum of the linear functions ⟨·, x⟩, x ∈ C.
Proposition σ_C is a closed convex function for any nonempty subset C. It is sublinear, i.e. positively homogeneous and subadditive, and is finite everywhere if C is bounded. C_1 ⊆ C_2 ⇒ σ_{C_1}(u) ≤ σ_{C_2}(u), ∀u ∈ H.
Lemma Any ℓ_p-norm is the support function of the unit ball B_q^1 of the dual norm ℓ_q, where 1/p + 1/q = 1.
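As a quick numerical sanity check of this lemma (an illustrative sketch, not from the slides; assumes NumPy): for 1 < p < ∞ and u ≠ 0, the supremum of ⟨u, x⟩ over the unit ℓ_q-ball is attained at a closed-form point and equals ‖u‖_p.

```python
import numpy as np

rng = np.random.default_rng(0)
u = rng.standard_normal(5)

for p in (1.5, 2.0, 3.0):
    q = p / (p - 1.0)                       # dual exponent: 1/p + 1/q = 1
    # Closed-form maximizer of <u, x> over the unit l_q ball
    x = np.sign(u) * np.abs(u) ** (p - 1) / np.linalg.norm(u, p) ** (p - 1)
    assert np.isclose(np.linalg.norm(x, q), 1.0)      # x lies on the l_q sphere
    assert np.isclose(u @ x, np.linalg.norm(u, p))    # sigma_{B_q^1}(u) = ||u||_p
```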
[Figure: geometry of the support function. For C = {x : x[1] > 0, x[2] ≥ 1/x[1]}, one has dom(σ_C) = (−∞, 0)² and σ_C(u) = −2√(u[1]u[2]); the supporting hyperplane ⟨u, x⟩ = σ_C(u) touches C at x*(u). A second panel shows epi(‖·‖_∞) in R² × R, the ℓ_∞-norm being the support function of the unit ℓ_1-ball B_1^1.]
Definition (Conjugate) Let F : H → R ∪ {+∞} have a minorizing affine function. The conjugate or Legendre-Fenchel transform of F is the function F* defined by
F*(u) = sup_{x∈dom(F)} ⟨u, x⟩ − F(x).
An immediate consequence is the Fenchel inequality: F*(u) + F(x) ≥ ⟨u, x⟩ for all (x, u) ∈ dom(F) × H.
[Figure: geometric interpretation of the conjugate — for u ∈ ∂F(x), the affine function y ↦ ⟨u, y⟩ − F*(u) is the largest affine minorant of F with slope u; it touches epi(F) at (x, F(x)), and its intercept is −F*(u).]
Theorem F* is a closed convex function. We also have F ∈ Γ_0(H) ⇔ the bi-conjugate F** = F.
Theorem (Calculus rules)
(F + t)*(u) = F*(u) − t.
(tF)*(u) = tF*(u/t), t > 0.
(F ∘ A)* = F* ∘ (A⁻¹)* for an invertible linear A.
(F(· − x_0))*(u) = F*(u) + ⟨u, x_0⟩.
F_1 ≤ F_2 ⇒ F_1* ≥ F_2*.
Separability: (∑_{i=1}^n F_i(x_i))* = ∑_{i=1}^n F_i*(u_i), where (x_1, · · · , x_n) ∈ H_1 × · · · × H_n.
Pre-composition with an affine operator: let F ∈ Γ_0(H) and A := A_0 · + b an affine operator. Assume that A(H) ∩ ri(dom(F)) ≠ ∅. Then, for every u ∈ dom((F ∘ A_0)*), the following minimization problem has a solution:
(F ∘ A)*(u) = inf_v {F*(v) − ⟨v, b⟩ : A_0*v = u}.
Conjugate of a sum: assume F_1, F_2 ∈ Γ_0(H) and that the relative interiors of their domains have a nonempty intersection. Then
(F_1 + F_2)* = F_1* □ F_2*,
where □ denotes the infimal convolution defined hereafter.
Theorem (First-order differentiability) Let F ∈ Γ_0(H) be strictly convex. Then int(dom(F*)) ≠ ∅ and F* is continuously differentiable on int(dom(F*)). Conversely, if F ∈ Γ_0(H) is differentiable on int(dom(F)), then F* is strictly convex on each convex subset C ⊆ ∇F(int(dom(F))).
Theorem (Second-order differentiability) Assume that F is strongly convex on H with modulus c. Then F* has full domain and a 1/c-Lipschitz continuous gradient. Conversely, if F ∈ Γ_0(H) has a 1/c-Lipschitz continuous gradient on H, then F* is strongly convex with modulus c on each convex subset of dom(∂F*).
The conjugate of the indicator function of a nonempty closed convex set is its support function.
Quadratic function: F(x) := ½⟨Ax, x⟩ + ⟨b, x⟩, with A ∈ R^{n×n} symmetric positive definite. Then F*(u) = ½⟨u − b, A⁻¹(u − b)⟩. If A is only positive semidefinite, we have F*(Ax + b) = ½⟨x, Ax⟩.
The conjugate of the directional derivative at x is the indicator of the subdifferential (see below).
Many other examples exploiting the calculus rules can be found in classical convex analysis monographs (see the bibliography at the end).
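A small numerical illustration of the quadratic example (a sketch, not from the slides; assumes NumPy): for A symmetric positive definite, the sup defining F*(u) is attained at x* = A⁻¹(u − b), and its value matches the closed form.

```python
import numpy as np

rng = np.random.default_rng(1)
M = rng.standard_normal((4, 4))
A = M @ M.T + 4.0 * np.eye(4)      # symmetric positive definite
b = rng.standard_normal(4)
u = rng.standard_normal(4)

F = lambda x: 0.5 * x @ A @ x + b @ x
xstar = np.linalg.solve(A, u - b)  # stationary point of x -> <u, x> - F(x)
closed_form = 0.5 * (u - b) @ np.linalg.solve(A, u - b)
assert np.isclose(u @ xstar - F(xstar), closed_form)
```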
Definition (Infimal convolution) Let F_1 and F_2 be two functions from H to R ∪ {+∞}. Their infimal convolution is the function from H to R ∪ {±∞} defined by:
(F_1 □ F_2)(x) = inf {F_1(x_1) + F_2(x_2) : x_1 + x_2 = x} = inf_{y∈H} F_1(y) + F_2(x − y).
It is called exact at x = x̄_1 + x̄_2 if the infimum is attained at some (not necessarily unique) (x̄_1, x̄_2).
Infimal convolution appears as a "convolution of infinite order" combined with exponentiation (in fact, a convolution in a different algebra).
Proposition Let F_1 and F_2 be convex functions.
If F_1 and F_2 have a common affine minorant, then their inf-convolution is also convex.
The strict epigraphs of F_1 and F_2 add up (in the Minkowski sense) to the strict epigraph of their inf-convolution.
Theorem (Conjugate of an infimal convolution) Let F_1 and F_2 be two proper functions (not necessarily convex) such that the domains of their conjugates have a nonempty intersection. Then
(F_1 □ F_2)* = F_1* + F_2*.
In words, the Legendre-Fenchel conjugate acts like the Fourier transform in the (max, +) algebra.
Property
Domain: dom(F_1 □ F_2) = dom(F_1) + dom(F_2).
Inf-convolution is commutative and associative, preserves order, and its neutral element in Γ_0(H) is ı_{{0}}.
Examples:
Distance function: let C be a nonempty convex subset of H and ‖·‖ an arbitrary norm; then dist_C = ı_C □ ‖·‖.
Let C_1 and C_2 be two nonempty convex subsets; then ı_{C_1} □ ı_{C_2} = ı_{C_1+C_2}.
Moreau envelope: the function F_ρ(x) = inf_{z∈H} (1/(2ρ))‖x − z‖² + F(z) = (F □ (1/(2ρ))‖·‖²)(x), for 0 < ρ < +∞, will be called the Moreau envelope of index ρ of F.
Definition (Directional derivative) A function F admits a one-sided directional derivative at x in the direction d if
F′(x, d) = lim_{t↓0} (F(x + td) − F(x))/t = inf_{t>0} (F(x + td) − F(x))/t
exists with values in [−∞, +∞]. It is two-sided if and only if F′(x, −d) exists and F′(x, −d) = −F′(x, d).
Definition (Subdifferential I) The subdifferential of a function F ∈ Γ_0(H) at x ∈ H is the set-valued map ∂F : H → 2^H,
∂F(x) = {u ∈ H : ∀z ∈ H, F(z) ≥ F(x) + ⟨u, z − x⟩},
i.e. the set of slopes of affine functions minorizing F at x. An element u of ∂F(x) is called a subgradient.
The subdifferential of the indicator function of a closed convex set C is the normal cone of C at x:
N_C(x) = {u ∈ H : ⟨u, x − z⟩ ≥ 0, ∀z ∈ C}.
Definition (Subdifferential II) The subdifferential of F ∈ Γ_0(H) at x ∈ H is the closed convex set whose support function is the directional derivative F′(x, ·):
∂F(x) = {u ∈ H : ⟨u, d⟩ ≤ F′(x, d), ∀d ∈ H}.
Theorem (Properties of the subdifferential) Let F be a convex function.
For fixed x, F′(x, ·) is finite and sublinear (hence convex in d).
Monotonicity: F is convex on a convex set C ⇔ ⟨u_1 − u_2, x_1 − x_2⟩ ≥ 0, ∀u_i ∈ ∂F(x_i), x_i ∈ C, i = 1, 2 (i.e. ∂F is monotone). F is strictly convex on a convex set C ⇔ the inequality is strict for x_1 ≠ x_2 ∈ C, i.e. ⟨u_1 − u_2, x_1 − x_2⟩ > 0 (∂F is strictly monotone). F is strongly convex with modulus c > 0 ⇔ ⟨u_1 − u_2, x_1 − x_2⟩ ≥ c‖x_1 − x_2‖² ⇔ F(x_2) ≥ F(x_1) + ⟨u, x_2 − x_1⟩ + (c/2)‖x_2 − x_1‖², ∀x_2 ∈ H, u ∈ ∂F(x_1) (∂F is strongly monotone).
Continuity: ∂F(x) = {∇F(x)} almost everywhere, except on a set of (Lebesgue) measure zero (kinks). If F is (Gâteaux) differentiable at x, its only subgradient at x is its gradient ∇F(x). Conversely, if ∂F(x) = {u}, then F is (Fréchet) differentiable at x, with ∇F(x) = u.
The subdifferential can be characterized in terms of F and its conjugate F*:
u ∈ ∂F(x) ⇔ F(x) + F*(u) = ⟨x, u⟩ ⇔ x ∈ ∂F*(u).
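A concrete instance of the last equivalence (illustrative sketch, not from the slides; assumes NumPy): take F = ‖·‖_1, so that F* = ı_{B_∞^1}. Any u with u_i = sign(x_i) where x_i ≠ 0 and u_i ∈ [−1, 1] at the kinks is a subgradient, and the Fenchel equality holds.

```python
import numpy as np

x = np.array([1.5, 0.0, -2.0])
u = np.array([1.0, 0.3, -1.0])   # sign(x_i) off the kink, any value in [-1, 1] at x_i = 0
assert np.linalg.norm(u, np.inf) <= 1.0            # u in dom(F*), so F*(u) = 0
assert np.isclose(np.linalg.norm(x, 1), x @ u)     # F(x) + F*(u) = <x, u>
```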
Theorem (Calculus rules with subdifferentials) Let all considered functions be proper convex.
Positive linear combinations: if ∩_i ri(dom(F_i)) ≠ ∅, then ∂(∑_{i=1}^n ρ_iF_i)(x) = ∑_{i=1}^n ρ_i∂F_i(x), ρ_i ≥ 0, i = 1, · · · , n.
Pre-composition with an affine mapping: let A := A_0 · + b, with A_0 linear, be such that Im(A) ∩ ri(dom(F)) ≠ ∅. Then ∂(F ∘ A)(x) = A_0*∂F(Ax).
Pointwise supremum: F(x) := sup_{i∈I} F_i(x), where I is compact. Let I(x) = {i : F(x) = F_i(x)} (the active set). Then
∂F(x) = { ∑_{i∈I(x)} ρ_iu_i : u_i ∈ ∂F_i(x), ρ_i ≥ 0 for all i ∈ I(x), ∑_{i∈I(x)} ρ_i = 1 } = convhull( ∪_{i∈I(x)} ∂F_i(x) ).
Theorem (Subdifferential III) Let F ∈ Γ_0(H). A point u is a subgradient of F at x if and only if (u, −1) is normal to epi(F) at (x, F(x)); i.e.
N_{epi(F)}(x, F(x)) = ∪_{λ≥0} λ(∂F(x) × {−1}).
In other words, the intersection of the normal cone of epi(F) with the slice H × {−1} is just the subdifferential ∂F(x) shifted vertically by −1.
[Figure: at a kink of F, the subdifferential is a nontrivial set and the normal cone to epi(F) fans out accordingly.]
The duality formula stated below plays an important role in dualizing optimization problems (e.g. proximity operator calculus, ADMM for the augmented Lagrangian method, and many, many other situations).
Theorem (Fenchel-Rockafellar duality) Let F ∈ Γ_0(H), G ∈ Γ_0(K), and A := A_0 · − b : H → K a bounded affine operator, where H and K are finite-dimensional real Hilbert spaces (as supposed from the beginning). Suppose that 0 ∈ ri(dom(G)) − A(ri(dom(F))). Then
inf_{x∈H} F(x) + G(A(x)) = −min_{u∈K} F*(−A_0*u) + G*(u) + ⟨u, b⟩,
with the relationships between x* and u*, respectively the solutions of the primal and the dual problems:
F(x*) + F*(−A_0*u*) = ⟨−A_0*u*, x*⟩,  G(Ax*) + G*(u*) = ⟨u*, Ax*⟩,
x* ∈ ∂F*(−A_0*u*) and u* ∈ ∂G(Ax*),  −A_0*u* ∈ ∂F(x*) and Ax* ∈ ∂G*(u*).
(P): inf_{x∈H} F(x) + G(A(x))
is equivalent to
inf_{(x,z)∈H×K} F(x) + G(z) s.t. z = Ax.
This is a minimization problem in H × K with an equality constraint with values in K, which lends itself to Lagrange duality: form the Lagrangian L(x, z, u), with dual variable u ∈ K:
L(x, z, u) = F(x) + G(z) + ⟨u, Ax − z⟩.
The associated closed concave dual function is:
H(u) = inf_{x,z} L(x, z, u) = −sup_{x,z} [⟨u, b⟩ + (⟨−A_0*u, x⟩ − F(x)) + (⟨u, z⟩ − G(z))].
By conjugacy calculus rules we obtain
H(u) = −[F*(−A_0*u) + G*(u) + ⟨u, b⟩].
The (Lagrange) dual problem is then:
(Q): max_{u∈K} H(u) = −min_{u∈K} F*(−A_0*u) + G*(u) + ⟨u, b⟩.
Proposition Let A : R^n → R^m be a linear operator with nonempty range. Then the following primal and dual problems are equivalent:
(P): inf_{x∈R^n} ½‖y − Ax‖_2² + λ‖x‖_1,
(Q): min_{u∈R^m} ‖y − u‖_2 s.t. ‖A*u‖_∞ ≤ λ.
The primal solution of (P) is related to the dual one (i.e. that of (Q)) by Ax* = y − u*.
Proof: use the Fenchel-Rockafellar duality lemma, conjugacy calculus rules (quadratic function, norm, translation, scaling), and continuity properties of the conjugate.
[Figure: geometry of the dual problem (Q) — u* is the projection of y onto the feasible set {u : ‖A*u‖_∞ ≤ λ}, whose facets lie on the hyperplanes ⟨a_i, u⟩ = ±λ.]
(P) min_{x∈H} F(x), F ∈ Γ_0(H).
Theorem (Minimality conditions) Assume that the set of minimizers is nonempty, e.g. by coercivity. The following statements are equivalent:
(i) x* is a global minimizer of F ∈ Γ_0(H) over H;
(ii) 0 ∈ ∂F(x*);
(iii) F′(x*, d) ≥ 0 for all d;
(iv) x* is a solution of the fixed-point equation x = (I + µ∂F)⁻¹(x), µ > 0.
The fixed-point equation in (iv) underlies the proximal iteration (or proximal point algorithm). Why? Keep listening.
(I + µ∂F)⁻¹ is the resolvent associated to the subdifferential; see shortly.
The above statements can be generalized to minimizers relative to a closed convex set (in terms of the normal and tangent cones), i.e. convex programming with nonsmooth objectives. This path will deliberately not be pursued here because constraints are implicit in F.
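A one-dimensional illustration of (iv) (a sketch under hypothetical data, not from the slides; assumes NumPy): for F(x) = ½(x − y)² + λ|x|, the global minimizer is the soft-thresholding soft(y, λ), and prox_{µF} has a closed form obtained by completing the square; the minimizer is indeed a fixed point of prox_{µF}.

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

y, lam, mu = 2.5, 1.0, 0.7
xstar = soft(y, lam)                                  # global minimizer of F
prox_muF = lambda x: soft((x + mu * y) / (1 + mu),    # closed form of prox_{mu F}
                          mu * lam / (1 + mu))
assert np.isclose(prox_muF(xstar), xstar)             # (iv): x* = (I + mu dF)^{-1}(x*)
```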
We consider again (P): min_{x∈H} F(x), F ∈ Γ_0(H).
Follow the footprints of (possibly projected) gradient descent for smooth optimization: replace the gradient by a subgradient u_k ∈ ∂F(x_k). However, serious difficulties arise:
no line search based on decreasing F is possible, simply because −u_k may not be a descent direction (e.g. think of the ℓ_1-norm); hence oscillations in the objective (non-monotonic behaviour);
an arbitrary u_k is so weak a surrogate that the resulting sequence might not minimize F.
How to choose µ_k?
Initialization: choose a sequence of step sizes (µ_k)_{k∈N}, µ_k > 0; choose an initial x_0 ∈ dom(F) and obtain u_0 ∈ ∂F(x_0); k = 0.
Main iteration: construct a sequence of iterates (x_k)_{k∈N} as follows:
repeat
  x_{k+1} = x_k − µ_k u_k / max(‖u_k‖, 1).
  Get u_{k+1} ∈ ∂F(x_{k+1});
  k ← k + 1.
until convergence;
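A minimal sketch of this scheme (illustrative, not from the slides; assumes NumPy, data are synthetic) on the nonsmooth problem min_x ‖Ax − b‖_1, for which A^⊤ sign(Ax − b) is a subgradient; the best objective value is tracked since F(x_k) is not monotone.

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((30, 5))
b = A @ rng.standard_normal(5) + 0.1 * rng.standard_normal(30)
F = lambda x: np.linalg.norm(A @ x - b, 1)

x, best = np.zeros(5), np.inf
for k in range(5000):
    u = A.T @ np.sign(A @ x - b)        # a subgradient of F at x
    mu = 1.0 / np.sqrt(k + 1.0)         # mu_k -> 0 and sum_k mu_k = +infinity
    x = x - mu * u / max(np.linalg.norm(u), 1.0)
    best = min(best, F(x))              # F(x_k) oscillates; keep the best value
```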
Theorem (Global convergence of subgradient descent) Let F ∈ Γ_0(H), and apply the subgradient descent algorithm with a sequence of step sizes satisfying:
lim_{k→∞} µ_k = 0 and ∑_{k∈N} µ_k = +∞.
Then F(x_k) → inf_x F(x) and x_k → x* ∈ M*, x* not necessarily unique.
Typical choices: µ_k = 1/(k + 1)^p, p ∈ (0, 1], or µ_k = 1/((k + 1) log(k + 1)).
The step sizes are not easy to choose in practice, and some sequences lead to very slow convergence. More elaborate choices are possible in the literature, but extra information, such as knowledge about the solution set, is needed. The choice is made even more complicated by floating-point computations: it is hard to satisfy both step-size requirements simultaneously and accurately.
The stopping rule is not convenient: u_k has no reason to tend to 0. One rather stops when µ_k becomes very small (compared to the scale of the problem).
Theorem (Complexity result) Let F be a nonsmooth convex function. Then no iterative scheme to minimize F relying only on its first-order properties (i.e. F and ∂F) can achieve a better rate than O(1/√k) on the objective.
In other words, we need O(1/ε²) iterations to reach an ε-accurate solution on the objective. Other methods circumvent these difficulties; many of them exploit the structure of the problem, as we are about to do.
The notion of a proximity operator was introduced in [J.-J. Moreau 1962] as a generalization of the convex projection operator.
Definition (Proximity operator) Let F ∈ Γ_0(H). Then, for every x ∈ H, the function z ↦ ½‖x − z‖² + F(z) achieves its infimum at a unique point denoted prox_F x. The single-valued operator prox_F : H → H thus defined is the proximity operator of F. It will be convenient to introduce the reflection operator rprox_F = 2 prox_F − I.
Theorem (Some properties of the proximity operator) Let F ∈ Γ_0(H) and x, p ∈ H. Then
p = prox_F x ⇔ x − p ∈ ∂F(p).
Equivalently, prox_F = (I + ∂F)⁻¹: prox_F is the resolvent of the subdifferential of F, a maximal monotone operator from H to 2^H.
Continuity: the proximity operator is firmly nonexpansive. Hence it is nonexpansive, and so is its reflection operator; therefore both are continuous from H into itself.
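A numerical illustration (sketch, not from the slides; assumes NumPy): soft-thresholding, which appears below as the proximity operator of the ℓ_1-norm, satisfies both the subdifferential characterization x − p ∈ ∂(γ‖·‖_1)(p) and the firm nonexpansiveness inequality ‖p − q‖² ≤ ⟨p − q, x − z⟩.

```python
import numpy as np

def prox_l1(x, gamma):                  # prox of gamma * ||.||_1 (soft-thresholding)
    return np.sign(x) * np.maximum(np.abs(x) - gamma, 0.0)

rng = np.random.default_rng(2)
gamma = 0.5
x, z = rng.standard_normal(6), rng.standard_normal(6)
p, q = prox_l1(x, gamma), prox_l1(z, gamma)

# x - p in gamma * d||.||_1(p): |x_i - p_i| <= gamma, equal to gamma*sign(p_i) where p_i != 0
assert np.all(np.abs(x - p) <= gamma + 1e-12)
assert np.allclose((x - p)[p != 0], gamma * np.sign(p[p != 0]))
# Firm nonexpansiveness
assert (p - q) @ (p - q) <= (p - q) @ (x - z) + 1e-12
```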
Definition (Moreau envelope) The function
F_ρ(x) = inf_{z∈H} (1/(2ρ))‖x − z‖² + F(z),
for 0 < ρ < +∞, is the Moreau envelope of index ρ of F. F_ρ is also the infimal convolution of F with (1/(2ρ))‖·‖².
Lemma Let F ∈ Γ_0(H). Then its Moreau envelope F_ρ is convex and Fréchet-differentiable with 1/ρ-Lipschitz gradient
∇F_ρ = (I − prox_{ρF})/ρ.
Furthermore, its proximity operator is the convex combination
prox_{F_ρ}(x) = (ρ/(1 + ρ)) x + (1/(1 + ρ)) prox_{(1+ρ)F}(x).
Because of the C^{1,1}-smoothness of F_ρ, the Moreau envelope is also known as the Moreau-Yosida regularization of F.
Lemma (Moreau identity) Let F ∈ Γ_0(H); then, for any x ∈ H,
prox_{ρF*}(x) + ρ prox_{F/ρ}(x/ρ) = x, ∀ 0 < ρ < +∞.
Corollary Let F ∈ Γ_0(H); then, for any x ∈ H,
prox_{F*} = I − prox_F and prox_{F*}(x) ∈ ∂F(prox_F(x)).
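A quick check of the Moreau identity for F = ‖·‖_1 and ρ = 1 (sketch, not from the slides; assumes NumPy): prox_{F*} is then the projection onto the unit ℓ_∞-ball (a clipping), prox_{F}(x) reduces to soft-thresholding at level 1, and the two must sum to x.

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

x = np.linspace(-3.0, 3.0, 13)
proj = np.clip(x, -1.0, 1.0)                 # prox of F* = indicator of the unit l_inf ball
assert np.allclose(proj + soft(x, 1.0), x)   # Moreau identity with rho = 1
```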
Proposition (Simple calculus rules) Let F ∈ Γ_0(H) and x ∈ H.
Quadratic perturbation: if G := F + (ζ/2)‖·‖² + ⟨u, ·⟩ + t, with ζ ≥ 0, u ∈ H, t ∈ R, then prox_G x = prox_{F/(ζ+1)}((x − u)/(ζ + 1)).
Separability: let F be such that F(α) = ∑_{i=1}^n F_i(α_i), α_i ∈ H_i, with F_i ∈ Γ_0(H_i). Then F is in Γ_0(H_1 × · · · × H_n) and prox_F = (prox_{F_i})_{1≤i≤n}.
Many others are available or can be calculated.
Lemma Let F ∈ Γ_0(K) and A = A_0 · − y, where A_0 : H → K is a bounded linear operator.
(i) If A_0 is a tight frame with constant c, then
prox_{F∘A}(x) = x + c⁻¹A_0*(prox_{cF} − I)(A_0x − y).
(ii) If A_0 is a general frame with bounds c_1 and c_2, let µ_k ∈ (0, 2/c_2) and define
u_{k+1} = µ_k (I − prox_{µ_k⁻¹F})(µ_k⁻¹u_k + A(p_k)),
p_{k+1} = x − A_0*u_{k+1}.
Then p_k → prox_{F∘A}(x) linearly.
(iii) If c_1 = 0 and F∘A ∈ Γ_0(H) (typically if A is such that ri(dom(F)) ∩ Im(A) ≠ ∅), apply the above iteration with µ_k ∈ (0, 2/c_2); then p_k → prox_{F∘A}(x) at the rate O(1/k).
Multi-step (e.g. inertial, see in the sequel) algorithms can be used as well (with linear or O(1/k²) rates). Robustness to errors.
Lemma Let F_1 ∈ Γ_0(H), F_2 ∈ Γ_0(K), and A : H → K a bounded linear operator, and set F = F_1 + F_2 ∘ A. Assume:
A.1 Im(A) ≠ ∅.
A.2 0 ∈ ri(A dom(F_1) − dom(F_2)) (here in finite dimension).
A.3 The proximity operators of F_1 and F_2 are simple to compute analytically.
Let µ_k ∈ (0, 2/‖A‖²). Define the recursion
u_{k+1} = µ_k (I − prox_{F_2/µ_k})(u_k/µ_k + A prox_{F_1}(x − A*u_k)).
Then u_k → u*, and p_k = prox_{F_1}(x − A*u_k) → prox_F(x).
Multi-step algorithms can be used as well (on the dual, as above). The convergence rate can be made precise (linear or O(1/k^s), s = 1, 2) under additional assumptions. Robustness to errors (see in a little while). For A = I, other algorithms are possible: Douglas-Rachford or the Dykstra algorithm (on the primal). Alternative: augmented Lagrangian solved by ADMM (for A injective), or primal-dual schemes (see in the sequel).
Thresholding/shrinkage operators: e.g. soft-thresholding for the ℓ_1-norm. Available for many other functions in the literature, whether regularization penalties or data-fidelity terms.
Theorem Let Ψ(x) = ∑_i ψ(x_i). Suppose that ψ satisfies: (i) ψ is convex, even-symmetric, non-negative, non-decreasing on [0, +∞), and ψ(0) = 0; (ii) ψ is twice differentiable on R∖{0}; (iii) ψ is continuous on R, not necessarily smooth at zero, and admits a positive right derivative at zero, ψ′_+(0) = lim_{h→0⁺} ψ(h)/h > 0. Then the proximity operator prox_{κΨ}(x) has exactly one solution, continuous and decoupled in each coordinate x_i:
x̂_i = prox_{κψ}(x_i) = 0 if |x_i| ≤ κψ′_+(0),
x̂_i = x_i − κψ′(x̂_i) if |x_i| > κψ′_+(0),
the second case being an implicit equation in x̂_i.
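For ψ(α) = |α| the theorem yields exactly soft-thresholding, x̂_i = sign(x_i) max(|x_i| − κ, 0). A brute-force check (sketch, not from the slides; assumes NumPy and SciPy) against a direct one-dimensional minimization of ½(z − x)² + κ|z|:

```python
import numpy as np
from scipy.optimize import minimize_scalar

def soft(x, t):
    return np.sign(x) * max(abs(x) - t, 0.0)

kappa = 0.7
for x in (-2.0, -0.3, 0.0, 0.5, 1.9):
    direct = minimize_scalar(lambda z: 0.5 * (z - x) ** 2 + kappa * abs(z)).x
    assert abs(direct - soft(x, kappa)) < 1e-5
```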
[Figure: sparsity penalties ψ(α) — |α|, |α|^{1.2}, Huber, Ni and Huo — and the corresponding proximity operators prox_ψ(α); the nonsmoothness of ψ at zero creates a thresholding region around the origin.]
(P): min_{x∈H} ∑_{i=1}^n F_i(x), where F_i : H → R ∪ {+∞}, F_i ∈ Γ_0(H), and ∩_i dom(F_i) ≠ ∅. We assume M* ≠ ∅ in the sequel to avoid trivialities.
Theorem
(i) Existence: (P) possesses at least one solution if F = ∑_i F_i is coercive; then M* ≠ ∅.
(ii) Uniqueness: (P) possesses at most one solution if F is strictly convex.
(iii) Characterization: let x ∈ H. Then the following statements are equivalent: (a) x solves (P); (b) x = prox_{γF}(x) for any γ > 0 (proximal point algorithm [Martinet 1972]).
Explicit computation of prox_{γF} is difficult in general.
Idea: replace explicit evaluation of prox_{γ(∑_i F_i)} by a sequence of calculations involving only each prox_{γF_i} at a time.
Splitting methods and their assumptions:
Forward-Backward [Gabay 83, Tseng 91]: either F_1 or F_2 has a Lipschitz-continuous gradient.
Backward-Backward [Lions 78]: F_1, F_2 nonsmooth, but the iterates do not converge to (∂F)⁻¹(0), only to ∩_i(∂F_i)⁻¹(0); suited to problems with sums of indicator functions or Moreau envelopes.
Douglas/Peaceman-Rachford [Douglas-Rachford 56, Lions-Mercier 79]: F_1, F_2 nonsmooth. Most general.
Alternating-Direction Method of Multipliers (ADMM) [Glowinski et al. 70's, Gabay et al. 80's]: F_1, F_2 nonsmooth, composition with an injective linear operator.
Primal-dual splitting [Arrow-Hurwicz 1956, Chambolle-Pock 2011]: F_1 and F_2 nonsmooth, composition with an arbitrary linear operator.
Generalized Forward-Backward [Raguet, Fadili and Peyré 2013]: F_1 smooth, all others nonsmooth.
Spingarn's method (Douglas/Peaceman-Rachford on a product space), parallel splitting [Spingarn 83, Combettes et al. 08]: all F_i nonsmooth.
Projective splitting, parallel splitting [Eckstein 09]: all F_i nonsmooth.
Primal-dual splitting (product-space trick) [Combettes et al. 2011]: all F_i, smooth or not, composition with linear operators, infimal convolution.
[Figure: one iteration alternates a forward (explicit gradient) step and a backward (implicit proximal) step.]
(P): min_{x∈H} F_1(x) + F_2(x),
where F_i : H → R ∪ {+∞}, F_i ∈ Γ_0(H); ∩_i dom(F_i) ≠ ∅; the set of minimizers M* is nonempty (e.g. by coercivity); F_2 has a β-Lipschitz gradient.
Initialization: choose some x_0 ∈ dom(F), and a sequence or a fixed step size µ_k ∈ (0, 2/β).
Main iteration:
repeat
  x_{k+1/2} = x_k − µ_k∇F_2(x_k).
  x_{k+1} = prox_{µ_kF_1}(x_{k+1/2}).
  k ← k + 1.
until convergence;
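A minimal sketch of this iteration (not from the slides; assumes NumPy, synthetic data) for the LASSO instance F_1 = λ‖·‖_1, F_2 = ½‖A·−y‖_2², whose gradient ∇F_2(x) = A^⊤(Ax − y) is β-Lipschitz with β = ‖A‖²:

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

rng = np.random.default_rng(4)
A = rng.standard_normal((40, 80))
x0 = np.zeros(80); x0[:5] = rng.standard_normal(5)     # sparse ground truth
y = A @ x0 + 0.01 * rng.standard_normal(40)
lam = 0.1

beta = np.linalg.norm(A, 2) ** 2       # Lipschitz constant of grad F2
mu = 1.0 / beta                        # step size in (0, 2/beta)
x = np.zeros(80)
for k in range(500):
    x_half = x - mu * A.T @ (A @ x - y)    # forward (explicit gradient) step on F2
    x = soft(x_half, mu * lam)             # backward (proximal) step on F1
```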
Theorem Suppose that F_1 and F_2 ∈ Γ_0(H), F_2 has a β-Lipschitz continuous gradient, and M* ≠ ∅. Let µ_k ∈ (0, 2/β), relaxation parameters λ_k ∈ (0, 1] with inf_k λ_k > 0, and let (a_k)_{k∈N} and (b_k)_{k∈N} be error sequences in H such that ∑_k ‖a_k‖ < +∞ and ∑_k ‖b_k‖ < +∞. Fix x_0 ∈ H, and define the sequence of iterates:
x_{k+1} = (1 − λ_k)x_k + λ_k (prox_{µ_kF_1}(x_k − µ_k(∇F_2(x_k) + b_k)) + a_k).
Then x_k converges to a minimizer x* ∈ M*.
Theorem Consider the errorless and unrelaxed version of the above forward-backward iteration. Then F(x_k) − F(x*) = O(1/k); if, in addition, the problem is strongly convex (e.g. F_2 strongly convex), the convergence is linear on the iterates and the objective.
Robustness to errors in the proximity operator and in the gradient.
The 1/k convergence rate in the objective is not surprising for a one-memory first-order scheme (recall projected gradient descent). Can we attain the complexity upper-bound rate 1/k²? Yes: multi-step schemes by [Nesterov 2007, Beck-Teboulle 09, Tseng 09, Chambolle-Dossal 16].
Initialization: choose some x_0 ∈ dom(F), a sequence or a fixed step size µ_k ∈ (0, 1/β], k = 1, a > 2.
Main iteration:
repeat
  y_k = x_k + ((k − 1)/(k + a))(x_k − x_{k−1}).
  x_{k+1} = prox_{µ_kF_1}(y_k − µ_k∇F_2(y_k)).
  k ← k + 1.
until convergence;
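The same LASSO instance with the inertial step (sketch, not from the slides; assumes NumPy; a = 3 is a hypothetical choice consistent with a > 2):

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

rng = np.random.default_rng(4)
A = rng.standard_normal((40, 80))
y = A @ np.r_[rng.standard_normal(5), np.zeros(75)]
lam, a = 0.1, 3.0
mu = 1.0 / np.linalg.norm(A, 2) ** 2   # step size in (0, 1/beta]

x = x_prev = np.zeros(80)
for k in range(1, 500):
    yk = x + (k - 1.0) / (k + a) * (x - x_prev)        # inertial extrapolation
    x_prev = x
    x = soft(yk - mu * A.T @ (A @ yk - y), mu * lam)   # forward-backward step at yk
```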
Theorem Consider the FISTA algorithm under the same assumptions as before. Then:
(a) x_k converges to a minimizer of (P);
(b) F(x_k) − F(x*) = o(1/k²).
Robustness to errors holds, but errors may degrade the rates. Under strong convexity, the convergence is linear, with a better rate than the forward-backward.
The 1/k² rate in the objective is optimal for first-order schemes on this class of problems.
(P): min_{x∈H} F_1(x) + F_2(x), where F_i : H → R ∪ {+∞}, F_i ∈ Γ_0(H); ∩_i ri(dom(F_i)) ≠ ∅; the set of minimizers M* is nonempty (e.g. by coercivity).
x is a (global) minimizer of (P)
⇔ 0 ∈ ∂(F_1 + F_2)(x)
⇔ ∃z ∈ H, z − x ∈ ∂(γF_1)(x) and x − z ∈ ∂(γF_2)(x), γ > 0
⇔ x = prox_{γF_1}(z) and (2x − z) − x ∈ ∂(γF_2)(x)
⇔ x = prox_{γF_1}(z) and x = prox_{γF_2}(2x − z) = prox_{γF_2}(rprox_{γF_1}(z))
⇔ x = prox_{γF_1}(z) and z = 2 prox_{γF_2}(rprox_{γF_1}(z)) − (2x − z)
⇔ x = prox_{γF_1}(z) and z = 2 prox_{γF_2}(rprox_{γF_1}(z)) − rprox_{γF_1}(z) = rprox_{γF_2}(rprox_{γF_1}(z))
⇔ z = (1 − λ/2)z + (λ/2) rprox_{γF_2}(rprox_{γF_1}(z)), λ ∈ (0, 2]
⇔ z ∈ Fix((1 − λ/2)I + (λ/2) rprox_{γF_2} ∘ rprox_{γF_1}) and x = prox_{γF_1}(z) ∈ M*.
Initialization: choose some z_0 ∈ H, λ_k ∈ (0, 2), γ > 0.
Main iteration:
repeat
  z_{k+1/2} = 2 prox_{γF_1}(z_k) − z_k.
  z_{k+1} = (1 − λ_k/2)z_k + (λ_k/2)(2 prox_{γF_2}(z_{k+1/2}) − z_{k+1/2}).
  k ← k + 1.
until convergence;
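A compact sketch of this iteration (not from the slides; assumes NumPy) on F_1 = ‖·‖_1 and F_2 = ½‖· − y‖², whose sum has the known minimizer soft(y, 1), so convergence can be asserted directly:

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

y = np.array([3.0, -0.4, 1.2, -2.5])
gamma, lam = 1.0, 1.0                                   # gamma > 0, relaxation in (0, 2)
prox1 = lambda z: soft(z, gamma)                        # prox of gamma * ||.||_1
prox2 = lambda z: (z + gamma * y) / (1.0 + gamma)       # prox of gamma * (1/2)||. - y||^2

z = np.zeros_like(y)
for k in range(200):
    z_half = 2.0 * prox1(z) - z                         # rprox of gamma * F1
    z = (1 - lam / 2) * z + (lam / 2) * (2.0 * prox2(z_half) - z_half)
x = prox1(z)
assert np.allclose(x, soft(y, 1.0), atol=1e-6)          # known minimizer of F1 + F2
```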
Theorem Let γ ∈ (0, +∞), let (λ_k)_{k∈N} be a sequence in (0, 2), and let (a_k)_{k∈N} and (b_k)_{k∈N} be error sequences in H such that ∑_{k∈N} λ_k(2 − λ_k) = +∞ and ∑_{k∈N} λ_k(‖a_k‖ + ‖b_k‖) < +∞. Fix z_0 ∈ H and define the sequence of iterates
z_{k+1/2} = prox_{γF_1}(z_k) + b_k,
z_{k+1} = z_k + λ_k(prox_{γF_2}(2z_{k+1/2} − z_k) + a_k − z_{k+1/2}).
Then z_k converges to some fixed point z*, and x* = prox_{γF_1}(z*) ∈ M*.
Again, robustness to errors in both proximity operators. Convergence rates in a variety of situations: asymptotic regularity, strong convexity, partial smoothness [Liang, Fadili and Peyré 2015, 2017].
Remember the composition lemma.
(P): inf_{x∈H} F(x) + G(A(x)) ⇔ (P*): min_{u∈K} F*(−A*u) + G*(u),
where F ∈ Γ_0(H), G ∈ Γ_0(K); A : H → K is a bounded and injective linear operator; a domain qualification condition holds; M* ≠ ∅.
Solving (P): apply DR to (P*).
Use Fenchel-Rockafellar duality to compute the proximity operator of F* ∘ (−A*); injectivity is important to ensure strong monotonicity, hence uniqueness of the minimizer in x:
x_{k+1} = argmin_{x∈H} F(x) + ⟨u_k, Ax⟩ + (γ/2)‖Ax − v_k‖².
Use Fenchel-Rockafellar duality (in fact the Moreau identity) to compute the proximity operator of G*:
v_{k+1} = argmin_{v∈K} G(v) − ⟨u_k, v⟩ + (γ/2)‖Ax_{k+1} − v‖² = prox_{G/γ}(Ax_{k+1} + u_k/γ).
Update the dual variable:
u_{k+1} = u_k + γ(Ax_{k+1} − v_{k+1}).
These steps minimize the augmented Lagrangian function associated with (P), alternately in x and v, followed by a dual ascent step.
Theorem Consider the convex program (P), where A is injective. Let γ ∈ (0, +∞), and let (a_k)_{k∈N} and (b_k)_{k∈N} be summable error sequences in H and K. Solve (P) using the ADMM, where the sub-problems for updating x_k and v_k are solved either exactly or inexactly up to these errors. Then x_k converges to a solution of (P) and u_k converges to a solution of the dual problem (P*).
Again, robustness to errors in both proximity operators. Convergence rates in a variety of situations: asymptotic regularity, strong convexity, partial smoothness [Liang, Fadili and Peyré 2015, 2017]. Flexibility in the choice of the splitting to ensure injectivity.
(P): inf_{x∈H} F(x) + G(A(x)) ⇔ (P*): min_{u∈K} F*(−A*u) + G*(u),
where F ∈ Γ_0(H), G ∈ Γ_0(K); A : H → K is a linear operator; a domain qualification condition holds; M* ≠ ∅.
Lemma (x, u) is a Kuhn-Tucker pair if and only if
(0, 0) ∈ (∂F(x) × ∂G*(u)) + (A*u, −Ax), i.e. 0 ∈ T_1(x, u) + T_2(x, u),
where T_1 := ∂F × ∂G* and T_2 := (x, u) ↦ (A*u, −Ax). T_1 and T_2 are maximal monotone, and T_2 is linear and skew-adjoint.
T_2 is Lipschitz but not cocoercive ⇒ the forward-backward does not apply.
To compensate for the lack of cocoercivity: Forward-Backward-Forward [Tseng 98], or forward-backward in a different metric [Chambolle-Pock 2011, Yuan-He 2011].
(P): inf_{x∈H} F(x) + G(A(x)) ⇔ (P*): min_{u∈K} F*(−A*u) + G*(u),
with the same setting as above (A : H → K an arbitrary linear operator).
A preconditioned version of ADMM [Chambolle-Pock 2011]. The trick is to precondition the update of x_{k+1}, with τγ < 1/‖A‖²:
x_{k+1} = argmin_{x∈H} F(x) + ⟨u_k, Ax⟩ + (γ/2)‖Ax − v_k‖² + ½⟨(τ⁻¹I − γA*A)(x − x_k), x − x_k⟩.
This is equivalent to:
x_{k+1} = prox_{τF}(x_k − τA*x̄_k), where x̄_k := u_k + γ(Ax_k − v_k).
The other steps remain unchanged.
Theorem Consider the convex program (P), where A is a bounded linear operator. Let γ ∈ (0, +∞) with τγ < 1/‖A‖², and solve it with the preconditioned ADMM. Then the sequence of primal-dual pairs converges to a Kuhn-Tucker point. Furthermore, the (partial) restricted gap converges at the rate O(1/k).
An applicable algorithm for a wide spectrum of problems. Robustness to errors as well. Can be accelerated with multi-step schemes for strongly convex objectives.
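A sketch of a primal-dual iteration of this family, written in its common Chambolle-Pock form (not from the slides; assumes NumPy, synthetic data), applied to min_x λ‖x‖_1 + ½‖Ax − y‖² with F = λ‖·‖_1 and G = ½‖· − y‖², for which prox_{σG*}(w) = (w − σy)/(1 + σ):

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

rng = np.random.default_rng(5)
A = rng.standard_normal((20, 50))
y = rng.standard_normal(20)
lam = 0.5

tau = sigma = 0.9 / np.linalg.norm(A, 2)     # tau * sigma * ||A||^2 < 1
x = np.zeros(50); xbar = x.copy(); u = np.zeros(20)
for k in range(1000):
    u = (u + sigma * (A @ xbar - y)) / (1.0 + sigma)   # u <- prox_{sigma G*}(u + sigma A xbar)
    x_old = x
    x = soft(x - tau * A.T @ u, tau * lam)             # x <- prox_{tau F}(x - tau A* u)
    xbar = 2.0 * x - x_old                             # extrapolation
```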
(P): min_{x∈H} ∑_{i=1}^n F_i(x), where F_i : H → R ∪ {+∞}, F_i ∈ Γ_0(H); ∩_i ri(dom(F_i)) ≠ ∅; the set of minimizers M* is nonempty (e.g. by coercivity).
(P): min_{x∈H} ∑_{i=1}^n F_i(x).
Define the closed subspace S = {(x_1, · · · , x_n) ∈ H^n : ∑_i x_i = 0} and its orthogonal complement S⊥ = {(x_1, · · · , x_n) ∈ H^n : x_1 = x_2 = · · · = x_n}. Let N_{S⊥} be the normal cone of S⊥, i.e. the subdifferential of ı_{S⊥}.
(P) is equivalent to min_{(x_1,··· ,x_n)} ∑_{i=1}^n F_i(x_i) + ı_{S⊥}(x_1, · · · , x_n).
Remark that ∂(∑_i F_i(x_i)) = ∂F_1(x_1) × · · · × ∂F_n(x_n). Thus
0 ∈ ∂F(x) ⇔ 0 ∈ ×_i ∂F_i(x_i) + N_{S⊥}(x_1, · · · , x_n) ⇔ x_1 = · · · = x_n, ∃u_i ∈ ∂F_i(x_i), ∑_i u_i = 0.
Applying the Douglas-Rachford splitting to this problem produces Spingarn's method: perform independent proximal steps on each of the functions F_i (the problem is separable, and so are the proximity operators), then compute the next iterate by essentially averaging the results.
Initialization: choose (y_i^0)_{1≤i≤n} ∈ H^n, γ ∈ (0, +∞), weights w_i ∈ (0, 1] that sum to 1 (e.g. 1/n), and let x_0 = ∑_{i=1}^n w_i y_i^0.
Main iteration:
repeat
  for i = 1 to n do
    z_i^k = prox_{γw_iF_i}(y_i^k).
  x_{k+1} = ∑_{i=1}^n w_i z_i^k.
  for i = 1 to n do
    y_i^{k+1} = y_i^k + 2x_{k+1} − x_k − z_i^k.
until convergence;
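A scalar sketch of this scheme (not from the slides; assumes NumPy, hypothetical data) for n = 3 with F_1 = ½(·−a)², F_2 = ½(·−b)², F_3 = λ|·|, whose sum has the closed-form minimizer soft((a+b)/2, λ/2) used to check convergence; with equal weights w_i = 1/n, all proxes use the parameter c = γw_i.

```python
import numpy as np

def soft(x, t):
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

a, b, lam, gamma, n = 3.0, 1.0, 0.8, 1.0, 3
w = np.full(n, 1.0 / n)
c = gamma * w[0]                                  # prox parameter gamma * w_i
proxes = (lambda s: (s + c * a) / (1 + c),        # prox of c * (1/2)(. - a)^2
          lambda s: (s + c * b) / (1 + c),        # prox of c * (1/2)(. - b)^2
          lambda s: soft(s, c * lam))             # prox of c * lam |.|

y = np.zeros(n)
x = float(w @ y)
for k in range(2000):
    z = np.array([proxes[i](y[i]) for i in range(n)])   # independent proximal steps
    x_new = float(w @ z)                                # averaging
    y = y + 2.0 * x_new - x - z
    x = x_new
assert np.isclose(x, soft((a + b) / 2.0, lam / 2.0), atol=1e-5)
```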
Theorem Let γ ∈ (0, +∞), and let the errors made when evaluating each proximity operator prox_{F_i} be summable over k ∈ N. Assume the functions F_i satisfy a qualification condition on the intersection of the relative interiors of their domains. Then x_k converges to a solution of (P).
Again, robustness to errors in each proximity operator. Convergence rates in [Liang, Fadili and Peyré 2015].
Bibliography
M. Fortin and R. Glowinski, eds., Augmented Lagrangian Methods: Applications to the Numerical Solution of Boundary-Value Problems, North-Holland, Amsterdam, 1983.
R.T. Rockafellar, The Theory of Subgradients and its Applications to Problems of Optimization: Convex and Nonconvex Functions, Heldermann-Verlag, Berlin, 1981.
H.H. Bauschke et al., eds., Fixed-Point Algorithms for Inverse Problems in Science and Engineering, Springer-Verlag, 2011.
Cambridge University Press, 2010.