

SLIDE 1

Introduction Optimality Conditions

Nonlinear Optimization I

  • Dr. Thomas M. Surowiec

Humboldt University of Berlin Department of Mathematics

Summer 2013

  • Dr. Thomas M. Surowiec

BMS Course NLO, Summer 2013

SLIDE 2

Course Information

1. Lectures: Tuesdays 15–17, RUD 26 1.304; Thursdays 11–13, RUD 1.013

2. Recitations: Thursdays 15–17, RUD 25 3.006

3. Office Hours: By appointment.

Office: RUD 2.425
Phone: x5497
Email: surowiec@math.hu-berlin.de
Website: http://www.mathematik.hu-berlin.de/~surowiec/


SLIDE 3

Course Information & Grading

1. Lecture Notes, Slides: Complete lecture notes (should be!) available at some point during the semester. Slides available following each lecture on my website. Slides = Lecture Notes − Proofs − Prose.

2. Homework: One homework series per week with 4 problems. Problems are to be solved on the board during the following recitation.

3. Final Exam: oral.


SLIDE 4

Definition of an Optimization Problem

Definition (Minimization Problem)
Let X ⊂ R^n be an arbitrary set and f : X → R a continuous function. The problem is to find an x* ∈ X such that

    f(x*) ≤ f(x)  ∀x ∈ X.

Alternate formulations:

    min f(x) subject to (s.t.) x ∈ X,   or   min_{x ∈ X} f(x).
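As a concrete illustration (the f and X below are hypothetical, not from the course), the definition can be tested numerically by comparing f over a finite sample of X:

```python
# Hypothetical instance: f(x) = (x - 1)**2 on X = [0, 2], sampled on a grid.
# The definition asks for x* in X with f(x*) <= f(x) for all x in X.
def f(x):
    return (x - 1.0) ** 2

X = [i * 2.0 / 1000.0 for i in range(1001)]  # finite sample of [0, 2]
x_star = min(X, key=f)                        # minimizer over the sample
```

Over the finite sample the abstract condition f(x*) ≤ f(x) for all x ∈ X becomes a finite comparison, which `min` performs directly.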


SLIDE 5

Examples of Optimization Problems

Basic Idea: Given observations or measurements of a system of interest, how can we determine certain intrinsic properties?

Example
Let M be a point mass with mass m affixed to the end of a vertical spring. At equilibrium, M is located at the origin. K is the restoring force applied by the spring to M upon displacement. For small (vertical) displacements y, the spring applies K = −k̂y (Hooke's law).


SLIDE 6

Examples of Optimization Problems

Example (cont.)
k̂ is an as yet unknown positive spring constant. y(t) := position of M at time t. Ignoring damping and friction, Newton's law states:

    m ÿ = −k̂ y,   (1)

i.e. mass m times acceleration ÿ equals the opposing force of the spring −k̂ y. (1) is called "the undamped harmonic oscillator equation." Usually, friction and damping forces behave proportionally to the velocity ẏ of M. We model this by −r ẏ with fixed r > 0.


SLIDE 7

Examples of Optimization Problems

Example (cont.)
Together with (1), we obtain

    m ÿ + r ẏ + k̂ y = 0.   (2)

Setting c := r/m, k := k̂/m, (2) becomes:

    ÿ + c ẏ + k y = 0.   (3)

Assume at time t = 0:

    y(0) = y0 ∈ R,  ẏ(0) = 0.   (4)

Given an end time T > 0, we consider this initial value problem (IVP) on the interval [0, T].
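In the underdamped case c² < 4k, the IVP (3)–(4) has the standard closed-form solution y(t) = y0 e^(−ct/2) (cos ωt + (c/(2ω)) sin ωt) with ω = √(k − c²/4). A small sketch (hypothetical parameter values) that checks this formula against the ODE by finite differences:

```python
import math

# Hypothetical parameter values; c**2 < 4*k, so the motion is underdamped.
def y(c, k, y0, t):
    """Closed-form solution of y'' + c*y' + k*y = 0, y(0) = y0, y'(0) = 0."""
    w = math.sqrt(k - c * c / 4.0)
    return y0 * math.exp(-c * t / 2.0) * (
        math.cos(w * t) + (c / (2.0 * w)) * math.sin(w * t))

# Check the ODE residual y'' + c*y' + k*y at t = 0.7 by central differences.
c, k, y0, h, t = 0.3, 2.0, 1.0, 1e-4, 0.7
y_m, y_c, y_p = y(c, k, y0, t - h), y(c, k, y0, t), y(c, k, y0, t + h)
ydot = (y_p - y_m) / (2.0 * h)
yddot = (y_p - 2.0 * y_c + y_m) / (h * h)
residual = yddot + c * ydot + k * y_c
```

The residual is O(h²), so it should be tiny but not exactly zero.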


SLIDE 8

Examples of Optimization Problems

Example (cont.)
Suppose we do not know x := (c, k)^T. For j = 1, …, N, we are given measurements {y_j}_{j=1}^N of the spring deviation at time instances t_j = (j − 1)T/(N − 1). Let y(x; t) be the solution of our IVP for a given x. By solving the unconstrained optimization problem

    min_{x ∈ R^2} f(x) := (1/2) Σ_{j=1}^N |y(x; t_j) − y_j|²,   (5)

we seek to determine the spring constant k and damping factor c. Note that y(·; t) is differentiable with respect to x provided c² − 4k ≠ 0. (5) is called a "nonlinear least squares problem."
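A minimal sketch of (5), assuming the underdamped closed-form IVP solution and synthetic measurements generated from hypothetical "true" parameters; a coarse grid search stands in for a proper nonlinear least squares solver:

```python
import math

def y_model(c, k, y0, t):
    # Closed-form IVP solution, valid in the underdamped case c**2 < 4*k
    w = math.sqrt(k - c * c / 4.0)
    return y0 * math.exp(-c * t / 2.0) * (
        math.cos(w * t) + (c / (2.0 * w)) * math.sin(w * t))

# Synthetic measurements y_j at t_j = (j - 1)*T/(N - 1) from "true" parameters
c_true, k_true, y0, T, N = 0.4, 3.0, 1.0, 5.0, 50
ts = [(j - 1) * T / (N - 1) for j in range(1, N + 1)]
ys = [y_model(c_true, k_true, y0, t) for t in ts]

def f(c, k):
    # Objective (5): half the sum of squared deviations from the data
    return 0.5 * sum((y_model(c, k, y0, t) - yj) ** 2 for t, yj in zip(ts, ys))

# Coarse grid search over x = (c, k); a stand-in for a real NLS solver
grid = [(i / 10.0, j / 10.0) for i in range(1, 11) for j in range(20, 41)]
c_hat, k_hat = min(grid, key=lambda x: f(*x))
```

With noiseless data and the true x on the grid, the grid minimizer recovers (c, k) exactly; with noise or off-grid parameters one would use a Gauss–Newton-type method instead.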


SLIDE 9

Examples of Optimization Problems

Basic Idea: Deciding product capacity based on fixed and variable costs.

Example
x output quantity. K_v(x) variable costs, K_f(x) = c > 0 fixed costs. K(x) := K_v(x) + K_f(x), x ∈ R, total costs. Normally one looks for an x* which minimizes the total cost K(x), i.e.

    x* = argmin {K_f(x) + K_v(x) s.t. x ∈ R} = argmin {K_v(x) s.t. x ∈ R}   (6)

In general, x* is not unique. We therefore write:

    x* ∈ argmin {K_v(x) s.t. x ∈ R}   (7)
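The second equality in (6) holds because adding the constant fixed cost c does not change the set of minimizers; a small sketch with a hypothetical quadratic variable cost:

```python
# Hypothetical variable cost with minimum at x = 3; fixed cost is constant.
def Kv(x):
    return (x - 3.0) ** 2

c_fixed = 7.0  # K_f(x) = c > 0

xs = [i * 0.01 for i in range(1001)]              # grid on [0, 10]
x_total = min(xs, key=lambda x: Kv(x) + c_fixed)  # argmin of K = Kv + Kf
x_var = min(xs, key=Kv)                           # argmin of Kv alone
```

Both searches return the same point: a constant shifts every objective value equally, so the ordering of candidates is unchanged.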


SLIDE 10

Constrained Optimization

Particularly in the previous example, one often has constraints on x. When X ⊊ R^n, we often have X = X₁ ∩ X₂ ∩ X₃, where

    X₁ = {x ∈ R^n | c_i(x) = 0, i ∈ I₁},
    X₂ = {x ∈ R^n | c_i(x) ≤ 0, i ∈ I₂},
    X₃ = {x ∈ R^n | x_i ∈ Z, i ∈ I₃}.

The I_i, i = 1, 2, 3, are called index sets. X₁, X₂, X₃ are called equality, inequality, and integer constraints, respectively.


SLIDE 11

Categories of Optimization Problems

X a set of discrete points → discrete or combinatorial optimization. Otherwise, continuous optimization. f or some c_i nondifferentiable → nonsmooth optimization.

Example (Nonsmooth Problems)
f(x) := ||x||₂ (f nondifferentiable at 0). c(x) = |x| − x² = 0. Optimizing over optimization problems (game theory, optimal control, etc.)


SLIDE 12

Notions of Solutions

Definition (Solution Types)
Let f : X → R with X ⊂ R^n. The point x* ∈ X is called a

1. (strict) global minimizer of f (on X) if and only if f(x*) ≤ f(x) (f(x*) < f(x)) ∀x ∈ X \ {x*};

2. (strict) local minimizer of f (on X) if and only if there exists a neighborhood U of x* such that f(x*) ≤ f(x) (f(x*) < f(x)) ∀x ∈ (X ∩ U) \ {x*}.

The optimal objective value f(x*) is called a (strict) local minimum.
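A hypothetical one-dimensional example of the distinction: a tilted double well with two strict local minimizers, only one of which is global:

```python
# Tilted double well: two strict local minimizers near -1 and +1; the
# linear term makes only the one near -1 global. (Hypothetical example.)
def f(x):
    return x ** 4 - 2.0 * x ** 2 + 0.25 * x

xs = [-2.0 + i * 0.001 for i in range(4001)]    # grid on [-2, 2]
x_glob = min(xs, key=f)                          # global minimizer (near -1)
x_loc = min((x for x in xs if x > 0.5), key=f)   # local minimizer (near +1)
```

Restricting the search to x > 0.5 isolates the right-hand well: its bottom satisfies the local definition, but its value exceeds the global minimum.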


SLIDE 13

Stationary Points

For f ∈ C¹, we denote its gradient by

    ∇f(x) = (∂f/∂x₁(x), …, ∂f/∂xₙ(x))^T.

If f : X → R is directionally differentiable, then its directional derivative at x in direction d is denoted by

    f′(x; d) := lim_{α↓0} (f(x + αd) − f(x))/α.

Definition (Stationarity)
Let X ⊂ R^n be an open set and f : X → R directionally differentiable. The point x* ∈ X is called a stationary point of f if

    ∇f(x*) = 0

holds true.
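For f ∈ C¹, the directional derivative reduces to f′(x; d) = ∇f(x)^T d; a small numerical check on a hypothetical smooth f:

```python
# Hypothetical smooth f on R^2; for f in C^1, f'(x; d) = <grad f(x), d>.
def f(x1, x2):
    return x1 ** 2 + 3.0 * x1 * x2

x = (1.0, 2.0)
d = (0.5, -1.0)

# One-sided difference quotient (f(x + a*d) - f(x))/a with small a > 0
a = 1e-6
dq = (f(x[0] + a * d[0], x[1] + a * d[1]) - f(*x)) / a

# Analytic gradient: grad f = (2*x1 + 3*x2, 3*x1)
grad = (2.0 * x[0] + 3.0 * x[1], 3.0 * x[0])
fprime = grad[0] * d[0] + grad[1] * d[1]
```

The one-sided quotient mirrors the limit α ↓ 0 in the definition; for this f both quantities agree up to O(a).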


SLIDE 14

Fermat’s Theorem

Theorem
Let X ⊂ R^n be an open set and f : X → R a continuously differentiable function. If x* ∈ X is a local minimizer of f (on X), then

    ∇f(x*) = 0,

i.e. x* is a stationary point.

Proof. On the board.
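As a sketch (hypothetical f, not from the lecture), gradient descent drives its iterates toward a point where ∇f vanishes, consistent with the theorem: a smooth local minimizer must be stationary:

```python
# Gradient descent on f(x) = (x1 - 2)**2 + 2*(x2 + 1)**2 (hypothetical f);
# the iterates approach the stationary point (2, -1).
def grad(x1, x2):
    return (2.0 * (x1 - 2.0), 4.0 * (x2 + 1.0))

x = (0.0, 0.0)
step = 0.1
for _ in range(200):
    g = grad(*x)
    x = (x[0] - step * g[0], x[1] - step * g[1])
```

After the iteration, the gradient at x is numerically zero, matching Fermat's condition at the minimizer (2, −1).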


SLIDE 15

A Fair Warning

Fermat's Theorem, regardless of the variant, is a first-order necessary condition.

Example
Let f(x) = −x² with x* = 0. Here, ∇f(x*) = −2x* = 0, but x* is in fact a global maximizer of f.

In the previous example, ∇²f(x*) = −2 indicates that f is concave (down). Hence, x* cannot even be a local minimizer. This leads to a general result.
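The warning can be checked directly on the example from the slide:

```python
# f(x) = -x**2: the derivative vanishes at x* = 0, yet every nearby point
# has a strictly smaller value, so x* is a (global) maximizer, not a minimizer.
def f(x):
    return -x ** 2

def df(x):
    return -2.0 * x

x_star = 0.0
```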


SLIDE 16

Continuity of Eigenvalues

Lemma
Let S^n be the vector space of symmetric n × n matrices. For A ∈ S^n, let λ(A) ∈ R be the smallest eigenvalue of A. Then the following estimate holds:

    |λ(A) − λ(B)| ≤ ||A − B||  for all A, B ∈ S^n.

In other words, the mapping that takes a real symmetric matrix to its smallest eigenvalue is Lipschitz continuous with modulus 1.
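A numerical check of the lemma's estimate on hypothetical 2×2 symmetric matrices, using the closed-form eigenvalues of [[a, b], [b, c]] and the spectral norm:

```python
import math

def eigs2(a, b, c):
    # Eigenvalues of the symmetric 2x2 matrix [[a, b], [b, c]]
    m = (a + c) / 2.0
    r = math.sqrt(((a - c) / 2.0) ** 2 + b * b)
    return m - r, m + r

def lam_min(a, b, c):
    return eigs2(a, b, c)[0]

def spec_norm(a, b, c):
    # Spectral norm of a symmetric matrix: largest eigenvalue magnitude
    lo, hi = eigs2(a, b, c)
    return max(abs(lo), abs(hi))

# Hypothetical A, B in S^2: check |lam(A) - lam(B)| <= ||A - B||
A = (4.0, 1.0, 2.0)     # [[4, 1], [1, 2]]
B = (3.0, -0.5, 2.5)    # [[3, -0.5], [-0.5, 2.5]]
lhs = abs(lam_min(*A) - lam_min(*B))
rhs = spec_norm(A[0] - B[0], A[1] - B[1], A[2] - B[2])
```

The norm in the lemma is taken here to be the spectral norm, for which the modulus-1 bound holds.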


SLIDE 17

An Important Implication

The preceding lemma allows us to make the following observations:

  • f ∈ C² ⇒ ∇²f : R^n → R^(n×n) is continuous with respect to some matrix norm || · ||.
  • ∇²f is symmetric.
  • ∇²f(x*) positive definite implies ∇²f(x) pos. def. for ||x − x*|| sufficiently small.

In general: small perturbations (in S^n!) of symmetric pos. def. matrices remain pos. def.
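A sketch of the last observation with a hypothetical 2×2 example: since |λ(A + E) − λ(A)| ≤ ||E||, positive definiteness survives any symmetric perturbation with ||E|| < λ(A):

```python
import math

def lam_min(a, b, c):
    # Smallest eigenvalue of the symmetric 2x2 matrix [[a, b], [b, c]]
    m = (a + c) / 2.0
    r = math.sqrt(((a - c) / 2.0) ** 2 + b * b)
    return m - r

A = (2.0, 0.5, 1.0)     # [[2, 0.5], [0.5, 1]], positive definite
E = (0.1, -0.05, 0.02)  # small symmetric perturbation, ||E|| < lam_min(A)
base = lam_min(*A)
perturbed = lam_min(A[0] + E[0], A[1] + E[1], A[2] + E[2])
```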


SLIDE 18

Second-Order Necessary Optimality Conditions

Theorem
Let X ⊂ R^n be open and f : X → R twice continuously differentiable. If x* ∈ X is a local minimizer of f (on X), then ∇f(x*) = 0 and the Hessian ∇²f(x*) is positive semi-definite.

Proof. On the board.

Question: What happens if we try to apply this as a sufficient condition?

Example
f(x) = x₁² − x₂⁴, x* = (0, 0)^T.
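The example on the slide can be verified numerically: both necessary conditions hold at x* = (0, 0), yet x* is not a local minimizer:

```python
# f(x) = x1**2 - x2**4 at x* = (0, 0): grad f(x*) = 0 and the Hessian
# diag(2, 0) is positive semi-definite, yet x* is not a local minimizer,
# because f(0, t) = -t**4 < f(0, 0) for every t != 0.
def f(x1, x2):
    return x1 ** 2 - x2 ** 4

grad0 = (2.0 * 0.0, -4.0 * 0.0 ** 3)  # grad f = (2*x1, -4*x2**3) at (0, 0)
```

Along the x₂-axis f decreases in every neighborhood of the origin, so the semi-definite Hessian alone cannot certify a minimum.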


SLIDE 19

Second-Order Sufficient Optimality Conditions

Theorem
Let X ⊂ R^n be open and f : X → R twice continuously differentiable. If

1. ∇f(x*) = 0 and

2. ∇²f(x*) is positive definite,

then x* is a strict local minimizer of f (on X).

Proof. On the board.
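A check of the sufficient conditions on a hypothetical quadratic: the Hessian [[2, 1], [1, 2]] has eigenvalues 1 and 3, both positive, so the origin is a strict local minimizer:

```python
import math

# f(x) = x1**2 + x1*x2 + x2**2 (hypothetical): grad f(0) = 0, and the
# Hessian [[2, 1], [1, 2]] has eigenvalues 1 and 3, both positive.
def f(x1, x2):
    return x1 ** 2 + x1 * x2 + x2 ** 2

# Sample a small circle around the origin; every value exceeds f(0, 0) = 0,
# as the theorem predicts for a strict local minimizer.
eps = 1e-3
vals = [f(eps * math.cos(t), eps * math.sin(t))
        for t in (2.0 * math.pi * i / 100.0 for i in range(100))]
```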
