Conjugate Direction minimization
Lectures for PHD course on Numerical optimization Enrico Bertolazzi
DIMS – Universit´ a di Trento
November 21 – December 14, 2011
Conjugate Direction minimization 1 / 106
Conjugate Direction minimization Lectures for PHD course on - - PowerPoint PPT Presentation
Conjugate Direction minimization Lectures for PHD course on Numerical optimization Enrico Bertolazzi DIMS Universit a di Trento November 21 December 14, 2011 Conjugate Direction minimization 1 / 106 Outline Introduction 1
Conjugate Direction minimization 1 / 106
Conjugate Direction minimization 2 / 106
Introduction
Conjugate Direction minimization 3 / 106
Introduction
Conjugate Direction minimization 4 / 106
Introduction
Conjugate Direction minimization 5 / 106
Introduction
Conjugate Direction minimization 6 / 106
Introduction
Conjugate Direction minimization 7 / 106
Introduction
Conjugate Direction minimization 8 / 106
Introduction
Conjugate Direction minimization 9 / 106
Convergence rate of Steepest Descent iterative scheme
Conjugate Direction minimization 10 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent for quadratic functions
Conjugate Direction minimization 11 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent for quadratic functions
Conjugate Direction minimization 12 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent for quadratic functions
Conjugate Direction minimization 13 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent for quadratic functions
14 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent for quadratic functions
Conjugate Direction minimization 15 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent for quadratic functions
Conjugate Direction minimization 16 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 17 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 18 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 19 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
1 Or αk = 0; 2 Or λk is a root of the quadratic polynomial λ2B + λµ + A.
athe argument should be improved in the case of multiple eigenvalues Conjugate Direction minimization 20 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 21 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 22 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 23 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
Conjugate Direction minimization 24 / 106
Convergence rate of Steepest Descent iterative scheme The steepest descent convergence rate
1 In the first case (κ = 1) we have A = βI for some β > 0 so it
2 In the second case we have
Conjugate Direction minimization 25 / 106
Conjugate direction method
Conjugate Direction minimization 26 / 106
Conjugate direction method Conjugate vectors
Conjugate Direction minimization 27 / 106
Conjugate direction method Conjugate vectors
Conjugate Direction minimization 28 / 106
Conjugate direction method Conjugate vectors
Conjugate Direction minimization 29 / 106
Conjugate direction method Conjugate vectors
Conjugate Direction minimization 30 / 106
Conjugate direction method First step
Conjugate Direction minimization 31 / 106
Conjugate direction method First step
Conjugate Direction minimization 32 / 106
Conjugate direction method First step
Conjugate Direction minimization 33 / 106
Conjugate direction method First step
Conjugate Direction minimization 34 / 106
Conjugate direction method kth Step
Conjugate Direction minimization 35 / 106
Conjugate direction method kth Step
Conjugate Direction minimization 36 / 106
Conjugate direction method kth Step
Conjugate Direction minimization 37 / 106
Conjugate direction method Successive one dimensional minimization
Conjugate Direction minimization 38 / 106
Conjugate direction method Successive one dimensional minimization
Conjugate Direction minimization 39 / 106
Conjugate direction method Successive one dimensional minimization
Conjugate Direction minimization 40 / 106
Conjugate direction method Successive one dimensional minimization
Conjugate Direction minimization 41 / 106
Conjugate direction method Successive one dimensional minimization
Conjugate Direction minimization 42 / 106
Conjugate direction method Successive one dimensional minimization
Conjugate Direction minimization 43 / 106
Conjugate direction method Conjugate direction minimization
k−1pT k
Conjugate Direction minimization 44 / 106
Conjugate direction method Conjugate direction minimization
Conjugate Direction minimization 45 / 106
Conjugate Gradient method
Conjugate Direction minimization 46 / 106
Conjugate Gradient method
Conjugate Direction minimization 47 / 106
Conjugate Gradient method
Conjugate Direction minimization 48 / 106
Conjugate Gradient method
Conjugate Direction minimization 49 / 106
Conjugate Gradient method
Conjugate Direction minimization 50 / 106
Conjugate Gradient method
1 simplification of the expression for αk; 2 Orthogonality of the residual rk from the previous residue r0,
3 three point formula and simplification of the coefficients
Conjugate Direction minimization 51 / 106
Conjugate Gradient method
Conjugate Direction minimization 52 / 106
Conjugate Gradient method
Conjugate Direction minimization 53 / 106
Conjugate Gradient method
Conjugate Direction minimization 54 / 106
Conjugate Gradient method
k−1rk−1
k Apk ;
k rk
k−1rk−1 ;
Conjugate Direction minimization 55 / 106
Conjugate Gradient convergence rate
Conjugate Direction minimization 56 / 106
Conjugate Gradient convergence rate Polynomial residual expansions
Conjugate Direction minimization 57 / 106
Conjugate Gradient convergence rate Polynomial residual expansions
Conjugate Direction minimization 58 / 106
Conjugate Gradient convergence rate Polynomial residual expansions
Conjugate Direction minimization 59 / 106
Conjugate Gradient convergence rate Polynomial residual expansions
Conjugate Direction minimization 60 / 106
Conjugate Gradient convergence rate Polynomial residual expansions
Conjugate Direction minimization 61 / 106
Conjugate Gradient convergence rate Convergence rate calculation
Conjugate Direction minimization 62 / 106
Conjugate Gradient convergence rate Convergence rate calculation
Conjugate Direction minimization 63 / 106
Conjugate Gradient convergence rate Convergence rate calculation
Conjugate Direction minimization 64 / 106
Conjugate Gradient convergence rate Convergence rate calculation
65 / 106
Conjugate Gradient convergence rate Finite termination of Conjugate Gradient
Conjugate Direction minimization 66 / 106
Conjugate Gradient convergence rate Convergence rate of Conjugate Gradient
1 The constant
2 The following bound, is useful
3 in particular the final estimate will be obtained by
Conjugate Direction minimization 67 / 106
Conjugate Gradient convergence rate Chebyshev Polynomials
1 The Chebyshev Polynomials of the First Kind are the right
2 Another equivalent definition valid in the interval (−∞, ∞) is
3 In spite of these definition, Tk(x) is effectively a polynomial. Conjugate Direction minimization 68 / 106
Conjugate Gradient convergence rate Chebyshev Polynomials
0.5 1 1.5
0.5 1 T1 T2
0.5 1 1.5
0.5 1 T3 T4
0.5 1 1.5
0.5 1 T12
0.5 1 1.5
0.5 1 T20
Conjugate Direction minimization 69 / 106
Conjugate Gradient convergence rate Chebyshev Polynomials
1 It is easy to show that Tk(x) is a polynomial by the use of
1
2
3
4
2 In general we have the following recurrence: 1
2
3
Conjugate Direction minimization 70 / 106
Conjugate Gradient convergence rate Chebyshev Polynomials
1
2
3
Conjugate Direction minimization 71 / 106
Conjugate Gradient convergence rate Convergence rate of Conjugate Gradient method
Conjugate Direction minimization 72 / 106
Conjugate Gradient convergence rate Convergence rate of Conjugate Gradient method
Conjugate Direction minimization 73 / 106
Preconditioning the Conjugate Gradient method
Conjugate Direction minimization 74 / 106
Preconditioning the Conjugate Gradient method Preconditioning
Conjugate Direction minimization 75 / 106
Preconditioning the Conjugate Gradient method Preconditioning
Conjugate Direction minimization 76 / 106
Preconditioning the Conjugate Gradient method Preconditioning
k−1
k
k
k−1
Conjugate Direction minimization 77 / 106
Preconditioning the Conjugate Gradient method CG reformulation
1 solve P s′
2 evaluate s′′
3 solve P T s′′′
Conjugate Direction minimization 78 / 106
Preconditioning the Conjugate Gradient method CG reformulation
j
j
Conjugate Direction minimization 79 / 106
Preconditioning the Conjugate Gradient method CG reformulation
Conjugate Direction minimization 80 / 106
Preconditioning the Conjugate Gradient method CG reformulation
Conjugate Direction minimization 81 / 106
Preconditioning the Conjugate Gradient method CG reformulation
Conjugate Direction minimization 82 / 106
Preconditioning the Conjugate Gradient method CG reformulation
Conjugate Direction minimization 83 / 106
Preconditioning the Conjugate Gradient method CG reformulation
Conjugate Direction minimization 84 / 106
Preconditioning the Conjugate Gradient method CG reformulation
k−1zk−1
k
k zk
k−1zk−1 ;
Conjugate Direction minimization 85 / 106
Nonlinear Conjugate Gradient extension
Conjugate Direction minimization 86 / 106
Nonlinear Conjugate Gradient extension
1 The conjugate gradient algorithm can be extended for
2 Fletcher and Reeves extend CG for the minimization of a
1
2
3 We also translate the index for the search direction pk to be
Conjugate Direction minimization 87 / 106
Nonlinear Conjugate Gradient extension Fletcher and Reeves
k gk
k−1gk−1 ;
Conjugate Direction minimization 88 / 106
Nonlinear Conjugate Gradient extension Fletcher and Reeves 1 To ensure convergence and apply Zoutendijk global
2 p0 is a descent direction by construction, for pk we have
3 Exact line-search is expensive, however if we use inexact
1
2
Conjugate Direction minimization 89 / 106
Nonlinear Conjugate Gradient extension convergence analysis
1globally here means that Zoutendijk like theorem apply Conjugate Direction minimization 90 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 91 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 92 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 93 / 106
Nonlinear Conjugate Gradient extension convergence analysis 1 The inequality of the the previous lemma can be written as:
2 Remembering the Zoutendijk theorem we have
3 so that if gk / pk is bounded from below we have that
4 Unfortunately this bound cant be proved so that Zoutendijk
Conjugate Direction minimization 94 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 95 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 96 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 97 / 106
Nonlinear Conjugate Gradient extension convergence analysis
Conjugate Direction minimization 98 / 106
Nonlinear Conjugate Gradient extension convergence analysis
athe correct assumption is that there exists k0 such that gk ≥ δ for
Conjugate Direction minimization 99 / 106
Nonlinear Conjugate Gradient extension
Conjugate Direction minimization 100 / 106
Nonlinear Conjugate Gradient extension Polack and Ribi´ ere
1 The previous problem can be elided if we restart anew when
2 Restarting is obtained by simply set βFR
3 A more elegant solution can be obtained with a new definition
4 This definition of βPR
Conjugate Direction minimization 101 / 106
Nonlinear Conjugate Gradient extension Polack and Ribi´ ere
k (gk−gk−1)
k−1gk−1
Conjugate Direction minimization 102 / 106
Nonlinear Conjugate Gradient extension Polack and Ribi´ ere
Conjugate Direction minimization 103 / 106
Nonlinear Conjugate Gradient extension Polack and Ribi´ ere
Conjugate Direction minimization 104 / 106
Nonlinear Conjugate Gradient extension
Conjugate Direction minimization 105 / 106
References
Conjugate Direction minimization 106 / 106