General Introduction to Stochastic Optimization Stochastic Gradient Method Overview
Introduction to Stochastic Optimization
January 13, 2015
- P. Carpentier
Master MMMEF — Cours MNOS 2014-2015 3 / 265
Introduction to Stochastic Optimization January 13, 2015 P. - - PowerPoint PPT Presentation
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Introduction to Stochastic Optimization January 13, 2015 P. Carpentier Master MMMEF Cours MNOS 2014-2015 3 / 265 General Introduction to Stochastic
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview
Master MMMEF — Cours MNOS 2014-2015 3 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview
Master MMMEF — Cours MNOS 2014-2015 4 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 5 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 6 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 7 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 8 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 9 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 10 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 11 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 12 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 13 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 14 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
2There is here a tricky point in the notations. . .
Master MMMEF — Cours MNOS 2014-2015 15 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 16 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 17 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 18 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 19 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 20 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Motivation and Goals Reminders in the Deterministic Framework Switching to the Stochastic Case
Master MMMEF — Cours MNOS 2014-2015 21 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 22 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 23 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
1 Let u(0) ∈ Uad and choose a positive real sequence {ǫ(k)}k∈N. 2 At iteration (k + 1), draw a realization w(k+1) of the r.v. W . 3 Compute the gradient of j and update u(k+1) by the formula:
4 Set k = k + 1 and go to step 2.
Master MMMEF — Cours MNOS 2014-2015 24 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
3Note that (Ω, A, P) has to be “big enough” to support such a sample.
Master MMMEF — Cours MNOS 2014-2015 25 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 26 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 27 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
4Recall that E
Master MMMEF — Cours MNOS 2014-2015 28 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 29 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 30 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 31 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
1 The random variable U(0) is F(0)-mesurable. 2 The mapping h : U −
3 The random variable ξ(k) is F(k)-mesurable for all k, and
4 The sequence {ǫ(k)}k∈N is a σ-sequence, that is,
Master MMMEF — Cours MNOS 2014-2015 32 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
5for example a bias on h(u), as considered in the Kiefer-Wolfowitz algorithm
Master MMMEF — Cours MNOS 2014-2015 33 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 34 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
1 h is continuously differentiable and, in a neighborhood of u♯,
2 The sequence
3 ∃ δ > 0 such that supk∈N E
4 The sequence {ǫ(k)}k∈N is a σ(α, β, γ)-sequence. 5 The square matrix (H − λI) is positive-definite, with
Master MMMEF — Cours MNOS 2014-2015 35 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 36 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
γ 2
Master MMMEF — Cours MNOS 2014-2015 37 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 38 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 39 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 40 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 41 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 42 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 43 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 44 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
Master MMMEF — Cours MNOS 2014-2015 45 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
Master MMMEF — Cours MNOS 2014-2015 46 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
M
k+1
Master MMMEF — Cours MNOS 2014-2015 47 / 265
General Introduction to Stochastic Optimization Stochastic Gradient Method Overview Stochastic Gradient Algorithm Connexion with Stochastic Approximation Asymptotic Efficiency and Averaging Practical Considerations
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1000 2000 3000 4000 5000 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
Master MMMEF — Cours MNOS 2014-2015 48 / 265