Pattern Recognition Introduction to Gradient Descent
Ad Feelders
Universiteit Utrecht
Ad Feelders (Universiteit Utrecht) Pattern Recognition 1 / 32
Pattern Recognition Introduction to Gradient Descent Ad Feelders - - PowerPoint PPT Presentation
Pattern Recognition Introduction to Gradient Descent Ad Feelders Universiteit Utrecht Ad Feelders (Universiteit Utrecht) Pattern Recognition 1 / 32 Optimization (single variable) Suppose we want to find the value of x for which the function
Ad Feelders (Universiteit Utrecht) Pattern Recognition 1 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 2 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 3 / 32
d f d x(x = x∗)
Ad Feelders (Universiteit Utrecht) Pattern Recognition 4 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 5 / 32
1 Set i ← 0, and choose an initial value x(0) 2 determine the derivative
3 Repeat the previous step until
Ad Feelders (Universiteit Utrecht) Pattern Recognition 6 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 7 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 8 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 9 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 10 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 11 / 32
1 The vector xA − xB runs parallel to the contour line. 2 Vectors are perpendicular if their dot product is zero. Ad Feelders (Universiteit Utrecht) Pattern Recognition 12 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 13 / 32
1 The gradient points in the direction of fastest increase of f . 2 Minus the gradient points in the direction of fastest decrease of f . Ad Feelders (Universiteit Utrecht) Pattern Recognition 14 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 15 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 16 / 32
1 Set i ← 0, and choose an initial value x(0) 2 determine the gradient ∇f (x(i)) of f (x) at x(i) and update
3 Repeat the previous step until
Ad Feelders (Universiteit Utrecht) Pattern Recognition 17 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 18 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 19 / 32
(Universiteit Utrecht) Pattern Recognition 20 / 32
(Universiteit Utrecht) Pattern Recognition 21 / 32
b0 b1 5 5 40 4 30 30 2 20 1 10 5 4 4 3 0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0
(Universiteit Utrecht) Pattern Recognition 22 / 32
(Universiteit Utrecht) Pattern Recognition 23 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 24 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 25 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 26 / 32
1 choose an initial value w(0) (e.g. at random); i ← 0 2 determine the gradient and update
3 Repeat the previous step until
Ad Feelders (Universiteit Utrecht) Pattern Recognition 27 / 32
1 choose an initial value w(0)
2 For n = 1 to N
3 Repeat step 2 until convergence and check if a (local) maximum has
Ad Feelders (Universiteit Utrecht) Pattern Recognition 28 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 29 / 32
Ad Feelders (Universiteit Utrecht) Pattern Recognition 30 / 32
0e+00 2e+04 4e+04 6e+04 8e+04 1e+05 −0.15 −0.10 −0.05 0.00 0.05 0.10 0.15 iteration w1 Ad Feelders (Universiteit Utrecht) Pattern Recognition 31 / 32
0e+00 2e+04 4e+04 6e+04 8e+04 1e+05 −3.0 −2.5 −2.0 −1.5 −1.0 iteration w0 Ad Feelders (Universiteit Utrecht) Pattern Recognition 32 / 32