Connecting the dots with common sense and linear models
L´ eon Bottou
NEC Labs America
Connecting the dots with common sense and linear models L eon - - PowerPoint PPT Presentation
Connecting the dots with common sense and linear models L eon Bottou NEC Labs America COS 424 2/4/2010 Introduction Useful things: understanding probabilities, understanding statistical learning theory, knowing countless
NEC Labs America
L´ eon Bottou 2/45 COS 424 – 2/4/2010
L´ eon Bottou 3/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −4 −3 −2 −1 1 2 3 4
L´ eon Bottou 4/45 COS 424 – 2/4/2010
. . .
0.39 0.50 5.84 -4.36 -0.01 7.20 -7.40 -7.16 . . .
0.77 5.03 5.46 7.34 1.92 -5.66 -5.33 -6.15 -3.14 4.53 6.37 . . .
6.45 5.10 5.18 2.27 4.57 4.18 -6.07 -5.47 -6.97 2.67 -3.93 . . . 2.77 7.46 4.84 6.97 1.09 -2.17 -6.38 5.66 -2.65 -2.81 -0.69 2.76 . . . 0.42 5.88 0.29 -7.13 2.85 1.79 6.22 1.34 -1.83 3.01 3.99 -1.75 . . . 0.03 1.55
2.53 -3.47 -0.46 3.21 -2.73 6.65 -0.77 . . .
3.14 5.37 3.80 -0.00 1.89 3.24 2.30 -1.45 7.63 -2.12 . . . 6.47 2.04 3.58 -4.96 7.54 2.47 6.39 4.95 -2.51 -6.46 0.49 -0.61 . . . 5.10 1.90 1.79 3.20
4.93 -2.13 -7.11 -5.10 2.13 6.31 7.00 . . . 1.71
7.33 -0.99 4.17 -7.81 -7.64 4.01 -3.37 . . . 7.29
7.66 -6.70
5.34 -5.94 -1.76 3.79 2.92 0.75 7.04 . . .
7.54 2.47 6.39 4.95 -2.51 -6.46 0.49 -0.61 . . . 5.10 1.90 1.79 3.20
4.93 -2.13 -7.11 -5.10 2.13 6.31 7.00 . . . 1.71
7.33 -0.99 4.17 -7.81 -7.64 4.01 -3.37 . . . 7.29
7.66 -6.70 . . . . . . . . . . . . . . . . . .
L´ eon Bottou 5/45 COS 424 – 2/4/2010
L´ eon Bottou 6/45 COS 424 – 2/4/2010
L´ eon Bottou 7/45 COS 424 – 2/4/2010
L´ eon Bottou 8/45 COS 424 – 2/4/2010
L´ eon Bottou 9/45 COS 424 – 2/4/2010
L´ eon Bottou 10/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 11/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 12/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 13/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 14/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 15/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 16/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 17/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Polynomial basis
L´ eon Bottou 18/45 COS 424 – 2/4/2010
0.01 0.1 1 10 100 1000 10000 100000 5 10 15 20 polynomial degree Training MSE True MSE
L´ eon Bottou 19/45 COS 424 – 2/4/2010
L´ eon Bottou 20/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear (hinges)
L´ eon Bottou 21/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear with 2 knots
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear with 3 knots
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear with 4 knots
L´ eon Bottou 22/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear with 5 knots
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear with 9 knots
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear with 18 knots
L´ eon Bottou 23/45 COS 424 – 2/4/2010
0.01 0.1 1 10 100 1000 5 10 15 20 number of knots Training MSE True MSE
L´ eon Bottou 24/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear (ramps)
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise linear (triangles)
L´ eon Bottou 25/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise ramps with 6 knots
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise triangles with 7 knots
L´ eon Bottou 26/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
L´ eon Bottou 27/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise quadratic with 1 knot
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise quadratic with 6 knots
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Piecewise quadratic with 12 knots
L´ eon Bottou 28/45 COS 424 – 2/4/2010
0.05 0.1 0.5 1 5 10 5 10 15 20 number of knots Training MSE True MSE
L´ eon Bottou 29/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Polynomial d=12
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Polynomial d=12 (more examples)
L´ eon Bottou 30/45 COS 424 – 2/4/2010
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Polynomial d=12
−2 −1 1 2 3 4 5 −6 −4 −2 2 4 6
Polynomial d=12 (less noise)
L´ eon Bottou 31/45 COS 424 – 2/4/2010
L´ eon Bottou 32/45 COS 424 – 2/4/2010
L´ eon Bottou 33/45 COS 424 – 2/4/2010
L´ eon Bottou 34/45 COS 424 – 2/4/2010
L´ eon Bottou 35/45 COS 424 – 2/4/2010
L´ eon Bottou 36/45 COS 424 – 2/4/2010
L´ eon Bottou 37/45 COS 424 – 2/4/2010
L´ eon Bottou 38/45 COS 424 – 2/4/2010
L´ eon Bottou 39/45 COS 424 – 2/4/2010
L´ eon Bottou 40/45 COS 424 – 2/4/2010
copied from (Platt, 1998)
L´ eon Bottou 41/45 COS 424 – 2/4/2010
L´ eon Bottou 42/45 COS 424 – 2/4/2010
L´ eon Bottou 43/45 COS 424 – 2/4/2010
12 13 14 15 16 17 18 0.01 0.1 1 10 100 1000 epsilon (quadratic terms) percent error Training set Validation set
L´ eon Bottou 44/45 COS 424 – 2/4/2010
L´ eon Bottou 45/45 COS 424 – 2/4/2010