Second order machine learning
Michael W. Mahoney ICSI and Department of Statistics UC Berkeley
Michael W. Mahoney (UC Berkeley) Second order machine learning 1 / 96
Introduction
Footnote 1: The solution exists, is unique, and varies continuously with the input data.
Efficient and Effective Optimization Methods
1. Efficient, i.e., Low Per-Iteration Cost
2. Effective, i.e., Fast Convergence Rate
1. “Not-So-Simple” 1st order methods, e.g., accelerated and adaptive
2. 2nd order methods, e.g.,
1. “Not-So-Simple” 1st order method: FLAG n’ FLARE
   $\min_{x \in X \subseteq \mathbb{R}^d} F(x) = f(x) + h(x)$
2. 2nd order methods: Stochastic Newton-Type Methods
   $\min_{x \in X \subseteq \mathbb{R}^d} F(x) = \frac{1}{n} \sum_{i=1}^{n} f_i(x)$
First-order methods: FLAG n’ FLARE
$\sum_{k=1}^{\infty} \alpha_k = \infty,$
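A minimal sketch of why a divergent step-size sum matters, assuming the condition above refers to the step sizes $\alpha_k$ of a (sub)gradient iteration. The test function $|x - 3|$, the starting point, and the two schedules $1/k$ and $1/k^2$ are illustrative choices, not taken from the slides.

```python
def subgradient_method(step, iters=1000, x0=0.0, target=3.0):
    """Run x <- x - step(k) * sign(x - target) on f(x) = |x - target|."""
    x = x0
    for k in range(1, iters + 1):
        # A subgradient of |x - target| is its sign (0 at the minimizer).
        g = (x > target) - (x < target)
        x -= step(k) * g
    return x


# Hypothetical schedules: 1/k has a divergent sum, 1/k^2 a finite one.
x_divergent_sum = subgradient_method(lambda k: 1.0 / k)
x_summable = subgradient_method(lambda k: 1.0 / k**2)
```

With the summable schedule the iterate can travel at most $\sum_k 1/k^2 < 1.65$ from its start, so it stalls short of the minimizer at 3; with the divergent schedule it reaches a small neighborhood of the minimizer.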
$\sum_{i=1}^{d} \sqrt{\sum_{t=1}^{T} [g_t]_i^2}$
$\sqrt{\sum_{t=1}^{T} \|g_t\|_2^2}$
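The per-coordinate sums of squared gradients above are the quantities that adaptive methods accumulate. As a rough illustration, here is a sketch of the standard diagonal AdaGrad update on a badly scaled quadratic. This is plain AdaGrad, not FLAG or FLARE themselves; the objective, step size `eta`, and `eps` are assumptions made for the example.

```python
import math


def adagrad(grad, x0, eta=0.5, iters=100, eps=1e-8):
    """Diagonal AdaGrad: scale each coordinate by its accumulated squared gradients."""
    x = list(x0)
    accum = [0.0] * len(x)  # running sum over t of [g_t]_i^2, one entry per coordinate i
    for _ in range(iters):
        g = grad(x)
        for i in range(len(x)):
            accum[i] += g[i] ** 2
            x[i] -= eta * g[i] / (math.sqrt(accum[i]) + eps)
    return x


# Hypothetical test problem: f(x) = 0.5 * (100 * x_0^2 + x_1^2), minimized at the origin.
def quad_grad(x):
    return [100.0 * x[0], x[1]]


x_final = adagrad(quad_grad, [1.0, 1.0])
```

Because each coordinate is divided by the root of its own accumulated squared gradients, the factor-of-100 difference in curvature between the two coordinates has almost no effect on their trajectories.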
$\frac{1}{T^2}$ vs. $\frac{1}{T}$ vs. $\frac{1}{\sqrt{T}}$
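To make the gap between these three rates concrete, the sketch below computes how many iterations $T$ each idealized rate needs before it drops below a target accuracy. Constants are set to 1 and the helper name `iterations_needed` is hypothetical; real bounds carry problem-dependent constants.

```python
from math import isqrt


def iterations_needed(inv_eps):
    """Smallest T with rate(T) <= 1/inv_eps, for three idealized rates (inv_eps >= 1)."""
    return {
        "1/T^2": isqrt(inv_eps - 1) + 1,  # smallest T with T^2 >= inv_eps
        "1/T": inv_eps,                   # smallest T with T >= inv_eps
        "1/sqrt(T)": inv_eps ** 2,        # smallest T with sqrt(T) >= inv_eps
    }


needed = iterations_needed(10_000)  # target accuracy 1e-4
```

For a target accuracy of $10^{-4}$ this gives 100, 10,000, and 100,000,000 iterations respectively.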
$\sum_{i=1}^{d} \sqrt{\sum_{t=1}^{T} [\tilde{g}_t]_i^2}$
Second-order methods: Stochastic Newton-Type Methods
$x \in D \cap X$
$\cdots_{k}\,\nabla F(x^{(k)})$
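As a rough illustration of a stochastic Newton-type step that multiplies the gradient $\nabla F(x^{(k)})$ by an approximate inverse Hessian, here is a sketch of a sub-sampled Newton iteration for a finite-sum least-squares objective in two dimensions. The objective, the uniform sampling without replacement, the unit step size, and the name `subsampled_newton` are all assumptions for the example, not the specific algorithm on the slides.

```python
import random


def subsampled_newton(A, b, sample_size, iters, seed=0):
    """Minimize F(x) = (1/n) * sum_i 0.5 * (a_i . x - b_i)^2 for x in R^2."""
    rng = random.Random(seed)
    n = len(A)
    x = [0.0, 0.0]
    for _ in range(iters):
        # Full gradient: (1/n) * sum_i (a_i . x - b_i) * a_i
        g = [0.0, 0.0]
        for a, bi in zip(A, b):
            r = a[0] * x[0] + a[1] * x[1] - bi
            g[0] += r * a[0] / n
            g[1] += r * a[1] / n
        # Sub-sampled Hessian: (1/|S|) * sum_{i in S} a_i a_i^T
        S = rng.sample(range(n), sample_size)
        h00 = sum(A[i][0] * A[i][0] for i in S) / sample_size
        h01 = sum(A[i][0] * A[i][1] for i in S) / sample_size
        h11 = sum(A[i][1] * A[i][1] for i in S) / sample_size
        # Solve the 2x2 system H_S p = g by Cramer's rule, then step x <- x - p
        det = h00 * h11 - h01 * h01
        p0 = (h11 * g[0] - h01 * g[1]) / det
        p1 = (h00 * g[1] - h01 * g[0]) / det
        x = [x[0] - p0, x[1] - p1]
    return x


# Hypothetical noise-free data generated from a known solution.
data_rng = random.Random(1)
x_true = [2.0, -1.0]
A = [[data_rng.gauss(0, 1), data_rng.gauss(0, 1)] for _ in range(400)]
b = [a[0] * x_true[0] + a[1] * x_true[1] for a in A]
x_hat = subsampled_newton(A, b, sample_size=200, iters=30)
```

Each iteration touches only `sample_size` of the `n` terms when forming the Hessian, which is where the lower per-iteration cost comes from; the gradient is kept exact here for simplicity.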
Conclusion