Stochastic optimization in Hilbert spaces
Aymeric Dieuleveut
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 1 / 48
Outline
Learning vs Statistics
Tradeoffs of Large scale learning - Learning
1 Start at some f_0.
2 Iterate :
3 Output f_m or \bar{f}_m
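The three steps above can be sketched in code. The update rule of step 2 did not survive in this transcript, so the sketch below assumes a plain stochastic gradient step on the squared loss in finite dimension, with a step-size proportional to 1/√k and a running average of the iterates. The function names `sgd_least_squares` and `make_samples`, the constant `gamma0`, and the synthetic data are hypothetical choices made for illustration, not taken from the slides.

```python
import random


def sgd_least_squares(samples, dim, gamma0=0.1):
    """One pass of SGD on the squared loss (assumed update rule).

    Returns the last iterate f_m and the averaged iterate f_bar_m.
    """
    f = [0.0] * dim      # step 1: start at some f_0 (here the zero vector)
    f_bar = [0.0] * dim  # running average of f_1, ..., f_k
    for k, (x, y) in enumerate(samples, start=1):
        gamma = gamma0 / k ** 0.5  # step-size proportional to 1/sqrt(k)
        residual = sum(fi * xi for fi, xi in zip(f, x)) - y
        # step 2: move against the stochastic gradient of 0.5 * (<f, x> - y)^2
        f = [fi - gamma * residual * xi for fi, xi in zip(f, x)]
        # incremental mean: f_bar_k = f_bar_{k-1} + (f_k - f_bar_{k-1}) / k
        f_bar = [bi + (fi - bi) / k for fi, bi in zip(f, f_bar)]
    return f, f_bar      # step 3: output f_m or its average


def make_samples(n, f_star, noise=0.1, seed=0):
    """Synthetic linear data y = <f_star, x> + Gaussian noise (illustrative)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        x = [rng.gauss(0.0, 1.0) for _ in f_star]
        y = sum(a * b for a, b in zip(f_star, x)) + rng.gauss(0.0, noise)
        samples.append((x, y))
    return samples


if __name__ == "__main__":
    f_star = [1.0, -2.0]
    f_m, f_bar_m = sgd_least_squares(make_samples(5000, f_star), dim=2)
    print("last iterate:    ", f_m)
    print("averaged iterate:", f_bar_m)
```

Each sample is used once, so the number of iterations m equals the number of observations processed. Both outputs are returned so the last iterate and the averaged iterate can be compared on the same run.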
O((1 − κ)^m)
With step-size γ_k proportional to 1/√k.
√n is
A case study - Finite dimension linear least mean squares
Non parametric learning
ρX
2r+α
2r/(2r+α)
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 40 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 41 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 41 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 41 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 42 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 43 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 43 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 44 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 44 / 48
The complexity challenge, approximation of the kernel
Aymeric Dieuleveut Stochastic optimization Hilbert spaces 45 / 48
The complexity challenge, approximation of the kernel
Alaoui, A. E. and Mahoney, M. W. (2014). Fast randomized kernel methods with statistical guarantees. CoRR, abs/1411.0306.
Bach, F. (2012). Sharp analysis of low-rank kernel matrix approximations. ArXiv e-prints.
Bach, F. and Moulines, E. (2013). Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n). ArXiv e-prints.
Bottou, L. and Bousquet, O. (2008). The tradeoffs of large scale learning. In Advances in Neural Information Processing Systems 20.
Caponnetto, A. and De Vito, E. (2007). Optimal Rates for the Regularized Least-Squares Algorithm. Foundations of Computational Mathematics, 7(3):331–368.
Dai, B., Xie, B., He, N., Liang, Y., Raj, A., Balcan, M., and Song, L. (2014). Scalable kernel methods via doubly stochastic gradients. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada, pages 3041–3049.
Dieuleveut, A. and Bach, F. (2014). Non-parametric Stochastic Approximation with Large Step sizes. ArXiv e-prints.
Rahimi, A. and Recht, B. (2008). Weighted sums of random kitchen sinks: Replacing minimization with randomization in learning. In Advances in Neural Information Processing Systems 21, Proceedings of the Twenty-Second Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 8-11, 2008, pages 1313–1320.
Rudi, A., Camoriano, R., and Rosasco, L. (2015). Less is more: Nyström … CoRR, abs/1507.04717.
Shalev-Shwartz, S. and K., S. (2011). Theoretical basis for more data less work.
Shalev-Shwartz, S. and Srebro, N. (2008). SVM optimisation: Inverse dependence on training set size. Proceedings of the International Conference on Machine Learning (ICML).
Tarrès, P. and Yao, Y. (2011). Online learning as stochastic approximation of regularization paths. ArXiv e-prints 1103.5538.
Ying, Y. and Pontil, M. (2008). Online gradient descent learning algorithms. Foundations of Computational Mathematics, 5.