Optimization Problems for Neural Networks
Chih-Jen Lin
National Taiwan University Last updated: May 25, 2020
Chih-Jen Lin (National Taiwan Univ.) 1 / 78
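Outline
1 Regularized linear classification
2 Optimization problem for fully-connected networks
3 Optimization problem for convolutional neural networks (CNN)
4 Discussion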
Regularized linear classification
$$\min_{\boldsymbol{w}} f(\boldsymbol{w}),$$
Optimization problem for fully-connected networks
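$$
W^m = \begin{bmatrix}
w^m_{11} & w^m_{12} & \cdots & w^m_{1 n_m} \\
w^m_{21} & w^m_{22} & \cdots & w^m_{2 n_m} \\
\vdots & \vdots & & \vdots \\
w^m_{n_{m+1} 1} & w^m_{n_{m+1} 2} & \cdots & w^m_{n_{m+1} n_m}
\end{bmatrix}_{n_{m+1} \times n_m}
$$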
$$z^{m+1}_j = \sigma(s^m_j), \quad j = 1, \ldots, n_{m+1},$$
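$$
\boldsymbol{b}^m = \begin{bmatrix}
b^m_1 \\
b^m_2 \\
\vdots \\
b^m_{n_{m+1}}
\end{bmatrix}_{n_{m+1} \times 1}
$$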
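A minimal sketch of one fully-connected layer, assuming the usual relation $s^m = W^m z^m + b^m$ followed by an elementwise activation $\sigma$. The logistic `sigmoid` and the names `layer_forward`, `W`, `b`, and `z` are illustrative assumptions, not taken from the slides.

```python
import math


def sigmoid(t):
    """Logistic activation; an assumed choice for sigma."""
    return 1.0 / (1.0 + math.exp(-t))


def layer_forward(W, b, z, sigma=sigmoid):
    """Return z^{m+1}, where z^{m+1}_j = sigma(s_j) and s = W z + b.

    W is a list of n_{m+1} rows, each of length n_m; b has length n_{m+1}.
    """
    if len(W) != len(b):
        raise ValueError("W must have one row per bias entry")
    out = []
    for row, b_j in zip(W, b):
        if len(row) != len(z):
            raise ValueError("each row of W must match the length of z")
        s_j = sum(w * z_t for w, z_t in zip(row, z)) + b_j
        out.append(sigma(s_j))
    return out


# Example with n_m = 3 inputs and n_{m+1} = 2 outputs.
W = [[1.0, 0.0, -1.0],
     [0.5, 0.5, 0.5]]
b = [0.0, -1.5]
z = [2.0, 1.0, 2.0]
z_next = layer_forward(W, b, z)
print(z_next)
```

With these numbers $s = (0, 1)$, so the output is $(\sigma(0), \sigma(1))$.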
$$\sum_{i=1}^{l} \xi(\boldsymbol{z}^{L+1,i}(\theta); \boldsymbol{y}^i, \boldsymbol{x}^i).$$
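The sum of per-instance losses can be sketched as follows. The squared loss is only an assumed stand-in for $\xi$, and `predict`, `squared_loss`, and `total_loss` are hypothetical names; `predict(x)` plays the role of the network output $z^{L+1,i}(\theta)$ for input $x^i$.

```python
def squared_loss(z_out, y):
    """Assumed loss xi: squared Euclidean distance between output and label."""
    return sum((a - t) ** 2 for a, t in zip(z_out, y))


def total_loss(predict, data, xi=squared_loss):
    """Sum xi(z^{L+1,i}; y^i, x^i) over all (y^i, x^i) pairs in data."""
    return sum(xi(predict(x), y) for y, x in data)


def predict(x):
    """Toy stand-in for the network: doubles its single input feature."""
    return [2.0 * x[0]]


# Two instances, each given as (label y^i, input x^i).
data = [([1.0], [1.0]),
        ([4.0], [2.0])]
loss = total_loss(predict, data)
print(loss)
```

The first instance contributes $(2 - 1)^2$ and the second contributes $(4 - 4)^2$.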
Optimization problem for convolutional neural networks (CNN)
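$$
\begin{bmatrix}
w^j_{1,1,1} & \cdots & w^j_{1,h,1} \\
\vdots & \ddots & \vdots \\
w^j_{h,1,1} & \cdots & w^j_{h,h,1}
\end{bmatrix}
\;\cdots\;
\begin{bmatrix}
w^j_{1,1,d^{\text{in}}} & \cdots & w^j_{1,h,d^{\text{in}}} \\
\vdots & \ddots & \vdots \\
w^j_{h,1,d^{\text{in}}} & \cdots & w^j_{h,h,d^{\text{in}}}
\end{bmatrix}
$$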
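[Figure: a 3 × 3 input with entries indexed (1,1,1), (1,2,1), (1,3,1); (2,1,1), (2,2,1), (2,3,1); (3,1,1), (3,2,1), (3,3,1), and a 2 × 2 output with entries $s^{\text{out},i}_{1,1,j}$, $s^{\text{out},i}_{1,2,j}$; $s^{\text{out},i}_{2,1,j}$, $s^{\text{out},i}_{2,2,j}$.]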
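$$
\begin{bmatrix}
z^i_{1,1,d} & \cdots & z^i_{1,h,d} \\
\vdots & \ddots & \vdots \\
z^i_{h,1,d} & \cdots & z^i_{h,h,d}
\end{bmatrix}
$$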
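$$
s^{\text{out},i}_{1,1,j} = \sum_{d=1}^{d^{\text{in}}}
\left\langle
\begin{bmatrix}
z^i_{1,1,d} & \cdots & z^i_{1,h,d} \\
\vdots & \ddots & \vdots \\
z^i_{h,1,d} & \cdots & z^i_{h,h,d}
\end{bmatrix},
\begin{bmatrix}
w^j_{1,1,d} & \cdots & w^j_{1,h,d} \\
\vdots & \ddots & \vdots \\
w^j_{h,1,d} & \cdots & w^j_{h,h,d}
\end{bmatrix}
\right\rangle + b_j
$$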
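$$
\begin{bmatrix}
z^i_{1+s,1,d} & \cdots & z^i_{1+s,h,d} \\
\vdots & \ddots & \vdots \\
z^i_{h+s,1,d} & \cdots & z^i_{h+s,h,d}
\end{bmatrix}
$$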
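$$
s^{\text{out},i}_{2,1,j} = \sum_{d=1}^{d^{\text{in}}}
\left\langle
\begin{bmatrix}
z^i_{1+s,1,d} & \cdots & z^i_{1+s,h,d} \\
\vdots & \ddots & \vdots \\
z^i_{h+s,1,d} & \cdots & z^i_{h+s,h,d}
\end{bmatrix},
\begin{bmatrix}
w^j_{1,1,d} & \cdots & w^j_{1,h,d} \\
\vdots & \ddots & \vdots \\
w^j_{h,1,d} & \cdots & w^j_{h,h,d}
\end{bmatrix}
\right\rangle + b_j
$$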
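The sliding inner products can be sketched directly: each output entry is the sum over channels of the inner product between an $h \times h$ sub-image, shifted by multiples of the stride $s$, and the filter. This is an illustrative sketch under assumptions: the function name `conv_channel`, the nested-list layout `Z[d][p][q]`, and the additive bias are not taken from the slides.

```python
def conv_channel(Z, w, bias, s):
    """Compute one output channel by scanning h x h sub-images with stride s.

    Z[d][p][q] is the input entry in channel d at position (p, q), 0-based.
    w[d][u][v] is the filter entry in channel d at offset (u, v).
    """
    d_in = len(Z)
    a_in, b_in = len(Z[0]), len(Z[0][0])
    h = len(w[0])
    a_out = (a_in - h) // s + 1
    b_out = (b_in - h) // s + 1
    S = [[0.0] * b_out for _ in range(a_out)]
    for p in range(a_out):
        for q in range(b_out):
            total = bias
            for d in range(d_in):
                for u in range(h):
                    for v in range(h):
                        total += Z[d][p * s + u][q * s + v] * w[d][u][v]
            S[p][q] = total
    return S


# One 3 x 3 channel and a 2 x 2 filter give a 2 x 2 output when s = 1.
Z = [[[1, 2, 3],
      [4, 5, 6],
      [7, 8, 9]]]
w = [[[1, 0],
      [0, 1]]]
S = conv_channel(Z, w, bias=0.0, s=1)
print(S)
```

Here the filter adds the top-left and bottom-right entries of each sub-image, so the first output entry is 1 + 5.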
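$$
Z^{\text{in},i} = \begin{bmatrix}
z^i_{1,1,1} & z^i_{2,1,1} & \cdots & z^i_{a^{\text{in}},b^{\text{in}},1} \\
\vdots & \vdots & \ddots & \vdots \\
z^i_{1,1,d^{\text{in}}} & z^i_{2,1,d^{\text{in}}} & \cdots & z^i_{a^{\text{in}},b^{\text{in}},d^{\text{in}}}
\end{bmatrix}
$$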
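$$
W = \begin{bmatrix}
w^1_{1,1,1} & w^1_{2,1,1} & \cdots & w^1_{h,h,d^{\text{in}}} \\
\vdots & \vdots & \ddots & \vdots \\
w^{d^{\text{out}}}_{1,1,1} & w^{d^{\text{out}}}_{2,1,1} & \cdots & w^{d^{\text{out}}}_{h,h,d^{\text{in}}}
\end{bmatrix}
$$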
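$$
\phi(Z^{\text{in},i}) = \begin{bmatrix}
z^i_{1,1,1} & z^i_{1+s,1,1} & \cdots & z^i_{1+(a^{\text{out}}-1)s,\,1+(b^{\text{out}}-1)s,\,1} \\
z^i_{2,1,1} & z^i_{2+s,1,1} & \cdots & z^i_{2+(a^{\text{out}}-1)s,\,1+(b^{\text{out}}-1)s,\,1} \\
\vdots & \vdots & \ddots & \vdots \\
z^i_{h,h,1} & z^i_{h+s,h,1} & \cdots & z^i_{h+(a^{\text{out}}-1)s,\,h+(b^{\text{out}}-1)s,\,1} \\
\vdots & \vdots & \ddots & \vdots \\
z^i_{h,h,d^{\text{in}}} & z^i_{h+s,h,d^{\text{in}}} & \cdots & z^i_{h+(a^{\text{out}}-1)s,\,h+(b^{\text{out}}-1)s,\,d^{\text{in}}}
\end{bmatrix}
$$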
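A sketch of how such a matrix of sub-images could be built, with each column holding one $h \times h \times d^{\text{in}}$ sub-image and the first index varying fastest, as in the ordering above. The names `phi` and `conv_as_matmul` and the layout `Z[d][p][q]` are illustrative assumptions. Multiplying a weight matrix by the result, then adding a bias to every column, is one common way to express convolution as a matrix product.

```python
def phi(Z, h, s):
    """Stack every h x h x d_in sub-image of Z as one column.

    Z[d][p][q] is the entry in channel d at position (p, q), 0-based.
    Within a column the first index varies fastest, then the second,
    then the channel; columns follow the same first-index-fastest order.
    """
    d_in = len(Z)
    a_in, b_in = len(Z[0]), len(Z[0][0])
    a_out = (a_in - h) // s + 1
    b_out = (b_in - h) // s + 1
    columns = []
    for q in range(b_out):
        for p in range(a_out):
            columns.append([Z[d][p * s + u][q * s + v]
                            for d in range(d_in)
                            for v in range(h)
                            for u in range(h)])
    # Transpose the list of columns into a list of rows.
    return [list(row) for row in zip(*columns)]


def conv_as_matmul(W, b, Z, h, s):
    """Return W * phi(Z) with bias b[j] added to every entry of row j."""
    P = phi(Z, h, s)
    n_cols = len(P[0])
    return [[sum(w_t * P[t][c] for t, w_t in enumerate(W_row)) + b_j
             for c in range(n_cols)]
            for W_row, b_j in zip(W, b)]


Z = [[[1, 2, 3],
      [4, 5, 6],
      [7, 8, 9]]]
P = phi(Z, h=2, s=1)
# One filter whose (1,1) and (2,2) weights are 1, flattened first-index-fastest.
S = conv_as_matmul([[1, 0, 0, 1]], [0.0], Z, h=2, s=1)
print(P)
print(S)
```

Each row of `S` is one output channel, flattened in the same column order as `phi`.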
$$\min_{\theta} f(\theta),$$
$$\sum_{i=1}^{l} \xi(\boldsymbol{z}^{L+1,i}(\theta); \boldsymbol{y}^i, Z^{1,i})$$
[Figure: padding of size $p$ around an $a^{\text{in}} \times b^{\text{in}}$ input.]
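A minimal padding sketch for one channel. It assumes the padding value is zero and that $p$ rows and columns are added on every side; both are assumptions, and `pad_channel` is a hypothetical helper name.

```python
def pad_channel(channel, p, value=0.0):
    """Surround an a_in x b_in channel with p rows and columns of `value`."""
    b_in = len(channel[0])
    width = b_in + 2 * p
    top = [[value] * width for _ in range(p)]
    middle = [[value] * p + list(row) + [value] * p for row in channel]
    bottom = [[value] * width for _ in range(p)]
    return top + middle + bottom


padded = pad_channel([[1, 2],
                      [3, 4]], p=1)
for row in padded:
    print(row)
```

Under these assumptions, a 2 × 2 channel with $p = 1$ becomes 4 × 4.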
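$$P^{i}_{\text{pool}} \operatorname{vec}(Z^{\text{in},i})$$
$$P^{i}_{\text{pool}} = M^{i} P_{\phi} \in R^{d^{\text{out}} a^{\text{out}} b^{\text{out}} \times d^{\text{in}} a^{\text{in}} b^{\text{in}}}.$$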
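One way to read a pooling matrix is as a 0/1 selection: each row picks one entry of the vectorized input. The sketch below assumes max pooling over non-overlapping $h \times h$ blocks of a single channel whose sides are divisible by $h$. None of these choices is stated in the surviving text, and `max_pool_selection` is a hypothetical name. It returns the pooled values together with the flat index each row would select, with the first index varying fastest.

```python
def max_pool_selection(channel, h):
    """Max-pool non-overlapping h x h blocks of one channel.

    Returns (values, picks), where picks[k] is the first-index-fastest
    flat position of the entry chosen for output k. The picks depend on
    the data, so the implied selection matrix differs per instance.
    """
    a_in, b_in = len(channel), len(channel[0])
    a_out, b_out = a_in // h, b_in // h
    values, picks = [], []
    for q in range(b_out):
        for p in range(a_out):
            best = None
            for v in range(h):
                for u in range(h):
                    r, c = p * h + u, q * h + v
                    if best is None or channel[r][c] > channel[best[0]][best[1]]:
                        best = (r, c)
            values.append(channel[best[0]][best[1]])
            picks.append(best[0] + best[1] * a_in)
    return values, picks


channel = [[1, 5, 2, 0],
           [3, 4, 8, 7],
           [9, 6, 1, 1],
           [2, 0, 3, 4]]
values, picks = max_pool_selection(channel, h=2)
print(values, picks)
```

Reading the channel in the same flat order and indexing it with `picks` reproduces `values`, which is what multiplying by the selection matrix would do.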
$a^m_{\text{pad}}, b^m_{\text{pad}}$: size after padding
$a^m_{\text{conv}}, b^m_{\text{conv}}$: size after convolution.
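The two sizes can be computed as below. The sketch assumes padding $p$ is added on both sides of each dimension and that the filter of size $h$ moves with stride $s$, so the last sub-image must satisfy $h + (a_{\text{conv}} - 1)s \le a_{\text{pad}}$. The function names are hypothetical.

```python
def padded_size(a, b, p):
    """Size after adding p rows/columns on every side (assumed symmetric)."""
    return a + 2 * p, b + 2 * p


def conv_size(a_pad, b_pad, h, s):
    """Number of h x h sub-images along each dimension with stride s."""
    return (a_pad - h) // s + 1, (b_pad - h) // s + 1


a_pad, b_pad = padded_size(32, 32, p=2)
a_conv, b_conv = conv_size(a_pad, b_pad, h=5, s=1)
print((a_pad, b_pad), (a_conv, b_conv))
```

With a 32 × 32 input, $p = 2$, $h = 5$, and $s = 1$, the padded size is 36 × 36 and the convolution output returns to 32 × 32.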
[Figure: sizes within layer $m$: $(a^m_{\text{pad}}, b^m_{\text{pad}}, d^m)$ after padding and $(a^m_{\text{conv}}, b^m_{\text{conv}}, d^{m+1})$ after convolution.]
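$P^m_{\text{pad}}, P^m_{\phi}, P^{m,i}_{\text{pool}}, W^m, \boldsymbol{b}^m.$
$$
S^{m,i} = W^m \operatorname{mat}\!\left(P^m_{\phi} P^m_{\text{pad}} \operatorname{vec}(Z^{m,i})\right)_{h^m h^m d^m \times a^m_{\text{conv}} b^m_{\text{conv}}} + \boldsymbol{b}^m \mathbf{1}^T_{a^m_{\text{conv}} b^m_{\text{conv}}}
$$
$$
Z^{m+1,i} = \operatorname{mat}\!\left(P^{m,i}_{\text{pool}} \operatorname{vec}(\sigma(S^{m,i}))\right)_{d^{m+1} \times a^{m+1} b^{m+1}},
$$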
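$$
W^m = \begin{bmatrix}
w^m_{11} & w^m_{12} & \cdots & w^m_{1 n_m} \\
w^m_{21} & w^m_{22} & \cdots & w^m_{2 n_m} \\
\vdots & \vdots & & \vdots \\
w^m_{n_{m+1} 1} & w^m_{n_{m+1} 2} & \cdots & w^m_{n_{m+1} n_m}
\end{bmatrix}_{n_{m+1} \times n_m}
\qquad
\boldsymbol{b}^m = \begin{bmatrix}
b^m_1 \\
b^m_2 \\
\vdots \\
b^m_{n_{m+1}}
\end{bmatrix}_{n_{m+1} \times 1}
$$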
Discussion