CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks
Jimmy Ba
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 1 / 57
CSC413/2516 Lecture 7: Generalization & Recurrent Neural - - PowerPoint PPT Presentation
CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks Jimmy Ba Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 1 / 57 Overview Weve focused so far on how to optimize neural nets how to get
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 1 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 2 / 57
x t M = 1 1 −1 1 x t M = 3 1 −1 1 x t M = 9 1 −1 1
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 3 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 4 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 5 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 6 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 7 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 8 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 9 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 10 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 11 / 57
— Hinton, Coursera lectures — Bishop, Pattern Recognition and Machine Learning Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 12 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 13 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 14 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 15 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 16 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 17 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 18 / 57
— Goodfellow et al., Deep Learning Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 19 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 20 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 21 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 21 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 21 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 22 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 23 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 24 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 25 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 26 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 27 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 28 / 57
2 2 2 w=1 w=1
1.5 1.5 w=1 w=1 1 2.5 2.5 w=1 w=1 1 3.5 3.5 w=1 w=1 T=1 T=2 T=3 T=4 w=1 w=1 w=1
input unit linear hidden unit linear
unit
w=1 w=1 w=1
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 29 / 57
input unit 1 linear hidden unit logistic
unit
input unit 2
1.00
0.92
0.03
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 30 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 31 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 32 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 33 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 34 / 57
http://machinelearning.wustl.edu/mlpapers/paper_files/ICML2011Martens_532.pdf Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 35 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 36 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 36 / 57
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, K. Cho, B. van Merrienboer,
Sequence to Sequence Learning with Neural Networks, Ilya Sutskever, Oriol Vinyals and Quoc Le, NIPS 2014. Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 37 / 57
Input:
j=8584 for x in range(8): j+=920 b=(1500+j) print((b+7567))
Target: 25011. Input:
i=8827 c=(i-5347) print((c+8704) if 2641<8500 else 5308)
Target: 1218.
Input:
vqppkn sqdvfljmnc y2vxdddsepnimcbvubkomhrpliibtwztbljipcc
Target: hkhpg
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 38 / 57
Input: print(6652). Target: 6652. ”Baseline” prediction: 6652. ”Naive” prediction: 6652. ”Mix” prediction: 6652. ”Combined” prediction: 6652. Input: d=5446 for x in range(8):d+=(2678 if 4803<2829 else 9848) print((d if 5935<4845 else 3043)). Target: 3043. ”Baseline” prediction: 3043. ”Naive” prediction: 3043. ”Mix” prediction: 3043. ”Combined” prediction: 3043. print((5997-738)). Target: 5259. ”Baseline” prediction: 5101. ”Naive” prediction: 5101. ”Mix” prediction: 5249. ”Combined” prediction: 5229. Input: print(((1090-3305)+9466)). Target: 7251. ”Baseline” prediction: 7111. ”Naive” prediction: 7099. ”Mix” prediction: 7595. ”Combined” prediction: 7699.
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 39 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 40 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 41 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 42 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 43 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 44 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 45 / 57
– Geoffrey Hinton, Coursera
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 46 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 47 / 57
— Goodfellow et al., Deep Learning Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 48 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 49 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 50 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 51 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 52 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 53 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 54 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 55 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 56 / 57
Jimmy Ba CSC413/2516 Lecture 7: Generalization & Recurrent Neural Networks 57 / 57