CS 4803 / 7643: Deep Learning
Zsolt Kira Georgia Tech
Topics:
– Backpropagation – Vector/Matrix/Tensor math – Deriving vectorized gradients for ReLU
CS 4803 / 7643: Deep Learning Topics: Backpropagation - - PowerPoint PPT Presentation
CS 4803 / 7643: Deep Learning Topics: Backpropagation Vector/Matrix/Tensor math Deriving vectorized gradients for ReLU Zsolt Kira Georgia Tech Administrivia PS1/HW1 out Start thinking about project topics/teams (C)
– Backpropagation – Vector/Matrix/Tensor math – Deriving vectorized gradients for ReLU
(C) Dhruv Batra & Zsolt Kira 2
(C) Dhruv Batra & Zsolt Kira 3
(C) Dhruv Batra & Zsolt Kira 4
(C) Dhruv Batra & Zsolt Kira 6
Slide Credit: Marc'Aurelio Ranzato
(C) Dhruv Batra & Zsolt Kira 7
(C) Dhruv Batra & Zsolt Kira 8
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 9
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 10
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 11
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 12
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 13
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 14
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 15
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
(C) Dhruv Batra & Zsolt Kira 16
Slide Credit: Marc'Aurelio Ranzato, Yann LeCun
17
Upstream gradient Local gradient
Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n
add gate: gradient distributor max gate: gradient router mul gate: gradient switcher
Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n
(C) Dhruv Batra and Zsolt Kira 19
– Several portions will have parameters
function to define our objective
modularize computation
– Back propagation = Gradient Descent + Chain Rule
– Work through mathematical view – Vectors, matrices, and tensors – Next time: Can the computer do this for us automatically?
– https://explained.ai/matrix-calculus/index.html – https://www.cc.gatech.edu/classes/AY2020/cs7643_fall/slides/L5_gradients _notes.pdf
– https://explained.ai/matrix-calculus/index.html – https://www.cc.gatech.edu/classes/AY2020/cs7643_fall/slide s/L5_gradients_notes.pdf
– Scalar Case – Vector Case – Jacobian view – Graphical view – Tensors
(C) Dhruv Batra & Zsolt Kira 20
(C) Dhruv Batra & Zsolt Kira 21