SLIDE 1
Math for VGG
1 Intro
I am writing this to help you understand what the code is doing. It’s still work in progress, toward making it more reader
- friendly. At this point, it’s just a bunch of math formulas you do not want to follow.
2 Notational Conventions
- Since there are lots of variables that take multiple indices, it would be difficult to parse them if we use subscripts
for indices. We therefore put indices in parens, like x(i, j, k, l), instead of subscripts xi,j,k,l. This would be much easier to read.
3 Symbols
Constant Parameters and Indexes
- B (Batch size) : the number of samples in a mini-batch
– 0 ≤ b < B (batch) : an index of a sample in a mini batch
- C (Classes) : the number of classes
– 0 ≤ c < C (class) : an index of a class
- IC (Input Channels) : the number of channels in an input image of a layer (e.g., three if an image has red, green
and blue components) – 0 ≤ ic < IC (input channel index) : an index of a channel in an input image
- OC (Output Channels) : the number of channels in an output image of a layer
– 0 ≤ oc < OC (output channel index) : an index of a channel in an output image
- H (Height) : the number of pixels in a single column of an image
– 0 ≤ i < H (image row index)
- W (Width) : the number of pixels in a single row of an image
– 0 ≤ j < W (image column index)
- K (Kernel size) : half of the number of pixels in a single row or a single column. throughout VGG, K is actually