GoogLeNet
Deeper than deeper
Some slides are from Christian Szegedy
GoogLeNet Deeper than deeper Some slides are from Christian Szegedy - - PowerPoint PPT Presentation
GoogLeNet Deeper than deeper Some slides are from Christian Szegedy GoogLeNet Convolution Pooling Softmax Other GoogLeNet vs Previous GoogLeN Convolution et Pooling Softmax Other Zeiler-Fergus Architecture (1 tower) Why is the deep
Some slides are from Christian Szegedy
Convolution Pooling Softmax Other
GoogLeN et Zeiler-Fergus Architecture (1 tower)
Convolution Pooling Softmax Other
Glorot, X., Bordes, A., & Bengio, Y. (2011). Deep sparse rectifier networks In Proceedings of the 14th International Conference on Artificial Intelligence and
315-323).
Input
Layer 1 Input
Layer 1 Input Layer 2
Layer 1 Input Layer 2 Layer 3
In images, correlations tend to be local
Cover very local clusters by 1x1 convolutions
number of filters
Less spread out correlations
number of filters
Cover more spread out clusters by 3x3 convolutions
number of filters
Cover more spread out clusters by 5x5 convolutions
number of filters
Cover more spread out clusters by 5x5 convolutions
number of filters
A heterogeneous set of convolutions
number of filters
Schematic view (naive version)
number of filters
1x1 convolutions 3x3 convolutions 5x5 convolutions Filter concatenation Previous layer
1x1 convolutions 3x3 convolutions 5x5 convolutions Filter concatenation Previous layer
Naive idea
1x1 convolutions 3x3 convolutions 5x5 convolutions Filter concatenation Previous layer
Naive idea (does not work!)
3x3 max pooling
1x1 convolutions 3x3 convolutions 5x5 convolutions Filter concatenation Previous layer
Inception module
3x3 max pooling 1x1 convolutions 1x1 convolutions 1x1 convolutions
Convolution Pooling Softmax Other
Why does it have so many layers???
Convolution Pooling Softmax Other Network in a network in a network...
Width of inception modules ranges from 256 filters (in early modules) to 1024 in top inception modules. 256 480 480 512 512 512 832 832 1024
Width of inception modules ranges from 256 filters (in early modules) to 1024 in top inception modules.
256 480 480 512 512 512 832 832 1024
Width of inception modules ranges from 256 filters (in early modules) to 1024 in top inception modules.
480 480 512 512 512 832 832 1024
Width of inception modules ranges from 256 filters (in early modules) to 1024 in top inception modules.
480 480 512 512 512 832 832 1024 Computional cost is increased by less than 2X compared to Krizhevsky’s network. (<1.5Bn operations/ evaluation)
Performance break
Classification performance