

SLIDE 1

GoogLeNet

Deeper than deeper

Some slides are from Christian Szegedy

SLIDE 2

GoogLeNet

[Legend: Convolution, Pooling, Softmax, Other]

SLIDE 3

GoogLeNet vs Previous

[Diagrams: GoogLeNet; Zeiler-Fergus architecture (1 tower)]

[Legend: Convolution, Pooling, Softmax, Other]

SLIDE 4

Why is the deep learning revolution arriving just now?

SLIDE 5

Why is the deep learning revolution arriving just now?

SLIDE 6

Why is the deep learning revolution arriving just now?

Rectified Linear Unit

Glorot, X., Bordes, A., & Bengio, Y. (2011). Deep sparse rectifier neural networks. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS), JMLR W&CP Vol. 15, pp. 315-323.
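As a quick illustration (a minimal NumPy sketch, not from the slides):

```python
import numpy as np

def relu(x):
    # Rectified Linear Unit: elementwise max(0, x).
    # Its gradient is 1 wherever x > 0, so deep stacks avoid the
    # saturation that slows training with sigmoid/tanh units.
    return np.maximum(0.0, x)

print(relu(np.array([-2.0, 0.0, 3.0])))  # [0. 0. 3.]
```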

SLIDE 7

Theoretical breakthroughs

Arora, S., Bhaskara, A., Ge, R., & Ma, T. (2014). Provable bounds for learning some deep representations. ICML 2014.

SLIDE 8

Hebbian Principle

[Diagram: Input]

Cells that fire together, wire together

SLIDE 9

Cluster according to activation statistics

[Diagram: Input → Layer 1]

SLIDE 10

Cluster according to correlation statistics

[Diagram: Input → Layer 1 → Layer 2]

SLIDE 11

Cluster according to correlation statistics

[Diagram: Input → Layer 1 → Layer 2 → Layer 3]
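A toy sketch of the clustering idea (illustrative only; GoogLeNet does not literally run this procedure, the architecture approximates the resulting clusters):

```python
import numpy as np

# Record unit activations over many inputs, then group units whose
# activations are highly correlated ("fire together, wire together").
rng = np.random.default_rng(0)
acts = rng.standard_normal((1000, 8))                       # 1000 inputs x 8 units
acts[:, 1] = acts[:, 0] + 0.1 * rng.standard_normal(1000)   # make units 0 and 1 correlate

corr = np.corrcoef(acts.T)                                  # 8x8 correlation matrix
pairs = np.argwhere(np.triu(np.abs(corr) > 0.9, k=1))       # strongly correlated pairs
print(pairs)                                                # [[0 1]] -> units 0, 1 cluster
```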

SLIDE 12

In images, correlations tend to be local

SLIDE 13

Cover very local clusters by 1x1 convolutions

[Diagram: 1x1 filters; y-axis: number of filters]
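A minimal PyTorch sketch (shapes illustrative): a 1x1 convolution is a per-pixel linear map across channels, which is exactly what is needed to pool units whose correlations sit at the same spatial location.

```python
import torch
import torch.nn as nn

# 1x1 convolution: mixes channels at each spatial position without
# looking at any neighborhood. Here it projects 192 channels to 64.
proj = nn.Conv2d(in_channels=192, out_channels=64, kernel_size=1)

x = torch.randn(1, 192, 28, 28)   # (batch, channels, height, width)
print(proj(x).shape)              # torch.Size([1, 64, 28, 28])
```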

SLIDE 14

Less spread out correlations

[Diagram: 1x1 filters; y-axis: number of filters]

SLIDE 15

Cover more spread out clusters by 3x3 convolutions

[Diagram: 1x1 and 3x3 filters; y-axis: number of filters]

SLIDE 16

Cover more spread out clusters by 5x5 convolutions

[Diagram: 1x1 and 3x3 filters; y-axis: number of filters]

SLIDE 17

Cover more spread out clusters by 5x5 convolutions

[Diagram: 1x1, 3x3, and 5x5 filters; y-axis: number of filters]

SLIDE 18

A heterogeneous set of convolutions

[Diagram: 1x1, 3x3, and 5x5 filters; y-axis: number of filters]

SLIDE 19

Schematic view (naive version)

[Diagram: 1x1, 3x3, and 5x5 filters; y-axis: number of filters]

[Schematic (naive version): Previous layer → 1x1 convolutions, 3x3 convolutions, 5x5 convolutions → Filter concatenation]
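A minimal sketch of the naive module in PyTorch (channel counts are illustrative, not the paper's):

```python
import torch
import torch.nn as nn

class NaiveInception(nn.Module):
    """Naive version: parallel 1x1 / 3x3 / 5x5 convolutions over the
    same input, concatenated along the channel axis."""
    def __init__(self, c_in, c1, c3, c5):
        super().__init__()
        self.b1 = nn.Conv2d(c_in, c1, kernel_size=1)
        self.b3 = nn.Conv2d(c_in, c3, kernel_size=3, padding=1)  # keep H x W
        self.b5 = nn.Conv2d(c_in, c5, kernel_size=5, padding=2)  # keep H x W
    def forward(self, x):
        return torch.cat([self.b1(x), self.b3(x), self.b5(x)], dim=1)

x = torch.randn(1, 192, 28, 28)
print(NaiveInception(192, 64, 128, 32)(x).shape)  # [1, 224, 28, 28]
```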

SLIDE 20

Naive idea

[Schematic: Previous layer → 1x1 convolutions, 3x3 convolutions, 5x5 convolutions → Filter concatenation]

SLIDE 21

Naive idea (does not work!)

[Schematic: Previous layer → 1x1 convolutions, 3x3 convolutions, 5x5 convolutions, 3x3 max pooling → Filter concatenation]

(The pooling path keeps all input channels, so the concatenated output grows wider after every module, and the 5x5 convolutions over such wide inputs become prohibitively expensive.)

SLIDE 22

Inception module

[Schematic: Previous layer → 1x1 convolutions; 1x1 convolutions → 3x3 convolutions; 1x1 convolutions → 5x5 convolutions; 3x3 max pooling → 1x1 convolutions; all branches → Filter concatenation]
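A minimal PyTorch sketch; the channel counts below follow the paper's inception(3a) block (ReLUs after the 3x3/5x5 convolutions are omitted for brevity):

```python
import torch
import torch.nn as nn

class Inception(nn.Module):
    """Inception module with 1x1 dimension reductions before the
    expensive 3x3 and 5x5 convolutions, and after the pooling path."""
    def __init__(self, c_in, c1, c3r, c3, c5r, c5, cp):
        super().__init__()
        self.b1 = nn.Conv2d(c_in, c1, kernel_size=1)
        self.b3 = nn.Sequential(nn.Conv2d(c_in, c3r, 1), nn.ReLU(inplace=True),
                                nn.Conv2d(c3r, c3, 3, padding=1))
        self.b5 = nn.Sequential(nn.Conv2d(c_in, c5r, 1), nn.ReLU(inplace=True),
                                nn.Conv2d(c5r, c5, 5, padding=2))
        self.bp = nn.Sequential(nn.MaxPool2d(3, stride=1, padding=1),
                                nn.Conv2d(c_in, cp, 1))
    def forward(self, x):
        return torch.cat([self.b1(x), self.b3(x), self.b5(x), self.bp(x)], dim=1)

x = torch.randn(1, 192, 28, 28)                       # input to inception(3a)
m = Inception(192, c1=64, c3r=96, c3=128, c5r=16, c5=32, cp=32)
print(m(x).shape)                                     # [1, 256, 28, 28]
```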

SLIDE 23

Inception

[Legend: Convolution, Pooling, Softmax, Other]

Why does it have so many layers?

SLIDE 24

Inception

9 Inception modules

[Legend: Convolution, Pooling, Softmax, Other]

Network in a network in a network...

SLIDE 25

Inception

Width of inception modules ranges from 256 filters (in early modules) to 1024 in the top inception modules.

(Module output widths: 256, 480, 480, 512, 512, 512, 832, 832, 1024)

SLIDE 26

Inception

Width of inception modules ranges from 256 filters (in early modules) to 1024 in the top inception modules.

  • Can remove fully connected layers on top completely

(Module output widths: 256, 480, 480, 512, 512, 512, 832, 832, 1024)

SLIDE 27

Inception

Width of inception modules ranges from 256 filters (in early modules) to 1024 in the top inception modules.

  • Can remove fully connected layers on top completely
  • Number of parameters is reduced to 5 million

(Module output widths: 256, 480, 480, 512, 512, 512, 832, 832, 1024)

SLIDE 28

Inception

Width of inception modules ranges from 256 filters (in early modules) to 1024 in the top inception modules.

  • Can remove fully connected layers on top completely
  • Number of parameters is reduced to 5 million
  • Computational cost is increased by less than 2x compared to Krizhevsky's network (<1.5 billion operations per evaluation)

(Module output widths: 256, 480, 480, 512, 512, 512, 832, 832, 1024)
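A back-of-the-envelope check of why the 1x1 reductions keep the cost down, using the inception(3a) numbers from the paper (biases omitted):

```python
# 5x5 branch of inception(3a): 192 input channels, 32 output filters.
direct  = 5 * 5 * 192 * 32             # no reduction: 153,600 weights
reduced = 192 * 16 + 5 * 5 * 16 * 32   # 1x1 down to 16 channels first: 15,872 weights
print(direct, reduced, round(direct / reduced, 1))   # ~9.7x fewer parameters
```

The same ratio applies to multiply-adds per spatial position, which is how the whole network stays under ~1.5 billion operations per evaluation.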

SLIDE 29

Efficient Gradient Propagation

  • Shallow networks can already provide good performance
  • Auxiliary classifiers connected to intermediate layers
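In training, the auxiliary losses are added to the main loss with a small weight (0.3 in the paper) and the side branches are discarded at inference. A minimal sketch (function name is illustrative):

```python
import torch.nn.functional as F

def googlenet_loss(main_logits, aux1_logits, aux2_logits, target):
    # Main softmax loss plus two auxiliary losses computed from
    # classifiers attached to intermediate layers; the 0.3 weight
    # follows the paper. Aux heads are dropped at inference time.
    loss = F.cross_entropy(main_logits, target)
    loss = loss + 0.3 * F.cross_entropy(aux1_logits, target)
    loss = loss + 0.3 * F.cross_entropy(aux2_logits, target)
    return loss
```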
SLIDE 30

Performance breakdown

Multiple Models and Crops
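At test time the paper averages softmax probabilities over multiple trained models (7) and multiple crops per image (up to 144). A minimal PyTorch sketch of that averaging (function name illustrative):

```python
import torch

def ensemble_predict(models, crops):
    # models: list of trained networks; crops: (n_crops, C, H, W)
    # for one image. Average probabilities over crops, then models.
    probs = []
    for model in models:
        model.eval()
        with torch.no_grad():
            logits = model(crops)                        # (n_crops, n_classes)
            probs.append(logits.softmax(dim=1).mean(0))  # average over crops
    return torch.stack(probs).mean(0)                    # average over models
```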

SLIDE 31

Classification performance

SLIDE 32

Where Are We Now

SLIDE 33
SLIDE 34

Where Are We Now

  • It is very hard for humans
  • Even if the number of choices is reduced to 1000
SLIDE 35

Where Are We Now

  • It is very hard for humans, even if the number of choices is reduced to 1000
  • It is time consuming: about 1 image per minute
  • Human performance: 13-15% error without training, 5.1% with training
  • GoogLeNet: 6.7%