SLIDE 1

Administrative

  • In-class midterm this Wednesday! (More on this in a bit)
  • Assignment #3: out Wed
  • Sample midterm will be up in a few hours

SLIDE 2

Lecture 10: Squeezing out the last few percent & Training ConvNets in practice

SLIDE 3

Midterm during next class!

  • Everything in the notes (unless labeled as an aside) is fair game.
  • Everything in the slides (until and including last lecture) is fair game.
  • Everything in the assignments is fair game.
  • There will be no Python/numpy/vectorization questions.
  • There will be no questions that require you to know specific details of covered papers, but takeaways presented in class are fair game.

What it does include:

  • Conceptual/understanding questions (e.g. like the ones I like to ask during lectures)

  • Design/Tips&Tricks/Debugging questions and intuitions
  • Know your Calculus


SLIDE 4

Where we are...

SLIDE 5

Transfer Learning with ConvNets

SLIDE 6

A bit more about small filters

SLIDE 7

The power of small filters

Suppose we stack two CONV layers with receptive field size 3x3 (and stride 1) => Each neuron in the 1st CONV layer sees a 3x3 region of the input. (Figure: 1st CONV neuron's view of the input.)

SLIDE 8

The power of small filters

Suppose we stack two CONV layers with receptive field size 3x3 => Each neuron in the 1st CONV layer sees a 3x3 region of the input. Q: What region of the input does each neuron in the 2nd CONV layer see? (Figure: 2nd CONV neuron's view of the 1st CONV output.)

SLIDE 9

The power of small filters

Suppose we stack two CONV layers with receptive field size 3x3 => Each neuron in the 1st CONV layer sees a 3x3 region of the input. Q: What region of the input does each neuron in the 2nd CONV layer see?

Answer: [5x5] (the 3x3 1st-CONV neurons it looks at each span 3 input pixels and are offset by the stride of 1, so together they span 3 + 2 = 5 pixels per side)

SLIDE 10

The power of small filters

Suppose we stack three CONV layers with receptive field size 3x3. Q: What region of the input does each neuron in the 3rd CONV layer see?

(Figure: 3rd CONV neuron's view of the 2nd CONV output.)

SLIDE 11

The power of small filters

Suppose we stack three CONV layers with receptive field size 3x3. Q: What region of the input does each neuron in the 3rd CONV layer see?

Answer: [7x7]
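
The reason: each additional stride-1 CONV layer with k x k filters grows the receptive field on the input by k - 1 pixels per side, so three 3x3 layers see 1 + 3 * 2 = 7 pixels. A minimal sketch of this arithmetic (assuming stride 1 throughout, as on these slides):

    def receptive_field(num_layers, k=3):
        # Each stride-1 CONV layer with a kxk filter adds (k - 1) pixels
        # to the effective receptive field on the input.
        return 1 + num_layers * (k - 1)

    print(receptive_field(1))  # 3 -> [3x3]
    print(receptive_field(2))  # 5 -> [5x5]
    print(receptive_field(3))  # 7 -> [7x7]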

SLIDE 12

The power of small filters

Suppose the input has depth C & we want output depth C as well.
1x CONV with 7x7 filters. Number of weights: ?
3x CONV with 3x3 filters. Number of weights: ?

SLIDE 13

The power of small filters

Suppose the input has depth C & we want output depth C as well.
1x CONV with 7x7 filters. Number of weights: C*(7*7*C) = 49 C^2
3x CONV with 3x3 filters. Number of weights: ?

SLIDE 14

The power of small filters

Suppose the input has depth C & we want output depth C as well.
1x CONV with 7x7 filters. Number of weights: C*(7*7*C) = 49 C^2
3x CONV with 3x3 filters. Number of weights: C*(3*3*C) + C*(3*3*C) + C*(3*3*C) = 3 * 9 * C^2 = 27 C^2

SLIDE 15

The power of small filters

Suppose the input has depth C & we want output depth C as well.
1x CONV with 7x7 filters. Number of weights: C*(7*7*C) = 49 C^2
3x CONV with 3x3 filters. Number of weights: C*(3*3*C) + C*(3*3*C) + C*(3*3*C) = 3 * 9 * C^2 = 27 C^2

Fewer parameters and more nonlinearities = GOOD.
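
The same counting as a quick sketch you can play with (shapes as on the slide: depth C in and out, biases ignored):

    def conv_weights(k, c_in, c_out):
        # One CONV layer: c_out filters, each of size k x k x c_in.
        return c_out * (k * k * c_in)

    C = 64  # any depth works; the ratio is always 49 : 27
    print(conv_weights(7, C, C))      # 49 * C^2 = 200704
    print(3 * conv_weights(3, C, C))  # 27 * C^2 = 110592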

SLIDE 16

The power of small filters

“More non-linearities” and “deeper” usually give better performance.

[Network in Network, Lin et al. 2013]

SLIDE 17

The power of small filters

“More non-linearities” and “deeper” usually give better performance. => 1x1 CONV! (Usually follows a normal CONV, e.g. [3x3 CONV - 1x1 CONV].)

[Network in Network, Lin et al. 2013]

SLIDE 18

The power of small filters

“More non-linearities” and “deeper” usually give better performance. => 1x1 CONV! (Usually follows a normal CONV, e.g. [3x3 CONV - 1x1 CONV].)

[Network in Network, Lin et al. 2013]

(Figure: a 3x3 CONV's view of the input, followed by a 1x1 CONV's view of the 3x3 CONV's output.)
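
A 1x1 CONV is just a fully-connected layer applied independently at every spatial position, i.e. a matrix multiply over the depth dimension; a minimal numpy sketch (shapes and names are mine, not from the lecture):

    import numpy as np

    def conv1x1(x, w):
        # x: (C_in, H, W) input volume, w: (C_out, C_in) filter weights.
        C_in, H, W = x.shape
        out = w @ x.reshape(C_in, H * W)  # mix channels at each position
        return out.reshape(-1, H, W)      # (C_out, H, W) output volume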

SLIDE 19

The power of small filters

“More non-linearities” and “deeper” usually give better performance. => 1x1 CONV! (Usually follows a normal CONV, e.g. [3x3 CONV - 1x1 CONV].)

[Network in Network, Lin et al. 2013]

SLIDE 20

[Very Deep Convolutional Networks for Large-Scale Image Recognition, Simonyan et al., 2014]

=> Evidence that using 3x3 instead of 1x1 works better

SLIDE 21

The power of small filters

[Fractional max-pooling, Ben Graham, 2014]

SLIDE 22

The power of small filters

[Fractional max-pooling, Ben Graham, 2014]

In an ordinary 2x2 max-pool, the pooling regions are non-overlapping 2x2 squares. Fractional pooling instead samples the pooling regions during the forward pass: a mix of 1x1, 2x1, 1x2, and 2x2 regions.
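
One simple way to sample such regions along one axis, as a hedged sketch of the idea (not Graham's exact pseudo-random scheme):

    import numpy as np

    def sample_pool_edges(n_in, n_out):
        # Choose n_out region widths from {1, 2} that sum to n_in
        # (assumes n_out <= n_in <= 2 * n_out).
        n_twos = n_in - n_out
        widths = np.array([2] * n_twos + [1] * (n_out - n_twos))
        np.random.shuffle(widths)  # re-sampled at every forward pass
        return np.concatenate(([0], np.cumsum(widths)))  # region boundaries

    edges = sample_pool_edges(10, 7)  # region i spans edges[i]:edges[i + 1]

Doing this independently along both axes yields the mix of 1x1, 2x1, 1x2, and 2x2 pooling regions described above.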

SLIDE 23

Data Augmentation

SLIDE 24

Data Augmentation

  • i.e. simulating “fake” data
  • explicitly encoding image transformations that shouldn’t change object identity

(Figure: what the computer sees.)

SLIDE 25

Data Augmentation

  • 1. Flip horizontally

SLIDE 26

Data Augmentation

  • 2. Random crops/scales

Sample these during training (it also helps a lot at test time; e.g. it is common to see even up to 150 crops used).
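
A minimal numpy sketch of flips + random crops (the interface is mine; assumes an H x W x C image array):

    import numpy as np

    def augment(img, crop):
        if np.random.rand() < 0.5:          # 1. random horizontal flip
            img = img[:, ::-1]
        h, w = img.shape[:2]                # 2. random crop of size crop x crop
        y = np.random.randint(h - crop + 1)
        x = np.random.randint(w - crop + 1)
        return img[y:y + crop, x:x + crop]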

SLIDE 27

Data Augmentation

  • 3. Random mixes/combinations of:
  • translation
  • rotation
  • stretching
  • shearing
  • lens distortions, … (go crazy)

SLIDE 28

Data Augmentation

  • 4. Color jittering (maybe even contrast jittering, etc.)
  • Simple: change the contrast by small amounts, jitter the color distributions, etc.
  • Vignetting, ... (go crazy)

SLIDE 29

Data Augmentation

  • 4. Color jittering (maybe even contrast jittering, etc.)
  • Simple: change the contrast by small amounts, jitter the color distributions, etc.

Fancy PCA way:

  • 1. Compute PCA over all [R, G, B] pixel values in the training data
  • 2. Sample a color offset along the principal components at each forward pass
  • 3. Add the offset to all pixels of a training image

(As seen in [Krizhevsky et al. 2012])
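
A sketch of the fancy PCA recipe in numpy (function names and the 0.1 jitter scale are illustrative; [Krizhevsky et al. 2012] describe the exact setup):

    import numpy as np

    # Step 1, done once: pixels is an (N, 3) array of [R, G, B] values.
    def fit_color_pca(pixels):
        eigvals, eigvecs = np.linalg.eigh(np.cov(pixels, rowvar=False))
        return eigvals, eigvecs

    # Steps 2 + 3, at each forward pass: sample an offset, add it everywhere.
    def pca_color_jitter(img, eigvals, eigvecs, sigma=0.1):
        alpha = np.random.normal(0, sigma, size=3)  # strength per component
        offset = eigvecs @ (alpha * eigvals)        # one [R, G, B] offset
        return img + offset                         # img is (H, W, 3)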

SLIDE 30

Notice the more general theme:

  • 1. Introduce a form of randomness in the forward pass
  • 2. Marginalize over the noise distribution during prediction

Examples: Dropout, DropConnect, Fractional Pooling, Data Augmentation, Model Ensembles
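
Dropout is the cleanest instance of this theme; a minimal inverted-dropout sketch, where the train-time scaling makes the plain test-time forward pass equal to the expected output:

    import numpy as np

    p = 0.5  # probability of keeping each unit

    def forward_train(x):
        mask = (np.random.rand(*x.shape) < p) / p  # randomness in the forward pass
        return x * mask

    def forward_test(x):
        return x  # expectation over the noise: no mask, no extra scaling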

SLIDE 31

Training ConvNets in Practice

SLIDE 32

SLIDE 33

Spot the CPU!

SLIDE 34

Spot the CPU!

“central processing unit”

SLIDE 35

Spot the GPU!

“graphics processing unit”

SLIDE 36

Spot the GPU!

“graphics processing unit”: plugs into a PCI Express slot

SLIDE 37

CEO of NVIDIA: Jen-Hsun Huang (Stanford Master’s degree in EE from 1992, by the way)

SLIDE 38

GPUs are very good at local, parallel operations, e.g. in rendering.

SLIDE 39

GPUs can be programmed:

  • CUDA
  • + higher-level APIs (e.g. cuBLAS, cuDNN)

Resources:

  • Interview with Dan Ciresan: http://www.nvidia.com/content/cuda/spotlights/dan-ciresan-idsia.html

  • CUDA@MIT https://sites.google.com/site/cudaiap2009/
  • Intro to Parallel Programming on Udacity https://www.udacity.com/course/cs344

SLIDE 40

Convolutional Neural Networks

  • Basically perfect for GPUs

SLIDE 41

Case study: CONV forward in the Caffe library

SLIDE 42

SLIDE 43

(Figure: im2col stretches each receptive field of the input into a column of a matrix X; the filters are stretched into the rows of a matrix W.)

SLIDE 44

(Figure: im2col as above.) Matrix multiply W*X, then reshape the result back into a volume.

SLIDE 45

Case study: CONV forward in the Caffe library: im2col, then the matrix multiply (a call to cuBLAS), then the bias offset.
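
In numpy the same three steps might look like this: a sketch with no padding and names of my own (Caffe's real implementation lives in C++/CUDA):

    import numpy as np

    def conv_forward_im2col(x, w, b, stride=1):
        # x: (C, H, W) input, w: (F, C, k, k) filters, b: (F,) biases.
        C, H, W = x.shape
        F, _, k, _ = w.shape
        H_out = (H - k) // stride + 1
        W_out = (W - k) // stride + 1
        # im2col: each receptive field becomes one column of the matrix.
        cols = np.empty((C * k * k, H_out * W_out))
        for i in range(H_out):
            for j in range(W_out):
                patch = x[:, i*stride:i*stride + k, j*stride:j*stride + k]
                cols[:, i * W_out + j] = patch.ravel()
        W_mat = w.reshape(F, -1)             # filters stretched as rows
        out = W_mat @ cols + b[:, None]      # matrix multiply + bias offset
        return out.reshape(F, H_out, W_out)  # reshape back into a volume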

SLIDE 46

GPU timings comparison (optimized kernels). All comparisons are against a 12-core Intel E5-2679v2 CPU @ 2.4GHz running Caffe with Intel MKL 11.1.3.

SLIDE 47

E.g. VGG net: ~2-3 weeks of training with 4 GPUs (NVIDIA Titan Blacks, ~$1K).

SLIDE 48

Speeding up Convolutions with FFT

“The Fourier transform of a convolution of two functions is the product of the Fourier transforms of those functions” - convolution theorem

  • 1. Transform the input and the filters with the FFT
  • 2. Perform an elementwise product
  • 3. Inverse-FFT the result back to the original domain

See e.g. [Fast Convolutional Nets With fbfft: A GPU Performance Evaluation]
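
The three steps as a hedged numpy sketch for a single 2D channel (real libraries batch this over images, channels, and filters):

    import numpy as np

    def fft_conv2d(x, f):
        # Pad to the full linear-convolution size so the circular convolution
        # computed via the FFT matches ordinary convolution.
        s = (x.shape[0] + f.shape[0] - 1, x.shape[1] + f.shape[1] - 1)
        X = np.fft.rfft2(x, s)          # 1. transform input and filter
        F = np.fft.rfft2(f, s)
        return np.fft.irfft2(X * F, s)  # 2. elementwise product, 3. inverse FFT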

SLIDE 49

Speeding up Convolutions with FFT

“The Fourier transform of a convolution of two functions is the product of the Fourier transforms of those functions” - convolution theorem

  • 1. Transform the input and the filters with the FFT
  • 2. Perform an elementwise product
  • 3. Inverse-FFT the result back to the original domain

See e.g. [Fast Convolutional Nets With fbfft: A GPU Performance Evaluation]

Unfortunately, FFT conv is slower for small filter sizes :( (backwards from what we want!)

SLIDE 50

Bottlenecks to be aware of

SLIDE 51

GPU - CPU communication is a bottleneck => keep a CPU data-prefetch thread running while the GPU performs the forward/backward pass.
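
A minimal sketch of the prefetch pattern (load_batch and train_step are hypothetical placeholders for the CPU-side loading/augmentation and the GPU-side training step):

    import queue
    import threading

    batches = queue.Queue(maxsize=4)  # bounded buffer of ready batches

    def prefetcher():
        while True:
            batches.put(load_batch())  # hypothetical CPU-side loader reads ahead

    threading.Thread(target=prefetcher, daemon=True).start()

    while True:
        train_step(batches.get())  # hypothetical GPU-side step rarely waits here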

SLIDE 52

CPU - disk bottleneck

Hard disks are slow to read from => store pre-processed images contiguously in files and read them as a raw byte stream, ideally from an SSD. (Moving parts, lol.)
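
A sketch of what reading such a contiguous raw file can look like (the filename, dtype, and image shape are made-up placeholders):

    import numpy as np

    N, shape = 1000000, (3, 227, 227)  # hypothetical: N pre-processed uint8 images
    data = np.memmap('train_images.raw', dtype=np.uint8,
                     mode='r', shape=(N,) + shape)
    batch = data[:256]  # one contiguous, cheap sequential read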

SLIDE 53

GPU memory bottleneck

Tesla K40: 12GB <- currently the max
Titan Black: 6GB
e.g. AlexNet: ~3GB needed with batch size 256

SLIDE 54

Caffe typical numbers:

  • Can train AlexNet on about 40M images / day with an NVIDIA K40 or Titan GPU: ~5ms/image for forward/backward/update (or ~2ms/image for forward only)

SLIDE 55

Google: Pushing CPU to the limit

[Large Scale Distributed Deep Networks, Jeff Dean et al., NIPS 2012]

Data parallelism

SLIDE 56

Google: Pushing CPU to the limit

[Large Scale Distributed Deep Networks, Jeff Dean et al., NIPS 2012]

Model parallelism + Data parallelism

SLIDE 57

Multi-GPU training, e.g. cuda-convnet2

Observation:

  • CONV layers contain 90-95% of the compute but only about 5% of the parameters
  • FC layers contain 5-10% of the compute but ~95% of the parameters

=> The trick: data parallelism for the CONV layers, model parallelism for the FC layers.

[One weird trick for parallelizing convolutional neural networks, Krizhevsky 2014]; also see: [Deep learning with COTS HPC systems, Coates et al. 2013]

SLIDE 58

[Deep Image: Scaling up Image Recognition, Ren Wu et al. 2015] (Baidu)

When Computer Vision papers start to look like Systems papers...

SLIDE 59

[Deep Image: Scaling up Image Recognition, Ren Wu et al. 2015] (Baidu)

When Computer Vision papers start to look like Systems papers...

Brute-force approach:

  • Many AlexNet-like models at different resolutions
  • Strong data augmentations

ImageNet classification Hit@5 error: 5.33% (Recall, human error is ~5.1%, and optimistic human error is ~3%)

SLIDE 60

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

[Kaiming He et al., 2015] (MSR)

4.94% error

+ Careful initialization of the weights

SLIDE 61

Summary:

  • We discussed why small filters are a good idea: they pack in more non-linearities and decrease the number of parameters
  • We talked about Data Augmentation
  • We noticed that many ConvNets take advantage of noise in the forward pass + at test time evaluate the expected output w.r.t. the noise distributions
  • We talked about ConvNets in practice: the CPU/GPU and CPU/disk bottlenecks, CUDA, etc.

SLIDE 62

Next Lecture:

(in-class) midterm!