Convolutional Neural Nets II
EECS 442 – Prof. David Fouhey Winter 2019, University of Michigan
http://web.eecs.umich.edu/~fouhey/teaching/EECS442_W19/
Previously – Backpropagation
Example: g(x) = (−x + 3)²
Computation graph: x → −x → −x + 3 → (−x + 3)²
Forward pass: compute the function.
Backward pass: compute the derivative of all parts of the function. By the chain rule: g′(x) = 2(−x + 3) · (−1) = 2x − 6
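The forward/backward decomposition above can be sketched in code (a minimal illustration; the function and block names are my own):

```python
# Hypothetical sketch of the slide's example: g(x) = (-x + 3)^2,
# decomposed into simple blocks so each local derivative is easy.
def forward(x):
    a = -x          # block 1: negate
    b = a + 3       # block 2: add 3
    g = b ** 2      # block 3: square
    return g, (a, b)

def backward(x):
    # Backward pass: multiply local derivatives along the chain.
    _, (a, b) = forward(x)
    dg_db = 2 * b   # d(b^2)/db
    db_da = 1.0     # d(a+3)/da
    da_dx = -1.0    # d(-x)/dx
    return dg_db * db_da * da_dx  # chain rule: equals 2x - 6

print(forward(5.0)[0])  # (-5+3)^2 = 4.0
print(backward(5.0))    # 2*5 - 6 = 4.0
```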
Setting Up A Neural Net
[Diagram: inputs x1, x2; hidden units h1–h4; outputs y1–y3]
Setting Up A Neural Net
[Diagram: inputs x1, x2; hidden layer 1 (a1–a4); hidden layer 2 (h1–h4); outputs y1–y3]
Fully Connected Network
Each neuron connects to each neuron in the previous layer
[Diagram: inputs x1, x2; hidden layers a1–a4 and h1–h4; outputs y1–y3, with every unit connected to every unit in the previous layer]
Fully Connected Network
Define New Block: “Linear Layer”
(Ok technically it’s Affine)
[Diagram: input n and parameters W, b feed a linear block L]
Can get the gradient with respect to all the inputs (derive on your own; useful trick: the shapes have to work out so you can do the matrix multiply)
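As a sketch of the gradients you'd derive, assuming a single input vector (names and shapes are my own choices):

```python
import numpy as np

# A minimal linear (affine) layer sketch: y = W x + b, with the
# gradients the slide asks you to derive. Shapes: W is (m, n),
# x is (n,), b and the upstream gradient dL/dy are (m,).
def linear_forward(W, b, x):
    return W @ x + b

def linear_backward(W, x, dLdy):
    dLdW = np.outer(dLdy, x)  # (m, n): each weight scales one input
    dLdb = dLdy               # bias gradient is just the upstream gradient
    dLdx = W.T @ dLdy         # (n,): the transpose makes the shapes fit
    return dLdW, dLdb, dLdx

W = np.array([[1., 2.], [3., 4.]])
dLdW, dLdb, dLdx = linear_backward(W, np.array([1., 1.]), np.array([1., 0.]))
print(dLdx)  # W.T @ [1, 0] = [1. 2.]
```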
Fully Connected Network
[Diagram: x → Linear L (W1, b1) → f(n) → Linear L (W2, b2) → f(n) → Linear L (W3, b3) → f(n)]
Convolutional Layer
New Block: 2D Convolution
[Diagram: input n and parameters W, b feed a convolution block C]
Convolution Layer
[Diagram: a 32×32×3 input convolved with an Fh×Fw×C filter]

out(x, y) = b + Σ_{j=1..Fh} Σ_{k=1..Fw} Σ_{l=1..C} F[j,k,l] · I[x+j, y+k, l]
Slide credit: Karpathy and Fei-Fei
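The convolution formula above can be sketched naively in code (a minimal, unoptimized illustration with names of my own; no padding, stride 1, one filter):

```python
import numpy as np

# Naive sketch of the slide's convolution formula: the output at (x, y)
# is b plus the sum over filter height Fh, width Fw, and channels C of
# F[j,k,l] * I[x+j, y+k, l].
def conv2d_single(I, F, b):
    H, W, C = I.shape
    Fh, Fw, _ = F.shape
    out = np.zeros((H - Fh + 1, W - Fw + 1))
    for x in range(out.shape[0]):
        for y in range(out.shape[1]):
            out[x, y] = b + np.sum(F * I[x:x+Fh, y:y+Fw, :])
    return out

I = np.ones((5, 5, 3))
F = np.ones((3, 3, 3))
print(conv2d_single(I, F, 0.0).shape)  # (3, 3); each entry is 27.0
```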
Convolutional Neural Network (CNN)
[Diagram: x → Conv C (W1, b1) → f(n) → Conv C (W2, b2) → f(n) → Conv C (W3, b3) → f(n)]
Today
[Diagram: H×W×C image → CNN → 1×1×F vector]
Convert an H×W image into an F-dimensional vector (example: an image of a human, F=56)
Today’s Running Example: Classification
[Diagram: H×W×C image → CNN → 1×1×F vector]
Running example: image classification. Outputs: P(image is class #1), P(image is class #2), …, P(image is class #F)
Today’s Running Example: Classification
[Diagram: H×W×C image → CNN → outputs 0.5, 0.2, 0.1, 0.2]
“Hippo”, y_i: class #0
Loss function: −log( exp(x_{y_i}) / Σ_l exp(x_l) )
Today’s Running Example: Classification
[Diagram: H×W×C image → CNN → outputs 0.5, 0.2, 0.1, 0.2]
“Baboon”, y_i: class #3
Loss function: −log( exp(x_{y_i}) / Σ_l exp(x_l) )
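The loss above can be sketched as follows (a minimal version; the max-subtraction trick is a standard numerical-stability addition, not something on the slide):

```python
import numpy as np

# Sketch of the loss on the slide: -log( exp(x_{y_i}) / sum_l exp(x_l) ),
# i.e. softmax cross-entropy on the network's scores x.
def softmax_ce(x, y):
    x = x - np.max(x)            # subtract max for numerical stability
    p = np.exp(x) / np.sum(np.exp(x))
    return -np.log(p[y])

scores = np.array([0.5, 0.2, 0.1, 0.2])
print(softmax_ce(scores, 0))  # lowest loss of the four: class 0 has the top score
```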
Model For Your Head
[Diagram: H×W×C image → CNN → 1×1×F vector]
Optimization finds the parameters that make this work
Layer Collection
Image credit: lego.com
You can construct functions out of layers. The only requirement is the layers “fit” together. Optimization figures out what the parameters of the layers are.
Review – Pooling
Idea: just want spatial resolution of activations / images smaller; applied per-channel
Input (4×4):
1 1 2 4
5 6 7 8
3 2 1 0
1 2 3 4
Max-pool, 2×2 filter, stride 2 → output (2×2):
6 8
3 4
Slide credit: Karpathy and Fei-Fei
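The max-pool example can be sketched as follows (a minimal single-channel version with names of my choosing):

```python
import numpy as np

# Sketch of 2x2 max-pooling with stride 2, as in the slide's example.
# Pooling is applied per-channel; here we show a single channel.
def maxpool2x2(A):
    H, W = A.shape
    out = np.zeros((H // 2, W // 2), dtype=A.dtype)
    for i in range(0, H, 2):
        for j in range(0, W, 2):
            out[i // 2, j // 2] = A[i:i+2, j:j+2].max()
    return out

A = np.array([[1, 1, 2, 4],
              [5, 6, 7, 8],
              [3, 2, 1, 0],
              [1, 2, 3, 4]])
print(maxpool2x2(A))  # the 2x2 result with rows 6 8 / 3 4
```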
Other Layers – Fully Connected
1x1xC 1x1xF Map C-dimensional feature to F-dimensional feature using linear transformation W (FxC matrix) + b (Fx1 vector) How can we write this as a convolution?
Everything’s a Convolution
1×1 convolution with F filters: [Diagram: 1×1×C → 1×1×F]

General convolution:
out(x, y) = b + Σ_{j=1..Fh} Σ_{k=1..Fw} Σ_{l=1..C} F[j,k,l] · I[x+j, y+k, l]

Set Fh = 1, Fw = 1:
out(x, y) = b + Σ_{l=1..C} F[l] · I[x, y, l]
Converting to a Vector
[Diagram: H×W×C → 1×1×F] How can we do this?
Converting to a Vector* – Pool
[Diagram: H×W×C → 1×1×F]
Avg-pool with an H×W filter, stride 1: e.g., the 4×4 input
1 1 2 4
5 6 7 8
3 2 1 0
1 2 3 4
averages to 3.1
*(If F == C)
Converting to a Vector – Convolve
H×W convolution with F filters: each filter produces a single value. [Diagram: H×W×C → 1×1×F]
Looking At Networks
Networks trained to solve a 1000-way classification problem (ImageNet)
AlexNet
Input 227×227×3 → Conv1 55×55×96 → Conv2 27×27×256 → Conv3 13×13×384 → Conv4 13×13×384 → Conv5 13×13×256 → FC6 1×1×4096 → FC7 1×1×4096 → Output 1×1×1000
Each block is an H×W×C volume. You transform one volume to another with convolution.
CNN Terminology
Each entry is called an “activation”/“neuron”/“feature”
AlexNet
Input 227×227×3 → Conv1 55×55×96
11×11 filter, stride of 4: (227 − 11)/4 + 1 = 55
ReLU
AlexNet
All layers are followed by ReLU. Red layers are followed by max-pool. Early layers have “normalization”.
AlexNet – Details
Filter sizes: C:11 P:3, C:5 P:3, C:3, C:3, C:3 P:3 (C: size of conv filter, P: size of pool)
AlexNet
13x13 Input, 1x1 output. How?
Alexnet – How Many Parameters?
96 11×11 filters on a 3-channel input: 11×11×3×96 + 96 = 34,944
4096 6×6 filters on a 256-channel input (note: max pool brings 13×13 down to 6×6): 6×6×256×4096 + 4096 ≈ 38 million
4096 1×1 filters on a 4096-channel input: 1×1×4096×4096 + 4096 ≈ 17 million
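The per-layer counts worked out above can be reproduced with a tiny helper (my own sketch):

```python
# Sketch of the parameter counts worked out on the slides:
# a conv/FC layer with c_out filters of size kh x kw over c_in input
# channels has kh*kw*c_in*c_out weights plus c_out biases.
def layer_params(kh, kw, c_in, c_out):
    return kh * kw * c_in * c_out + c_out

print(layer_params(11, 11, 3, 96))     # Conv1: 34944
print(layer_params(6, 6, 256, 4096))   # FC6: 37752832 (~38 million)
print(layer_params(1, 1, 4096, 4096))  # FC7: 16781312 (~17 million)
```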
Alexnet – How Many Parameters
Not using convolutions is disastrous for performance. How long would it take you to list the parameters of Alexnet at 4s / parameter?
1 year? 4 years? 8 years? 16 years?
Dataset – ILSVRC
Challenge
Dataset – ILSVRC
Figure Credit: O. Russakovsky
Visualizing Filters
Input 227×227×3 → Conv1 55×55×96
Conv1 filters: what are their dimensions? (96 filters, each 11×11×3)
What’s Learned
First-layer filters of a network trained to distinguish 1000 categories of objects. Remember: these filters go over color.
Figure Credit: Karpathy and Fei-Fei
Visualizing Later Filters
Input 227×227×3 → Conv1 55×55×96 → Conv2 27×27×256
Conv2 filters: what are their dimensions? (256 filters, each 5×5×96)
Visualizing Later Filters
Visualizing later filters from their values is typically impossible: too many input dimensions, and it's not even clear what the input means.
Understanding Later Filters
Split the network: Conv1 through Conv5 form a CNN that extracts a 13×13×256 output; FC6, FC7, and the output layer form a 2-hidden-layer neural network on top.
Understanding Later Filters
Alternatively: everything through FC6 forms a CNN that extracts a 1×1×4096 feature, with a 1-hidden-layer NN on top.
Understanding Later Filters
A CNN (Input through Conv5) that extracts a 13×13×256 output.
Understanding Later Filters
[Diagram: two 13×13×256 activation volumes]
Feed an image in and see what score the filter gives it (a more pleasant version of a real neuroscience procedure). Which one's bigger? What image makes the output biggest?
Figure Credit: Girshick et al. CVPR 2014.
What’s Up With the White Boxes?
[Diagram: a 227×227×3 input and a 13×13×384 activation volume]
Receptive Field: due to convolution, each later layer's value depends on a region of the input (its receptive field).
Can use receptive fields to see where the network is “looking” to make its decisions
A very active area of research (lots of great work done by Bolei Zhou, MIT → CUHK)
Classic Recognition
Input 227x227 3
Classic Recognition
Input 227×227×3 → SIFT 227×227×128
Recall: can compute a descriptor based on histograms of image gradients (at each pixel).
Dense SIFT (a few layers)
Classic Recognition
Input 227×227×3 → SIFT 227×227×128 → Bag of Words H×W×#codewords
Can do bag-of-words-like techniques on SIFT, taking into consideration spatial location. Dense SIFT (a few layers)
Classic Recognition
Input 227×227×3 → Dense SIFT 227×227×128 → Bag of Words H×W×#codewords → Classifier → Output 1×1×1000
Classic vs Deep Recognition
Classic: a pipeline of hand-engineered steps. Deep: a pipeline of learned convolutions + simple operations. What are some differences? The classic steps don't talk to each other, and don't have many parameters that are learned from data.
3 Key Developments Since Alexnet
Key Idea – 3x3 Filters
3×3 filter followed by 3×3 filter → filter with a 5×5 receptive field (2 + 2 + 1 = 5)
Key Idea – 3x3 Filters
3×3 filter followed by 3×3 filter followed by 3×3 filter → filter with a 7×7 receptive field (3 + 3 + 1 = 7)
Why Does This Make A Difference?
Empirically, repeated 3x3 filters do better compared to a 7x7 filter. Why?
Key Idea – 3x3 Filters
One 7×7 filter: receptive field 7×7 pixels; parameters/channel: 49; number of ReLUs: 1.
Three stacked 3×3 filters: receptive field 7×7 pixels; parameters/channel: 3×3×3 = 27; number of ReLUs: 3.
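The comparison generalizes: k stacked 3×3 filters give a (2k+1)×(2k+1) receptive field with 9k parameters per channel. A quick sketch (the function name is mine):

```python
# Sketch of the slide's comparison: each stacked 3x3 layer grows the
# receptive field by 2 and costs 9 parameters per channel, vs one big
# filter of the same receptive field costing (2k+1)^2 parameters.
def stacked_3x3(k):
    rf = 1
    for _ in range(k):
        rf += 2            # each 3x3 layer grows the receptive field by 2
    return rf, 9 * k       # (receptive field, params per channel)

print(stacked_3x3(3))      # (7, 27): same 7x7 receptive field, 27 < 49 params
```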
We Want More Non-linearity!
VGG16
Input 224×224×3 → Conv1 224×224×64 → Conv2 112×112×128 → Conv3 56×56×256 → Conv4 28×28×512 → Conv5 14×14×512 → FC6 1×1×4096 → FC7 1×1×4096 → Output 1×1×1000
All filters are 3×3. All filters are followed by ReLU.
Training Deeper Networks
Why not just stack continuously? What will happen to gradient going back?
Backprop
Every backpropagation step multiplies the gradient by the local gradient
1 · d · d · d · … · d = d^(n−1)
What if d << 1, n big? Vanishing Gradients
Backprop
Every backpropagation step multiplies the gradient by the local gradient
1 · d · d · d · … · d = d^(n−1)
What if d >> 1, n big? Exploding Gradients
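A quick numeric sketch of both cases (my own illustration, treating every layer's local gradient as a constant d):

```python
# Multiplying n-1 local gradients d together gives d**(n-1), which
# vanishes for d < 1 and explodes for d > 1 as the network gets deep.
for d in (0.5, 1.0, 2.0):
    grad = 1.0
    for _ in range(49):    # a 50-layer chain multiplies 49 local gradients
        grad *= d
    print(d, grad)         # 0.5 -> ~1.8e-15 (vanishes), 2.0 -> ~5.6e14 (explodes)
```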
Solution 1 – Batch Normalization
[Scatter plot: data where Mean(x), Mean(y) ≠ 0; Var(x), Var(y) ≠ 0; Cov(x,y) ≠ 0]
[Scatter plot: data where Mean(x) = Mean(y) = 0; Var(x) = Var(y) = 1; Cov(x,y) = 0]
Learning algorithms work far better when data looks like the right as opposed to the left
Solution 1 – Batch Normalization
[Scatter plot: normalized data with Mean(x) = Mean(y) = 0; Var(x) = Var(y) = 1]
Idea: make layer (Batch Norm) that normalizes things going through it based on estimates of Var(xi) in each batch. Stick in between other layers
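The normalization idea can be sketched as follows (a minimal version of my own; real batch norm also learns a scale/shift and keeps running statistics for test time):

```python
import numpy as np

# Minimal sketch of the batch-norm idea on the slide: normalize each
# feature using the batch's mean and variance estimates.
def batch_norm(X, eps=1e-5):
    mu = X.mean(axis=0)            # per-feature batch mean
    var = X.var(axis=0)            # per-feature batch variance
    return (X - mu) / np.sqrt(var + eps)

X = np.random.randn(128, 4) * 10 + 5   # far from zero-mean, unit-variance
Xn = batch_norm(X)
print(Xn.mean(axis=0).round(3), Xn.std(axis=0).round(3))  # ~zeros and ~ones
```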
There exists vs. We Can Find
A deeper model can always represent what a shallower model represents (the extra layers can compute the identity), so there exists a deeper model that is no worse than the shallower model on the training data.
In practice, deeper plain networks often do worse than the shallow model. Why?
Residual Learning
[Diagram: input x goes through layers computing F(x); a skip connection adds x back, giving output x + F(x)]
New Building Block: Lets you train networks with 100s of layers.
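The block can be sketched as follows (a toy illustration with made-up weights; the point is the identity path):

```python
import numpy as np

# Sketch of the residual block: the output is x + F(x), so gradient can
# flow through the identity path even if F's gradient is tiny. F here is
# a toy two-layer transform with ReLU; the weights are arbitrary.
def relu(z):
    return np.maximum(z, 0)

def residual_block(x, W1, W2):
    return x + W2 @ relu(W1 @ x)   # skip connection adds the input back

x = np.ones(4)
W = np.zeros((4, 4))
print(residual_block(x, W, W))     # zero residual -> the output is just x
```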
Evaluating Results
At training time, we minimize: −log( exp(x_{y_i}) / Σ_l exp(x_l) )
At test time, we evaluate, given predicted class ŷ_i:
Accuracy = (1/n) Σ_{i=1..n} 1(y_i = ŷ_i)
Evaluating Many Categories
Does this image depict a cat or a dog?
Image credit: Coco dataset
To avoid penalizing ambiguous images, many challenges let you make five guesses (top-5 accuracy): your prediction is correct if the true class is among your five guesses.
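Top-1 and top-5 accuracy can be sketched together (my own helper; the scores are made-up):

```python
import numpy as np

# Sketch of top-1 vs top-5 accuracy: a prediction counts as correct
# under top-k if the true class is anywhere in the k highest scores.
def topk_accuracy(scores, labels, k=5):
    topk = np.argsort(-scores, axis=1)[:, :k]   # k best classes per image
    hits = [label in row for row, label in zip(topk, labels)]
    return np.mean(hits)

scores = np.array([[0.5, 0.2, 0.1, 0.2, 0.0, 0.0],
                   [0.1, 0.1, 0.6, 0.1, 0.1, 0.0]])
labels = np.array([3, 5])
print(topk_accuracy(scores, labels, k=1))  # 0.0: neither top score is right
print(topk_accuracy(scores, labels, k=5))  # 0.5: image 1's label is in its top 5
```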
Accuracy over the Years
Model: Top 1 Error / Top 5 Error
Best pre-deep: n/a / n/a
Alexnet: 43.5% / 20.9%
VGG-16: 28.4% / 9.6%
+Batch Norm: 26.6% / 8.5%
Resnet-152: 21.7% / 5.9%
Human*: n/a / 5.1%
A Practical Aside
GPUs are extremely fast at matrix multiplies (the card below does 13.4T flops if it's matrix multiplies). How do we express convolution as matrix multiplication by rearranging coordinates?
Training a CNN
Training a CNN from Scratch
Need to start the weights w somewhere.
A common heuristic: initialize uniformly at random, e.g., in (−1/√n, 1/√n), where n is the number of neurons feeding in.
Take-home: important, but use defaults
Training a ConvNet
If we have fewer data points than parameters, we're in trouble.
Training a CNN – Weight Decay
SGD update: w ← w − η · ∂L/∂w
+Weight decay: w ← w − λη·w − η · ∂L/∂w
What does this remind you of?
Weight decay is very similar to regularization but might not be the same for more complex optimization techniques.
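The two updates can be sketched as follows (a sketch with hypothetical names; eta is the learning rate η, weight_decay the decay coefficient λ):

```python
# Sketch of the two updates on the slide:
#   SGD:            w <- w - eta * dL/dw
#   + weight decay: w <- w - lambda*eta*w - eta * dL/dw
# Weight decay shrinks the weights toward 0 at every step, much like
# L2 regularization.
def sgd_step(w, grad, eta=0.1, weight_decay=0.0):
    return w - weight_decay * eta * w - eta * grad

print(sgd_step(2.0, grad=0.0, eta=0.25, weight_decay=0.5))  # 2 - 0.5*0.25*2 = 1.75
```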
Quick Quiz
Raise your hand if it's a hippo. (Shown: horizontal flip, color jitter, image cropping)
Training a CNN –Augmentation
Idea: generate extra training data with transformations that don't affect the meaning of the image, but you have to be careful that the transformation doesn't change the meaning of the output.
Training a CNN – Fine-tuning
Fine-Tuning: Pre-trained Features
[Diagram: convolutions that extract a 1×1×4096 feature (fixed/frozen/locked), followed by a learned Wx + b]
Surprisingly effective
Fine-Tuning: Transfer Learning
Instead of initializing from scratch, initialize from some “pre-trained” model that does something else. Extremely popular.
Fine-Tuning: Transfer Learning
Bau and Zhou et al. Network Dissection: Quantifying Interpretability of Deep Visual Representations. CVPR 2017.
Why should this work? Transferring from objects (dog) to scenes (waterfall)
Recommendations
Summary
CNNs convert an H×W×C image into a vector output (e.g., which of K classes is this image, or predict K continuous outputs). There are lots of tricks for doing this well.