SLIDE 1

Advanced Section #5: Visualization of convolutional networks and neural style transfer

AC 209B: Advanced Topics in Data Science
Javier Zazo, Pavlos Protopapas

SLIDE 2

Neural style transfer

◮ Artistic generation of high perceptual quality images that combine the style or texture of one input image with the elements or content of a different one.

Leon A. Gatys, Alexander S. Ecker, and Matthias Bethge, “A neural algorithm of artistic style,” Aug. 2015.

SLIDE 3

Lecture Outline

Visualizing convolutional networks
Image reconstruction
Texture synthesis
Neural style transfer
DeepDream

SLIDE 4

Visualizing convolutional networks

SLIDE 5

Motivation for visualization

◮ With neural networks we have little insight into their learning process and internal operations.
◮ Through visualization we may:

  • 1. Understand how input stimuli excite the individual feature maps.
  • 2. Observe the evolution of features during training.
  • 3. Make more substantiated design decisions.

SLIDE 6

Architecture

◮ Architecture similar to AlexNet [1]:

– Trained on the ImageNet 2012 training database for 1000 classes.
– Inputs are images of size 256 × 256 × 3.
– Uses convolutional layers, max-pooling, and fully connected layers at the end (a code sketch follows the references below).

[Architecture diagram: 224 × 224 × 3 input; Layer 1: 7 × 7 conv, 96 filters, stride 2 (110 × 110), then 3 × 3 max-pool, stride 2, with contrast normalization (55 × 55); Layer 2: 5 × 5 conv, 256 filters (26 × 26), then 3 × 3 max-pool, stride 2, with contrast normalization (13 × 13); Layers 3 and 4: 3 × 3 conv, 384 filters each (13 × 13); Layer 5: 3 × 3 conv, 256 filters, then 3 × 3 max-pool, stride 2 (6 × 6); Layers 6 and 7: fully connected, 4096 units each; C-class softmax output.]

[1] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton, "ImageNet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems, 2012, pp. 1097–1105.
[2] Matthew D. Zeiler and Rob Fergus, "Visualizing and understanding convolutional networks," in Computer Vision. 2014, pp. 818–833, Springer.
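A minimal PyTorch sketch of the described architecture, assuming padding and ceil-mode pooling choices that reproduce the diagram's feature-map sizes; LocalResponseNorm stands in for the contrast normalization:

```python
import torch
import torch.nn as nn

# Sketch of the Zeiler-Fergus-style network described above (assumed
# padding/ceil_mode choices; LocalResponseNorm approximates contrast norm.)
net = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=7, stride=2, padding=1), nn.ReLU(),  # 110
    nn.MaxPool2d(3, stride=2, ceil_mode=True),                        # 55
    nn.LocalResponseNorm(5),
    nn.Conv2d(96, 256, kernel_size=5, stride=2), nn.ReLU(),           # 26
    nn.MaxPool2d(3, stride=2, ceil_mode=True),                        # 13
    nn.LocalResponseNorm(5),
    nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(),         # 13
    nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(),         # 13
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),         # 13
    nn.MaxPool2d(3, stride=2, ceil_mode=True),                        # 6
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(),   # Layer 6
    nn.Linear(4096, 4096), nn.ReLU(),          # Layer 7
    nn.Linear(4096, 1000),                     # C = 1000 class scores
)
print(net(torch.randn(1, 3, 224, 224)).shape)  # torch.Size([1, 1000])
```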

SLIDE 7

Deconvolutional network

◮ For visualization, the authors employ a deconvolutional network.
◮ Objective: project hidden feature maps back into the original input space.

– Visualize the activations of a specific filter.

◮ The name “deconvolutional” network may be unfortunate, since the network does not perform any deconvolutions (next slide).

Matthew D. Zeiler, Graham W. Taylor, and Rob Fergus, "Adaptive deconvolutional networks for mid and high level feature learning," in IEEE International Conference on Computer Vision (ICCV), 2011, pp. 2018–2025.

SLIDE 8

Deconvolutional network structure

[Diagram: a deconvnet layer attached to a convnet layer. Convnet path: layer below → filtering {F} → rectified linear function → rectified feature maps → max pooling (switches recorded) → pooled maps. Deconvnet path: pooled maps → max unpooling (using switches) → unpooled maps → rectified linear function → rectified unpooled maps → filtering {F^T} → reconstruction of the layer below.]

SLIDE 9

Deconvolutional network description

◮ Unpooling (sketched below):

– The max-pooling operation is non-invertible.
– Switch variables record the locations of the maxima.
– Unpooling places the reconstructed features into the recorded locations.
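A minimal PyTorch sketch of switch-based unpooling (shapes are illustrative, not from the lecture):

```python
import torch
import torch.nn as nn

# Max-pooling can return the argmax locations ("switches"); max-unpooling
# reuses them to place values back where the maxima originally came from.
pool = nn.MaxPool2d(kernel_size=3, stride=2, return_indices=True)
unpool = nn.MaxUnpool2d(kernel_size=3, stride=2)

feature_maps = torch.randn(1, 96, 55, 55)     # e.g. layer-1 activations
pooled, switches = pool(feature_maps)         # switches: argmax locations
reconstructed = unpool(pooled, switches, output_size=feature_maps.size())
# Every non-maximal position is zero; maxima return to their recorded spots.
```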

SLIDE 10

Deconvolutional network description

◮ Rectification: signals go through a ReLU operation.
◮ Filtering:

– Uses transposed convolutions.
– Filters are flipped horizontally and vertically.

◮ The transposed convolution projects feature maps back to the input space.
◮ The transposed convolution corresponds to the backpropagation of the gradient (an analogy from MLPs), as the sketch below shows.
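A sketch of this correspondence in PyTorch (assumed shapes; illustrative only). The same weight tensor serves both the forward convolution and the projection back to input space, since conv_transpose2d applies the flipped, channel-transposed filters:

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 3, 32, 32)                 # input-space signal
w = torch.randn(16, 3, 7, 7)                  # conv filters: 16 out, 3 in
y = F.conv2d(x, w, stride=2, padding=3)       # forward pass -> 1x16x16x16
x_rec = F.conv_transpose2d(y, w, stride=2, padding=3,
                           output_padding=1)  # back to input space
# conv_transpose2d(y, w) computes exactly the gradient of conv2d(x, w)
# with respect to x, evaluated at y: the "backpropagation" analogy above.
print(x_rec.shape)                            # torch.Size([1, 3, 32, 32])
```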

SLIDE 11

Feature visualization

  • 1. Evaluate the validation database on the trained network.
  • 2. Record the nine highest activation values of each filter’s output.
  • 3. Project the recorded 9 outputs into input space for every neuron.

– When projecting, all other activation units in the given layer are set to zero.
– This ensures we only observe the gradient of a single channel.
– Switch variables are used in the unpooling layers (steps 1 and 2 are sketched below).
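An illustrative PyTorch sketch of steps 1 and 2, recording each filter's strongest activation over a batch with a forward hook (the model, layer index, and batch are stand-ins, not the lecture's setup):

```python
import torch
import torchvision.models as models

model = models.alexnet(weights=None).eval()   # untrained stand-in; use
layer = model.features[3]                     # pretrained weights in practice

records = {}

def hook(module, inputs, output):
    # output: (batch, channels, H, W); strongest activation per filter
    records["max_per_filter"] = output.flatten(2).max(dim=2).values

handle = layer.register_forward_hook(hook)
with torch.no_grad():
    model(torch.randn(8, 3, 224, 224))        # stand-in validation batch
handle.remove()
print(records["max_per_filter"].shape)        # (batch, n_filters)
```

In the actual procedure, the nine top-scoring inputs per filter are then pushed through the deconvnet, with all other channels zeroed, to obtain the input-space projections.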

SLIDE 12

First layer of AlexNet

SLIDE 13

Second layer of AlexNet

SLIDE 14

Fourth layer of AlexNet

SLIDE 15

Fifth layer of AlexNet

SLIDE 16

Feature evolution during training

◮ Evolution of features after 1, 2, 5, 10, 20, 30, 40 and 64 epochs.
◮ Strongest activation response for some random neurons at all 5 layers.
◮ Low layers converge after only a few passes.
◮ The fifth layer does not converge until a very large number of epochs.
◮ Lower layers may change their feature correspondence after converging.

SLIDE 17

Architecture comparison

◮ Check whether different architectures respond similarly or more strongly to the same inputs.
◮ The left picture used 7 × 7 filters instead of 11 × 11, and reduced the stride from 4 to 2.
◮ Evidence that there are fewer dead units in the modified network.
◮ More defined features, whereas AlexNet shows more aliasing effects.

SLIDE 18

Image reconstruction

SLIDE 19

Image reconstruction

◮ Reconstruction of an image from its latent features.
◮ Layers of the network retain an accurate photographic representation of the image, while progressively gaining geometric and photometric invariance.
◮ $a^{[l]}$ corresponds to the latent representation of layer $l$.
◮ Solve the optimization problem

$$\hat{x} = \arg\min_y \; J_C^{[l]}(x, y) + \lambda R(y), \qquad J_C^{[l]}(x, y) = \left\| a^{[l]}(G) - a^{[l]}(C) \right\|_F^2,$$

where $C$ denotes the content image $x$ and $G$ the generated image $y$.

Aravindh Mahendran and Andrea Vedaldi, "Understanding deep image representations by inverting them," Nov. 2014.

SLIDE 20

Regularization and optimization

◮ Regularization:

– $\alpha$-norm regularizer: $R_\alpha(y) = \lambda_\alpha \|y\|_\alpha^\alpha$.
– Total variation regularizer: $R_{V_\beta}(y) = \lambda_{V_\beta} \sum_{i,j,k} \left( (y_{i,j+1,k} - y_{i,j,k})^2 + (y_{i+1,j,k} - y_{i,j,k})^2 \right)^{\beta/2}$.

◮ Image reconstruction (sketched in code below):

  • 1. Initialize y with random noise.
  • 2. Feedforward pass the image.
  • 3. Compute the loss function.
  • 4. Compute gradients of the cost and backpropagate to input space.
  • 5. Update generated image G with a gradient step.
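A minimal sketch of this loop in PyTorch, using VGG16 features as a stand-in for the lecture's network; the layer index, learning rate, and regularization weight are assumed hyperparameters:

```python
import torch
import torchvision.models as models

model = models.vgg16(weights=models.VGG16_Weights.DEFAULT).features.eval()
for p in model.parameters():
    p.requires_grad_(False)

def features(img, layer=8):                 # a[l]: activations at layer l
    out = img
    for i, m in enumerate(model):
        out = m(out)
        if i == layer:
            return out

content = torch.rand(1, 3, 224, 224)        # stand-in content image
target = features(content).detach()

y = torch.rand(1, 3, 224, 224, requires_grad=True)   # 1. random-noise init
opt = torch.optim.Adam([y], lr=0.05)
for step in range(200):
    opt.zero_grad()
    loss = (features(y) - target).pow(2).sum()        # 2-3. forward + loss
    tv = ((y[:, :, 1:, :] - y[:, :, :-1, :]).pow(2).sum()
          + (y[:, :, :, 1:] - y[:, :, :, :-1]).pow(2).sum())
    (loss + 1e-4 * tv).backward()           # 4. backpropagate to input space
    opt.step()                              # 5. gradient step (TV with β = 2)
```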

SLIDE 21

Example of image reconstruction

SLIDE 22

Example of image reconstruction

SLIDE 23

Texture synthesis

SLIDE 24

Texture examples

[Figure: textures synthesized by matching feature statistics up to conv1_1, pool1, pool2, pool3 and pool4, shown next to the original texture.]

SLIDE 25

Texture synthesis using convnets

◮ Generate high perceptual quality images that imitate a given texture.
◮ Uses a convolutional network trained for object classification.
◮ Employs the correlations of features among layers as a generative process.
◮ The output of a layer $l$ is a volume of activations $a^{[l]}_{ijk}$, indexed by spatial position $(i, j)$ and channel $k$ (notation defined on the next slide).

Leon A. Gatys, Alexander S. Ecker, and Matthias Bethge, "Texture synthesis using convolutional neural networks," in Advances in Neural Information Processing Systems, 2015.

SLIDE 26

Cross-correlation of feature maps: Gram matrices

◮ Denote the output of a given filter $k$ at layer $l$ by $a^{[l]}_{ijk}$.
◮ The cross-correlation between this output and a different channel $k'$:

$$G^{[l]}_{kk'} = \sum_{i=1}^{n_H^{[l]}} \sum_{j=1}^{n_W^{[l]}} a^{[l]}_{ijk} \, a^{[l]}_{ijk'}.$$

◮ The Gram matrix collects all such correlations (code sketch below): $G^{[l]} = A^{[l]} (A^{[l]})^T$, where $(A^{[l]})^T = \big(a^{[l]}_{::1}, \dots, a^{[l]}_{::n_C^{[l]}}\big)$, i.e., the columns of $(A^{[l]})^T$ are the flattened feature maps.
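A short sketch of the Gram computation (a single image assumed, so the batch dimension is dropped):

```python
import torch

def gram_matrix(a):                  # a: (1, n_C, n_H, n_W) activations
    _, c, h, w = a.shape
    A = a.view(c, h * w)             # rows of A = flattened feature maps
    return A @ A.t()                 # (n_C, n_C) channel cross-correlations

g = gram_matrix(torch.randn(1, 64, 56, 56))
print(g.shape)                       # torch.Size([64, 64])
```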

SLIDE 27

Generating new textures

◮ To create a new texture, we synthesize an image whose feature correlations match those of the texture we want to reproduce.
◮ $G^{[l]}(S)$ refers to the Gram matrix of the style image, and $G^{[l]}(G)$ to that of the newly generated image:

$$J_S^{[l]}\big(G^{[l]}(S), G^{[l]}(G)\big) = \frac{1}{4 \big(n_W^{[l]} n_H^{[l]}\big)^2} \left\| G^{[l]}(S) - G^{[l]}(G) \right\|_F^2,$$

where $\|G\|_F = \big(\sum_{ij} g_{ij}^2\big)^{1/2}$ is the Frobenius norm.

◮ We combine all of the layer losses into a global cost function (sketched below):

$$J_S(x, y) = \sum_{l=0}^{L} \lambda_l \, J_S^{[l]}\big(G^{[l]}(S), G^{[l]}(G)\big),$$

for given weights $\lambda_0, \dots, \lambda_L$.
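A sketch of both losses, reusing gram_matrix from the previous slide (the layer weights lambdas are assumed inputs):

```python
import torch

def layer_style_loss(a_S, a_G):          # activations of style / generated
    _, c, h, w = a_S.shape
    G_S, G_G = gram_matrix(a_S), gram_matrix(a_G)
    return ((G_S - G_G) ** 2).sum() / (4.0 * (h * w) ** 2)

def style_loss(acts_S, acts_G, lambdas): # per-layer activation lists
    return sum(lam * layer_style_loss(s, g)
               for lam, s, g in zip(lambdas, acts_S, acts_G))
```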

SLIDE 28

Process description

[Figure: texture synthesis pipeline. The texture is passed through the network (conv1_1: 64 feature maps, conv2_1: 128, conv3_1: 256, conv4_1: 512, conv5_1: 512, separated by pool1 to pool4); Gram-matrix losses at each layer drive gradient descent on the input image.]

SLIDE 29

Texture examples

[Figure: textures synthesized by matching feature statistics up to conv1_1, pool1, pool2, pool3 and pool4, shown next to the original texture.]

SLIDE 30

Neural style transfer

SLIDE 31

Neural style transfer

◮ Artistic generation of high perceptual quality images that combine the style or texture of one input image with the elements or content of a different one.

Leon A. Gatys, Alexander S. Ecker, and Matthias Bethge, “A neural algorithm of artistic style,” Aug. 2015.

SLIDE 32

Other examples

SLIDE 33

Methodology

SLIDE 34

Objective function

◮ Neural style transfer combines content and style reconstruction:

$$J_{\text{total}}(x, y) = \alpha J_C^{[l]}(x, y) + \beta J_S(x, y).$$

◮ Need to choose a layer to represent content.

– Middle layers are recommended (not too shallow, not too deep) for best results.

◮ Need a set of layers to represent style.
◮ The total cost is minimized using backpropagation, as sketched below.
◮ The input y is initialized with random noise.
◮ Replacing the max-pooling layers with average pooling improves gradient flow and produces more appealing pictures.
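A compact, self-contained sketch of the whole procedure (VGG16 as a stand-in network; the layer indices, α, β, and learning rate are assumed hyperparameters):

```python
import torch
import torchvision.models as models

vgg = models.vgg16(weights=models.VGG16_Weights.DEFAULT).features.eval()
for p in vgg.parameters():
    p.requires_grad_(False)

STYLE_LAYERS = [0, 5, 10, 17, 24]         # conv1_1 ... conv5_1 in VGG16
CONTENT_LAYER = 10                        # a middle layer for content

def activations(img):
    acts, out = {}, img
    for i, m in enumerate(vgg):
        out = m(out)
        if i in STYLE_LAYERS or i == CONTENT_LAYER:
            acts[i] = out
    return acts

def gram(a):
    _, c, h, w = a.shape
    A = a.view(c, h * w)
    return A @ A.t() / (h * w)

content_img = torch.rand(1, 3, 224, 224)  # stand-ins for real images
style_img = torch.rand(1, 3, 224, 224)
a_C, a_S = activations(content_img), activations(style_img)

alpha, beta = 1.0, 1e3
y = torch.rand(1, 3, 224, 224, requires_grad=True)   # random-noise init
opt = torch.optim.Adam([y], lr=0.02)
for step in range(300):
    opt.zero_grad()
    a_G = activations(y)
    J_C = (a_G[CONTENT_LAYER] - a_C[CONTENT_LAYER]).pow(2).sum()
    J_S = sum((gram(a_G[l]) - gram(a_S[l])).pow(2).sum()
              for l in STYLE_LAYERS)
    (alpha * J_C + beta * J_S).backward()             # J_total
    opt.step()
```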

SLIDE 35

DeepDream

SLIDE 36

Art from visualization techniques

SLIDE 37

Inceptionism: Going Deeper into Neural Networks

◮ Start from a discriminatively trained classification network.

– The first layers may look for edges or corners.
– Intermediate layers interpret the basic features to look for overall shapes or components, like a door or a leaf.
– Final layers assemble those into complete interpretations: trees, buildings, etc.

◮ Turn the network upside down: what sort of image would result in "banana"?

– We need to add texture information (a prior).

https://research.googleblog.com/2015/06/inceptionism-going-deeper-into-neural.html

SLIDE 38

Class generation

SLIDE 39

Visualizing mistakes

◮ Generated dumbbells are always pictured with an arm attached.
◮ The network failed to completely distill the essence of a dumbbell.
◮ Visualization can help us correct these kinds of training mishaps.

SLIDE 40

Enhancing feature maps

◮ Instead of prescribing which feature we want the network to amplify, we can also let the network make that decision:

– Feed the network an image.
– Then pick a layer and ask the network to enhance whatever it detected (sketched below).

◮ Lower layers tend to produce strokes or simple ornament-like patterns.
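An illustrative DeepDream-style loop (the original work used an Inception network; VGG16, the layer index, and the step count here are stand-ins). Gradient ascent on the image amplifies whatever the chosen layer already detects:

```python
import torch
import torchvision.models as models

vgg = models.vgg16(weights=models.VGG16_Weights.DEFAULT).features.eval()
for p in vgg.parameters():
    p.requires_grad_(False)

img = torch.rand(1, 3, 224, 224, requires_grad=True)  # or a real photo
opt = torch.optim.Adam([img], lr=0.05)
for step in range(100):
    opt.zero_grad()
    out = img
    for i, m in enumerate(vgg):
        out = m(out)
        if i == 17:                     # assumed "dream" layer
            break
    (-out.norm()).backward()            # maximize the activation norm
    opt.step()                          # gradient *ascent* on the image
```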

SLIDE 41

Enhancing feature maps: higher layers

◮ With higher-level layers, complex features or even whole objects tend to emerge.

– These identify more sophisticated features in images.

◮ The process creates a feedback loop: if a cloud looks a little bit like a bird, the network will make it look more like a bird.
◮ If the network was trained on pictures of animals, animal-like interpretations emerge.

SLIDE 42

Enhancing features: bias

◮ Results vary quite a bit with the kind of input image, because its features bias the network towards certain interpretations.

SLIDE 43

We must go deeper: Iterations

◮ Apply the algorithm iteratively on its own outputs and apply some zooming after each iteration.
◮ We get an endless stream of new impressions.
◮ We can even start this process from a random-noise image.

SLIDE 44

Thank you!

Questions?
