SLIDE 1

Deep learning 4.5. Pooling

François Fleuret
https://fleuret.org/ee559/
Nov 2, 2020

SLIDE 2

The historical approach to compute a low-dimension signal (e.g. a few scores) from a high-dimension one (e.g. an image) was to use pooling operations. Such an operation aims at grouping several activations into a single “more meaningful” one.

François Fleuret Deep learning / 4.5. Pooling 1 / 7

SLIDE 3

The most standard type of pooling is the max-pooling, which computes max values over non-overlapping blocks. For instance in 1d with a kernel of size 2:

Input:  1 4 | 2 2 | 1 3 | 3 1
Output:  4  |  2  |  3  |  3

(Figure, built up over several animation frames: the kernel slides over the input, emitting the output values 4, 2, 3, 3 one block at a time. Input values reconstructed from the figure residue.)

The average pooling computes average values per block instead of max values.

François Fleuret Deep learning / 4.5. Pooling 2 / 7
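The 1d example above can be sketched with `torch.nn.functional` (the input values are reconstructed from the figure, so treat them as illustrative):

```python
import torch
import torch.nn.functional as F

# 1d max-pooling with a kernel of size 2 over non-overlapping blocks.
# max_pool1d expects an N x C x L tensor; here N = C = 1, L = 8.
x = torch.tensor([[[1., 4., 2., 2., 1., 3., 3., 1.]]])
y = F.max_pool1d(x, kernel_size=2)
print(y)  # tensor([[[4., 2., 3., 3.]]])
```

Each output value is the max over one block of two inputs: max(1, 4) = 4, max(2, 2) = 2, and so on. Replacing `max_pool1d` with `avg_pool1d` gives the average-pooling variant.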

SLIDE 10

(Figure, built up over several animation frames: 2d pooling applied to a C × H × W input tensor. The pooling window sweeps over non-overlapping blocks of each channel independently, producing a lower-resolution output with the same number of channels C.)

François Fleuret Deep learning / 4.5. Pooling 3 / 7
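A minimal sketch of what the figure depicts, with an arbitrarily chosen tensor size: pooling acts on each channel of a C × H × W tensor separately, so the channel count is preserved while the spatial resolution shrinks.

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 3, 4, 6)          # N=1, C=3, H=4, W=6
y = F.max_pool2d(x, kernel_size=2)   # 2x2 non-overlapping blocks
print(y.shape)                       # torch.Size([1, 3, 2, 3])

# Each channel is pooled independently: pooling channel 0 alone
# gives the same result as channel 0 of the pooled tensor.
y0 = F.max_pool2d(x[:, 0:1], kernel_size=2)
assert torch.equal(y0, y[:, 0:1])
```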

slide-25
SLIDE 25

Pooling provides invariance to any permutation inside one of the cell. More practically, it provides a pseudo-invariance to deformations that result into local translations.

Fran¸ cois Fleuret Deep learning / 4.5. Pooling 4 / 7

slide-26
SLIDE 26

Pooling provides invariance to any permutation inside one of the cell. More practically, it provides a pseudo-invariance to deformations that result into local translations. Input

Fran¸ cois Fleuret Deep learning / 4.5. Pooling 4 / 7

slide-27
SLIDE 27

Pooling provides invariance to any permutation inside one of the cell. More practically, it provides a pseudo-invariance to deformations that result into local translations. Input

Fran¸ cois Fleuret Deep learning / 4.5. Pooling 4 / 7

slide-28
SLIDE 28

Pooling provides invariance to any permutation inside one of the cell. More practically, it provides a pseudo-invariance to deformations that result into local translations. Input Output

Fran¸ cois Fleuret Deep learning / 4.5. Pooling 4 / 7

slide-29
SLIDE 29

Pooling provides invariance to any permutation inside one of the cell. More practically, it provides a pseudo-invariance to deformations that result into local translations. Input Output

Fran¸ cois Fleuret Deep learning / 4.5. Pooling 4 / 7

slide-30
SLIDE 30

Pooling provides invariance to any permutation inside one of the cell. More practically, it provides a pseudo-invariance to deformations that result into local translations. Input Output

Fran¸ cois Fleuret Deep learning / 4.5. Pooling 4 / 7
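This invariance is easy to check numerically: permuting values within a pooling cell, or shifting an activation so that it stays inside its cell, does not change the max-pooled output (a small sketch with hand-picked values, not from the slides):

```python
import torch
import torch.nn.functional as F

# Permuting values inside pooling cells does not change the output.
x = torch.tensor([[[1., 4., 2., 2., 1., 3.]]])
x_perm = torch.tensor([[[4., 1., 2., 2., 3., 1.]]])  # cells 0 and 2 permuted
y = F.max_pool1d(x, kernel_size=2)
y_perm = F.max_pool1d(x_perm, kernel_size=2)
print(y)  # tensor([[[4., 2., 3.]]])
assert torch.equal(y, y_perm)

# A translation that crosses a cell boundary, however, can change the output:
x_shift = torch.tensor([[[1., 1., 4., 2., 1., 3.]]])  # the 4 moved into cell 1
print(F.max_pool1d(x_shift, kernel_size=2))  # tensor([[[1., 4., 3.]]])
```

Hence the invariance is only a pseudo-invariance: it holds for deformations small enough to keep activations within their cells.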

SLIDE 31

F.max_pool2d(input, kernel_size, stride=None, padding=0, dilation=1, ceil_mode=False, return_indices=False)

takes as input a N × C × H × W tensor and a kernel size (h, w), or a single value k interpreted as (k, k), applies the max-pooling on each channel of each sample separately, and produces, if the padding is 0, a N × C × ⌊H/h⌋ × ⌊W/w⌋ output.

>>> x = torch.empty(1, 2, 2, 6).random_(3)
>>> x
tensor([[[[1., 2., 1., 1., 0., 2.],
          [2., 1., 1., 0., 2., 0.]],
         [[0., 2., 1., 1., 2., 2.],
          [1., 1., 1., 1., 0., 0.]]]])
>>> F.max_pool2d(x, (1, 2))
tensor([[[[2., 1., 2.],
          [2., 1., 2.]],
         [[2., 1., 2.],
          [1., 1., 0.]]]])

Similar functions implement 1d and 3d max-pooling, and average pooling.

François Fleuret Deep learning / 4.5. Pooling 5 / 7
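The ⌊H/h⌋ × ⌊W/w⌋ output size can be checked directly when the input dimensions are not multiples of the kernel size (sizes chosen here for illustration):

```python
import torch
import torch.nn.functional as F

# With padding 0, rows and columns that do not fill a complete block
# are simply dropped: 5 // 2 = 2 and 7 // 3 = 2.
x = torch.randn(1, 1, 5, 7)
y = F.max_pool2d(x, kernel_size=(2, 3))
print(y.shape)  # torch.Size([1, 1, 2, 2])
```

Setting `ceil_mode=True` instead rounds the output size up, keeping the partial blocks.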

SLIDE 34

As for convolution, pooling operations can be modulated through their stride and padding. While for convolution the default stride is 1, for pooling it is equal to the kernel size, but this is not obligatory. Default padding is zero.

François Fleuret Deep learning / 4.5. Pooling 6 / 7
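For instance, setting the stride to 1 makes the pooling windows overlap, as in a convolution (a small sketch with hand-picked values):

```python
import torch
import torch.nn.functional as F

x = torch.tensor([[[1., 4., 2., 2., 1., 3.]]])

# Default stride equals the kernel size: non-overlapping blocks.
y_default = F.max_pool1d(x, kernel_size=2)
print(y_default)  # tensor([[[4., 2., 3.]]])

# stride=1: overlapping windows, one output per input position that
# fits a full window, i.e. L - k + 1 = 5 outputs.
y_overlap = F.max_pool1d(x, kernel_size=2, stride=1)
print(y_overlap)  # tensor([[[4., 4., 2., 2., 3.]]])
```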

SLIDE 35

class torch.nn.MaxPool2d(kernel_size, stride=None, padding=0, dilation=1, return_indices=False, ceil_mode=False)

Wraps the max-pooling operation into a Module. As for convolutions, the kernel size is either a pair (h, w) or a single value k interpreted as (k, k).

François Fleuret Deep learning / 4.5. Pooling 7 / 7
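Since the Module holds the pooling parameters, it can be composed with other layers, e.g. in an `nn.Sequential` (layer sizes chosen here for illustration):

```python
import torch
from torch import nn

# A conv layer followed by max-pooling, halving the spatial resolution.
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(kernel_size=2),
)
y = model(torch.randn(1, 3, 32, 32))
print(y.shape)  # torch.Size([1, 16, 16, 16])
```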

SLIDE 36

The end