Deep-Learning: general principles + Convolutional Neural Networks - PDF document

Deep-Learning: general principles + Convolutional Neural Networks Pr. Fabien MOUTARDE Center for Robotics MINES ParisTech PSL Université Paris Fabien.Moutarde@mines-paristech.fr http://people.mines-paristech.fr/fabien.moutarde Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 1 Acknowledgements During preparation of these slides, I got inspiration and borrowed some slide content from several sources, in particular: • Yann LeCun + MA Ranzato: slides on « Deep Learning » from the corresponding course at NYU http://cilvr.cs.nyu.edu/doku.php?id=deeplearning:slides:start • Hinton+Bengio+LeCun: slides of the NIPS’2015 tutorial on Deep Learning http://www.iro.umontreal.ca/~bengioy/talks/DL-Tutorial-NIPS2015.pdf • Fei-Fei Li + A.Karpathy + J.Johnson: Stanford course lecture slides on « Convolutional Neural Networks » http://cs231n.stanford.edu/slides/winter1516_lecture7.pdf Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 2

Outline • Introduction to Deep Learning • Convolutional Neural Networks (CNN or ConvNets) – Intro + Short reminder on Neural Nets – Convolution layers & Pooling layers + global architecture – Training algorithm + Dropout Regularization • Useful pre-trained convNets • Coding frameworks • Transfer Learning • Object localization and Semantic segmentation • Deep-Learning on 1D signal and 3D data • Recent other image-based applications Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 3 Deep-Learning recent breakthroughs Very significant improvement over State-of-the-Art in Pattern Recognition / Image Semantic Analysis: • won many vision pattern recognition competitions (OCR, TSR, object categorization, facial expression ,…) • deployed in photo-tagging by Facebook, Google,Baidu ,… Similar dramatic progress in Speech recognition + Natural Language Processing (NLP) Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 4

Main application domains of Deep-Learning Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 5 Is Deep-Learning « Large-Scale »? Big and/or « Fat » data Deep-Learning: Large MODELS State-of-the-Art Convolutional Neural Networks contain > 100 layers, millions of parameters Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 6

Importance of training data! Dramatic recent progresses in image classification and visual object categorization not only due to Deep-Learning and convNets: it was made possible largely thanks to ImageNet dataset, which is a HUGE collection of labelled general-purpose images (1000 categories, > 1 million examples) Most powerful convNets have been trained on this huge dataset! Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 7 What is Deep-Learning? Learning a hierarchy of increasingly abstract representations Increasing level of abstraction Each stage ~ trainable feature transform Image recognition Pixel → edge → texton → motif → part → object Speech Sample → spectral band → … → phoneme → word Text Character → word → word group → clause → sentence → story [Figure from Goodfellow] Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 8

Importance of « features » in classical Machine-Learning Examples of hand-crafted features Control-points features Haar features HoG (Histogram of Gradients) Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 9 Deep-Learning vs. shallow Machine-Learning DL: jointly learn Shallow ML using handcrafted features classification and features Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 10

Why features should be learnt? Real data examples for a given task are usually not spreaded everywhere in input space, but rather clustered on a low-dimension « manifold » Example: Face images of 1000x1000 pixels è « raw » examples are vectors in R 1000000 !! • BUT: – position = 3 cartesian coord – orientation 3 Euler angles – 50 muscles in face – Luminosity, color è Set of all images of ONE person has ≤ 69 dim à Examples of face images of 1 person are all in a LOW-dim manifold inside a HUGE-dim space Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 11 Good features ~ « mapping » on manifold Luminosity Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 12

Features learning (before Deep-Learning) Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 13 Outline • Introduction to Deep Learning • Convolutional Neural Networks (CNN or ConvNets) – Intro + Short reminder on Neural Nets – Convolution layers & Pooling layers + global architecture – Training algorithm + Dropout Regularization • Useful pre-trained convNets • Coding frameworks • Transfer Learning • Object localization and Semantic segmentation • Deep-Learning on 1D signal and 3D data • Recent other image-based applications Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 14

Convolutional Neural Networks (CNN, or ConvNet) • Proposed in 1998 by Yann LeCun (french prof.@ NYU, now also AI research director of Facebook) • For inputs with correlated dims (2D image , 1D signal,…) • Supervised learning Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 15 ConvNets (2) • Wins most vision pattern recognition competitions (OCR, TSR, object categorization , facial expression,…) • Deployed in photo-tagging by Facebook, Google, Baidu ,… • Also used in real-time video analysis for self-driving cars Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 16

Short reminder on what is a (multi-layer) Neural Network Input Output layer X1 Y1 X2 Y2 X3 Connections with Weights Hidden layers (0, 1 or more) For “Multi - Layer Perceptron” (MLP), neurons type generally “summating with sigmoid activation” Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 17 Reminder on artificial “neurons” ACTIVATION FUNCTIONS PRINCIPLE • Threshold (Heaviside or sign) à binary neurons • Sigmoïd (logistic or tanh) O j S f à most common for MLPs e i W ij • Identity à linear neurons • ReLU (Rectified Linear Unit) æ ö n j å ç ÷ = + O f W W e ç ÷ j 0 j ij i è ø = • Saturation i 1 W 0j = "bias" • Gaussian Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 18

Why MLP directly on pixels is generally a BAD idea? Huge # of parameters, NO invariance at all Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 19 Why convolutions? For image “semantic” classification, shift-invariance of features is useful = And ANY shift-invariant & linear system can always be expressed as a CONVOLUTION : (where h [ n ] is the impulse response). Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 20

Outline • Introduction to Deep Learning • Convolutional Neural Networks (CNN or ConvNets) – Intro + Short reminder on Neural Nets – Convolution layers & Pooling layers + global architecture – Training algorithm + Dropout Regularization • Useful pre-trained convNets • Coding frameworks • Transfer Learning • Object localization and Semantic segmentation • Deep-Learning on 1D signal and 3D data • Recent other image-based applications Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 21 Convolution: sliding a 3D filter over image Non-linear activation : ) ", # = * ! ", # f= tanh, ReLU , … 5x5x3 filter At sliding position i,j ! ", # = $ + %. & '( with & '( = 5x5 image patch in 3 colors à vector of dim 75, as filter coeffs in % Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 22

Convolution in action From http://cs231n.github.io/convolutional-networks/ Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 23 Example of typical results of convolution Deep-Learning: general principles + convNets, Pr. Fabien MOUTARDE, Center for Robotics, MINES ParisTech, PSL, March 2019 24

Deep-Learning: general principles + Convolutional Neural Networks - PDF document

Deep-Learning: general principles + Convolutional Neural Networks Pr. Fabien MOUTARDE Center for Robotics MINES ParisTech PSL Universit Paris Fabien.Moutarde@mines-paristech.fr http://people.mines-paristech.fr/fabien.moutarde

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Differen'able Func'onal Programming Noel Welsh @noelwelsh underscore Goals Deep learning

DSC 102 Systems for Scalable Analytics Arun Kumar Topic 6: Deep Learning Systems 1 Outline

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC16 DEEP

Deep learning for natural language processing A short primer on deep learning Benoit Favre <

Relational Deep Learning: A Deep Latent Variable Model for Link Prediction Hao Wang, Xingjian

Medical Imaging Elisa Sayrol Medical Imaging Interest in this area in Deep Learning: DeepDeep

Deep learning Optimization and Regularization in deep networks Hamid Beigy Sharif university of

Minjie Wang Deep Learning Deep Learning trend in the past 10 years Caffe State-of-art DL

robots navigation LUKAS HFLIGER SUPERVISED BY MARIAN GEORGE 2 LUKAS HFLIGER 3 4 LUKAS

Deep Structured Learning Chunhua Shen School of Computer Science, The University of Adelaide

A PODS-based Extended Kalman Filter: Quantifying Sensing Uncertainties in Automatic Bird Species

Analysis of Ultra High Energetic Cosmic Rays measured in monocular mode with the fmuorescence

Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual Cues: image properties that

Cosmic Rays Energy Spectrum from PeV to EeV energies measured by the TALE Detector Tareq

Hawkular Metrics Metric Storage & Alerting Stefan Negrea About Me Co-Creator of Hawkular

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Deep-Learning: general principles + Convolutional Neural Networks - PDF document

Deep-Learning: general principles + Convolutional Neural Networks Pr. Fabien MOUTARDE Center for Robotics MINES ParisTech PSL Universit Paris Fabien.Moutarde@mines-paristech.fr http://people.mines-paristech.fr/fabien.moutarde

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Differen'able Func'onal Programming Noel Welsh @noelwelsh underscore Goals Deep learning

DSC 102 Systems for Scalable Analytics Arun Kumar Topic 6: Deep Learning Systems 1 Outline

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC16 DEEP

Deep learning for natural language processing A short primer on deep learning Benoit Favre &lt;

Relational Deep Learning: A Deep Latent Variable Model for Link Prediction Hao Wang, Xingjian

Medical Imaging Elisa Sayrol Medical Imaging Interest in this area in Deep Learning: DeepDeep

Deep learning Optimization and Regularization in deep networks Hamid Beigy Sharif university of

Minjie Wang Deep Learning Deep Learning trend in the past 10 years Caffe State-of-art DL

robots navigation LUKAS HFLIGER SUPERVISED BY MARIAN GEORGE 2 LUKAS HFLIGER 3 4 LUKAS

Deep Structured Learning Chunhua Shen School of Computer Science, The University of Adelaide

A PODS-based Extended Kalman Filter: Quantifying Sensing Uncertainties in Automatic Bird Species

Analysis of Ultra High Energetic Cosmic Rays measured in monocular mode with the fmuorescence

Cue combinations, Bayesian models Thurs. March 1, 2018 1 Visual Cues: image properties that

Cosmic Rays Energy Spectrum from PeV to EeV energies measured by the TALE Detector Tareq

Hawkular Metrics Metric Storage &amp; Alerting Stefan Negrea About Me Co-Creator of Hawkular

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Deep learning for natural language processing A short primer on deep learning Benoit Favre <

Hawkular Metrics Metric Storage & Alerting Stefan Negrea About Me Co-Creator of Hawkular