Day 2 Lecture 4 Imagenet Xavier Giró-i-Nieto

ImageNet ILSVRC Li Fei-Fei, “How we’re teaching computers to understand pictures” TEDTalks 2014. Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., ... & Fei-Fei, L. (2015). ImageNet large scale visual recognition challenge. arXiv preprint arXiv:1409.0575. [web]

ImageNet ILSVRC Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., ... & Fei-Fei, L. (2015). ImageNet large scale visual recognition challenge. arXiv preprint arXiv:1409.0575. [web]

ImageNet ILSVRC ● 1,000 object classes (categories). ● Images: ○ 1.2 M train ○ 100k test.

ImageNet ILSVRC ● Top-5 error rate Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., ... & Fei-Fei, L. (2015). ImageNet large scale visual recognition challenge. arXiv preprint arXiv:1409.0575. [web]
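The top-5 error rate named on this slide can be sketched as follows. This is a minimal illustration, not the official evaluation script: the function name `top5_error` and the list-of-scores input format are assumptions made for the example. An image counts as an error when its true label is not among the five classes with the highest scores.

```python
def top5_error(scores_per_image, true_labels, k=5):
    """Fraction of images whose true label is not among the k highest-scoring classes."""
    misses = 0
    for scores, label in zip(scores_per_image, true_labels):
        # Indices of the k classes with the highest scores for this image.
        top_k = sorted(range(len(scores)), key=lambda c: scores[c], reverse=True)[:k]
        if label not in top_k:
            misses += 1
    return misses / len(true_labels)


# Toy example with 6 classes (hypothetical scores): class 5 has the lowest score.
scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.0]
error = top5_error([scores, scores], [5, 0])
```

With these toy scores, the first image (true label 5) is a miss and the second (true label 0) is a hit.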

ImageNet ILSVRC Image Classification 2012 -9.8% Based on SIFT + Fisher Vectors Slide credit: Rob Fergus (NYU) Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., ... & Fei-Fei, L. (2015). ImageNet large scale visual recognition challenge. arXiv preprint arXiv:1409.0575. [web]

AlexNet (Supervision) A. Krizhevsky, I. Sutskever, G. E. Hinton, “ImageNet classification with deep convolutional neural networks”, Advances in Neural Information Processing Systems 25 (NIPS 2012). Slide credit: Junting Pan, “Visual Saliency Prediction using Deep Learning Techniques” (ETSETB-UPC 2015)

AlexNet (Supervision) Slide credit: Junting Pan, “Visual Saliency Prediction using Deep Learning Techniques” (ETSETB-UPC 2015) 8

AlexNet (Supervision) Slide credit: Junting Pan, “Visual Saliency Prediction using Deep Learning Techniques” (ETSETB-UPC 2015) 9

AlexNet (Supervision) Image credit: Deep learning Tutorial (Stanford University)

AlexNet (Supervision) Rectified Linear Unit (non-linearity): f(x) = max(0, x) Slide credit: Junting Pan, “Visual Saliency Prediction using Deep Learning Techniques” (ETSETB-UPC 2015)

AlexNet (Supervision) Dot Product Slide credit: Junting Pan, “Visual Saliency Prediction using Deep Learning Techniques” (ETSETB-UPC 2015) 14
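The two operations named on the last two slides, the dot product and the rectified linear unit f(x) = max(0, x), combine into a single neuron. The sketch below is only an illustration: the names `relu` and `neuron` and the bias term are assumptions for the example, not taken from the slides.

```python
def relu(x):
    """Rectified linear unit: f(x) = max(0, x)."""
    return max(0.0, x)


def neuron(weights, inputs, bias=0.0):
    """Dot product of weights and inputs, plus a bias, passed through ReLU."""
    pre_activation = sum(w * x for w, x in zip(weights, inputs)) + bias
    return relu(pre_activation)


positive = neuron([1.0, -2.0], [3.0, 1.0], bias=0.5)  # pre-activation 1.5 passes through
clipped = neuron([1.0, -2.0], [1.0, 3.0])             # pre-activation -5.0 is clipped to 0
```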

ImageNet ILSVRC ImageNet Classification 2013 Slide credit: Rob Fergus (NYU) Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., ... & Fei-Fei, L. (2015). ImageNet large scale visual recognition challenge. arXiv preprint arXiv:1409.0575. [web]

Zeiler-Fergus (ZF) The development of better convnets is reduced to trial-and-error. Visualization can help in proposing better architectures. Zeiler, M. D., & Fergus, R. (2014). Visualizing and understanding convolutional networks. In Computer Vision–ECCV 2014 (pp. 818-833). Springer International Publishing.

Zeiler-Fergus (ZF) “A convnet model that uses the same components (filtering, pooling) but in reverse, so instead of mapping pixels to features does the opposite.” Zeiler, Matthew D., Graham W. Taylor, and Rob Fergus. "Adaptive deconvolutional networks for mid and high level feature learning." Computer Vision (ICCV), 2011 IEEE International Conference on . IEEE, 2011. 17

Zeiler-Fergus (ZF) DeconvNet ConvNet Zeiler, M. D., & Fergus, R. (2014). Visualizing and understanding convolutional networks. In Computer Vision–ECCV 2014 (pp. 818-833). Springer International Publishing.

Zeiler-Fergus (ZF) Zeiler, M. D., & Fergus, R. (2014). Visualizing and understanding convolutional networks. In Computer Vision–ECCV 2014 (pp. 818-833). Springer International Publishing. 19

Zeiler-Fergus (ZF) Zeiler, M. D., & Fergus, R. (2014). Visualizing and understanding convolutional networks. In Computer Vision–ECCV 2014 (pp. 818-833). Springer International Publishing. 20

Zeiler-Fergus (ZF): Stride & filter size The smaller stride (2 vs 4) and filter size (7x7 vs 11x11) result in more distinctive features and fewer “dead” features. AlexNet (Layer 1) ZF (Layer 1)
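To see how stride and filter size change the first-layer feature map, the standard output-size formula for a convolution can be evaluated for both settings. This is a rough sketch: the 227-pixel input width and zero padding are assumptions chosen for illustration, and the actual networks may use different input sizes or padding.

```python
def conv_output_size(input_size, filter_size, stride, padding=0):
    """Spatial size of a convolution output: floor((W + 2P - F) / S) + 1."""
    return (input_size + 2 * padding - filter_size) // stride + 1


# Assumed 227-pixel input, no padding (illustrative values only).
alexnet_like = conv_output_size(227, filter_size=11, stride=4)
zf_like = conv_output_size(227, filter_size=7, stride=2)
```

Under these assumptions the smaller stride samples the input about twice as densely, so the first layer keeps a feature map roughly twice as wide and tall.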

Zeiler-Fergus (ZF) Cleaner features in ZF, without the aliasing artifacts caused by the stride 4 used in AlexNet. AlexNet (Layer 2) ZF (Layer 2) 22

Zeiler-Fergus (ZF): Dropout Regularization with more dropout: introduced in the input layer. Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. R. (2012). Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580.
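A minimal sketch of dropout applied to a list of activations follows. Note the assumption: this is the "inverted" variant, which rescales the surviving units during training, whereas the cited paper instead compensates at test time. The function name `dropout` and its parameters are hypothetical and chosen for this example.

```python
import random


def dropout(values, drop_prob, rng, training=True):
    """Inverted dropout: zero each value with probability drop_prob and
    scale the survivors by 1 / (1 - drop_prob). Identity at test time."""
    if not 0.0 <= drop_prob < 1.0:
        raise ValueError("drop_prob must be in [0, 1)")
    if not training or drop_prob == 0.0:
        return list(values)
    keep_prob = 1.0 - drop_prob
    return [v / keep_prob if rng.random() < keep_prob else 0.0 for v in values]


rng = random.Random(0)  # fixed seed so the example is repeatable
train_out = dropout([1.0] * 1000, drop_prob=0.5, rng=rng)
test_out = dropout([1.0, 2.0], drop_prob=0.5, rng=rng, training=False)
```

Applying the same function to the input pixels rather than to hidden activations corresponds to the "dropout in the input layer" mentioned on the slide.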

Zeiler-Fergus (ZF): Results 24

Zeiler-Fergus (ZF): Results 25

E2E: Classification: ImageNet ILSVRC ImageNet Classification 2013 -5% Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., ... & Fei-Fei, L. (2015). ImageNet large scale visual recognition challenge. arXiv preprint arXiv:1409.0575. [web]

E2E: Classification 27

E2E: Classification: GoogLeNet Movie: Inception (2010) 28

E2E: Classification: GoogLeNet ● 22 layers, but 12 times fewer parameters than AlexNet. Szegedy, Christian, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. "Going deeper with convolutions." CVPR 2015. [video] [slides] [poster]

E2E: Classification: GoogLeNet 30

E2E: Classification: GoogLeNet Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." ICLR 2014. 31

E2E: Classification: GoogLeNet Multiple scales Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." ICLR 2014. 32

E2E: Classification: GoogLeNet (NiN) 3x3 and 5x5 convolutions deal with different scales. Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." ICLR 2014. [Slides] 33

E2E: Classification: GoogLeNet Dimensionality reduction Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." ICLR 2014. 34

E2E: Classification: GoogLeNet (NiN) 1x1 convolutions perform dimensionality reduction (c3 < c2) and include rectified linear units (ReLU). Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." ICLR 2014. [Slides]
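A 1x1 convolution can be read as the same small linear map applied independently to the channel vector of every pixel, followed here by a ReLU. The sketch below assumes a nested-list layout (height x width x channels) and a hypothetical function name `conv1x1_relu`; it illustrates the idea and is not code from the cited papers.

```python
def conv1x1_relu(feature_map, weights):
    """Apply a 1x1 convolution followed by ReLU.

    feature_map: H x W x C_in nested lists.
    weights:     C_out x C_in nested lists (one row per output channel).
    Returns an H x W x C_out nested list.
    """
    return [
        [
            [max(0.0, sum(w * x for w, x in zip(row_w, pixel))) for row_w in weights]
            for pixel in row
        ]
        for row in feature_map
    ]


# One pixel with 3 input channels reduced to 2 output channels.
reduced = conv1x1_relu([[[1.0, 2.0, 3.0]]], [[1.0, 0.0, 1.0], [-1.0, 0.0, 0.0]])
```

Choosing fewer rows in `weights` than input channels gives the dimensionality reduction described on the slide.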

E2E: Classification: GoogLeNet In GoogLeNet, the Cascaded 1x1 Convolutions compute reductions before the expensive 3x3 and 5x5 convolutions. 36
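The saving from placing a 1x1 reduction before an expensive convolution can be checked with a simple count of multiplications per output position. The channel counts below (192 in, 16 after reduction, 32 out) are illustrative assumptions, and the helper name `conv_multiplies` is hypothetical.

```python
def conv_multiplies(kernel, c_in, c_out):
    """Multiplications needed per output position for a kernel x kernel convolution."""
    return kernel * kernel * c_in * c_out


# Direct 5x5 convolution from 192 to 32 channels.
direct = conv_multiplies(5, 192, 32)

# 1x1 reduction from 192 to 16 channels, then 5x5 convolution from 16 to 32.
with_reduction = conv_multiplies(1, 192, 16) + conv_multiplies(5, 16, 32)

saving_factor = direct / with_reduction
```

With these assumed channel counts the reduced path needs roughly a tenth of the multiplications of the direct 5x5 convolution.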

E2E: Classification: GoogLeNet Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." ICLR 2014. 37

E2E: Classification: GoogLeNet They provide some spatial invariance, and have proven beneficial when added as an alternative parallel path.

E2E: Classification: GoogLeNet Two softmax classifiers at intermediate layers combat the vanishing gradient while providing regularization at training time... and no fully connected layers are needed!
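One way to picture the intermediate classifiers is as extra loss terms that are added to the main loss during training only. The sketch below assumes the losses are already computed as plain numbers; the function name `total_training_loss` is hypothetical, and the default weight of 0.3 follows the value reported in the cited GoogLeNet paper, so treat it as an assumption when adapting the idea elsewhere.

```python
def total_training_loss(main_loss, aux_losses, aux_weight=0.3):
    """Main classifier loss plus down-weighted auxiliary classifier losses.

    At inference time the auxiliary classifiers are discarded, so only
    the main output is used.
    """
    return main_loss + aux_weight * sum(aux_losses)


# Hypothetical loss values for the main head and the two auxiliary heads.
loss = total_training_loss(1.0, [0.5, 0.5])
```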

E2E: Classification: GoogLeNet 40

E2E: Classification: GoogLeNet NVIDIA, “NVIDIA and IBM Cloud Support ImageNet Large Scale Visual Recognition Challenge” (2015)

E2E: Classification: GoogLeNet Szegedy, Christian, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. "Going deeper with convolutions." CVPR 2015. [video] [slides] [poster] 42

E2E: Classification: VGG Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." International Conference on Learning Representations (2015) . [video] [slides] [project] 43

E2E: Classification: VGG Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." International Conference on Learning Representations (2015) . [video] [slides] [project] 44

E2E: Classification: VGG: 3x3 Stacks Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." International Conference on Learning Representations (2015) . [video] [slides] [project] 45
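The appeal of stacking 3x3 convolutions can be checked with two small calculations: the receptive field of a stack of stride-1 layers, and the number of weights compared with a single larger filter. The sketch assumes the same number of channels in and out of every layer and ignores biases; the helper names are hypothetical.

```python
def stacked_receptive_field(num_layers, kernel=3):
    """Receptive field of num_layers stacked stride-1 convolutions."""
    return 1 + num_layers * (kernel - 1)


def conv_weights(kernel, channels):
    """Weights of one kernel x kernel convolution with equal in/out channels (no bias)."""
    return kernel * kernel * channels * channels


channels = 64  # assumed channel count, for illustration only
field_of_three = stacked_receptive_field(3)      # same coverage as one 7x7 filter
stack_cost = 3 * conv_weights(3, channels)       # 27 * C^2 weights
single_7x7_cost = conv_weights(7, channels)      # 49 * C^2 weights
```

Under these assumptions three 3x3 layers cover the same 7x7 region with fewer weights and two extra non-linearities in between.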

E2E: Classification: VGG ● No poolings between some convolutional layers. ● Convolution strides of 1 (no skipping). Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." International Conference on Learning Representations (2015) . [video] [slides] [project] 46

E2E: Classification 3.6% top-5 error... with 152 layers!

E2E: Classification: ResNet He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "Deep Residual Learning for Image Recognition." arXiv preprint arXiv:1512.03385 (2015). [slides]

E2E: Classification: ResNet ● Deeper networks (34 is deeper than 18) are more difficult to train. Thin curves: training error Bold curves: validation error He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "Deep Residual Learning for Image Recognition." arXiv preprint arXiv:1512.03385 (2015). [slides] 49
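The slide itself only notes that deeper plain networks are harder to train. The residual learning named in the cited paper's title addresses this by adding a shortcut around a group of layers, which can be sketched as below. This is an illustration under assumptions: `transform` stands in for the stacked layers, and the function name `residual_block` is hypothetical.

```python
def residual_block(x, transform):
    """Return transform(x) + x, element by element.

    The shortcut means the layers only need to learn a residual; if
    transform outputs zeros, the block passes its input through unchanged.
    """
    fx = transform(x)
    return [a + b for a, b in zip(fx, x)]


# A transform that outputs zeros leaves the input unchanged.
identity_like = residual_block([1.0, 2.0], lambda v: [0.0 for _ in v])

# A transform that doubles its input yields three times the input overall.
tripled = residual_block([1.0, 2.0], lambda v: [2.0 * e for e in v])
```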
