cs 4803 7643 deep learning
play

CS 4803 / 7643: Deep Learning Topics: (Finish) Computing Gradients - PowerPoint PPT Presentation

CS 4803 / 7643: Deep Learning Topics: (Finish) Computing Gradients Backprop in Conv Layers Forward mode vs Reverse mode AD Modern CNN Architectures Zsolt Kira Georgia Tech The architecture of LeNet5 Handwriting Recognition


  1. CS 4803 / 7643: Deep Learning Topics: – (Finish) Computing Gradients – Backprop in Conv Layers – Forward mode vs Reverse mode AD – Modern CNN Architectures Zsolt Kira Georgia Tech

  2. The architecture of LeNet5

  3. Handwriting Recognition Example

  4. Translation Invariance

  5. Some Rotation Invariance

  6. Some Scale Invariance

  7. Case Studies • There are several generations of ConvNets – 2012 – 2014: AlexNet, ZNet, VGGNet • Conv-Relu, Pooling, Fully connected, Softmax • Deeper ones (VGGNet) tend to do better – 2014 • Fully-convolutional networks for semantic segmentation • Matrix outputs rather than just one probability distribution – 2014-2016 • Fully-convolutional networks for classification • Less parameters, faster than comparable Gen1 networks • GoogleNet, ResNet – 2014-2016 • Detection layers (proposals) • Caption generation (combine with RNNs for language)

  8. An Aside

  9. AlexNet: 60M params ZNet: 75M VGG: 138M GoogleNet: 5M

  10. Importance of Depth • After a while, adding depth decreases performance • At first, vanishing/exploding gradients • normalized initialization • Batch normalization • 2 nd order methods • Then, optimization limitation – Deeper network should be able to mimic shallow ones

  11. Localization and Detection

  12. Computer Vision Tasks

  13. Computer Vision Tasks

  14. Classification + Localization

  15. CLS - ImageNet

  16. Idea 1: Localization as Regression

  17. Per-Class vs. Class Agnostic

  18. Where to attach?

  19. Multiple Objects

  20. Human Pose Estimation

  21. Sliding Window: Overfeat

  22. Sliding Window: Overfeat

  23. Sliding Window: Overfeat

  24. Sliding Window: Overfeat

  25. Sliding Window: Overfeat

  26. Sliding Window: Overfeat

  27. Sliding Window: Overfeat Why aren’t boxes across grid?

  28. Sliding Window: Overfeat

  29. Detection as Classification

  30. Detection as Classification

  31. Detection as Classification

  32. Detection as Classification

  33. Detection as Classification

  34. R-CNN

  35. Region of Interest (ROI) Pooling

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend