- 8. More Tasks in Computer
Vision
CS 519 Deep Learning, Winter 2018 Fuxin Li
With materials from Zsolt Kira, Roger Grosse, Nitish Srivastava
8. More Tasks in Computer Vision CS 519 Deep Learning, Winter 2018 - - PowerPoint PPT Presentation
8. More Tasks in Computer Vision CS 519 Deep Learning, Winter 2018 Fuxin Li With materials from Zsolt Kira, Roger Grosse, Nitish Srivastava Image Classification History Caltech datasets: Caltech-256 Caltech-101: 3,030 images in 101
CS 519 Deep Learning, Winter 2018 Fuxin Li
With materials from Zsolt Kira, Roger Grosse, Nitish Srivastava
categories
Caltech-256
152-level Conv Net (covered later) 3.6% (VGG) (AlexNet)
224 x 224 224 x 224 112 x 112 56 x 56 28 x 28 14 x 14 7 x 7 Airplane Dog Car SUV Minivan Sign Pole β¦β¦
(Simonyan and Zisserman 2014)
π Οπ β log π(π§ = π§π|π¦π)
β log π(π§ = π|π¦) = βπβ€ππ + log ΰ·
π
ππβ€ππ
current scene as input
Mnih et al. Playing Atari with Deep Reinforcement Learning
playing system
Tian and Zhu. arXiv 0511:06410
boxes inside the image and classification in the box
Horse Person Horse Person
Image Category Label Object Label
Obj 1 Obj 2 Obj 3 Obj 4
Segment-based Framework
14
512 7 7 Convolve 7x7 filters 4096 7 7
in the βencoderβ CNN
to increase resolution
Some Conv result Un-max-pooling Deconvolution Un-max-pooling Deconvolution Un-max-pooling Deconvolution Un-max-pooling Deconvolution
conv layers and deconv layers with the same resolution
precision and helps at boundaries (low-level information)
network
layer
layer
classes
network