KEMAL ÇİZMECİLER (08.03.2016)
1
KEMAL ZMECLER (08.03.2016) 1 PRESENTATION TOPIC OBJECT DETECTORS - - PowerPoint PPT Presentation
KEMAL ZMECLER (08.03.2016) 1 PRESENTATION TOPIC OBJECT DETECTORS EMERGE IN DEEP SCENE CNNS Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba International Conference on Learning Representations,2015. 2 OUTLINE
1
2
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba International Conference on Learning Representations,2015.
3
4
ImageNet CNN for Object Classification
Same architecture: AlexNet Places CNN for Scene Classification
Slide credit : Bolei Zhou
6
Figure from Fischler & Elschlager (1973)
Slide credit : Bolei Zhou
Zeiler, M. et al. Visualizing and Understanding Convolutional Networks,ECCV 2014.
Deconvolution
Simonyan, K. et al. Deep inside convolutional networks: Visualising image classification models and saliency maps. ICLR workshop, 2014 Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accu-rate object detection and semantic segmentation. CVPR 2014
Back-propagation Strong activation image
Slide credit : Bolei Zhou
Matthew D. Zeiler , Rob Fergus Visualizing and Understanding Convolutional Networks
Conv2 Conv3 Conv4 Pool5 Conv1
Slide credit : Bolei Zhou http://people.csail.mit.edu/torralba/research/drawCNN/drawNet.html
12
13
14
15
16
17
slide by Derek Hoiem
Perez, Patrick, Gangnet, Michel, and Blake, Andrew.Poisson image editing. ACM Trans.Graph., 2003
20
21
22
200K images from scene centric Sun database + object-centric ImageNet
Estimated receptive fields
pool1
Actual size of RF is much smaller than the theoretic size
conv3 pool5
Segmentation using the RF of Units (Highlight the regions within the RF that have the highest value in the feature map. )
More semantically meaningful
24
Top ranked segmented images are cropped and sent to Amazon Turk for annotation.
Pool5, unit 76; Label: ocean; Type: scene; Precision: 93%
Pool5, unit 77; Label:legs; Type: object part; Precision: 96%
Pool5, unit 22; Label: dinner table; Type: scene; Precision: 60%
31
Slide credit : Bolei Zhou
33
35
36
Correlation:0.53 Correlation:0.84
39
40
41