Depth Prediction and RGBD Images for Recognition Yihui He, Metehan - - PowerPoint PPT Presentation

depth prediction and rgbd images for recognition
SMART_READER_LITE
LIVE PREVIEW

Depth Prediction and RGBD Images for Recognition Yihui He, Metehan - - PowerPoint PPT Presentation

Depth Prediction and RGBD Images for Recognition Yihui He, Metehan Ozten yihuihe@foxmail.com, m ozten@umail.ucsb.edu May 25, 2016 Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition Related work and motivation Yihui He,


slide-1
SLIDE 1

Depth Prediction and RGBD Images for Recognition

Yihui He, Metehan Ozten

yihuihe@foxmail.com, m ozten@umail.ucsb.edu

May 25, 2016

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-2
SLIDE 2

Related work and motivation

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-3
SLIDE 3
  • verview
  • ur project: depth estimation & Classification on RGBD images

implement previous work Go further (2) Build a RGBD CIFAR10 based on indoor depth knowledge (3) Compare RGBD and RGB label = f (RGBD) label = f (RGB)

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-4
SLIDE 4

first part: implement previous work

infer depth from RGB image

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-5
SLIDE 5

Infer depth from RGB image: Loss defination

At training time, we combine two objective function1

1 regress to groud truth depth image(Kinect, PrimeSense)

Σp(yp − ˆ yp)2, p stands for pixel.

2 Similarity between superpixels. Rpq = K k=1 βkS(k) pq

β is trainable weight. S is similarity function.

1Fayao Liu, Chunhua Shen, and Guosheng Lin. “Deep convolutional neural

fields for depth estimation from a single image”. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015,

  • pp. 5162–5170.

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-6
SLIDE 6

Architecture: Deep convolutional Neural Field

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-7
SLIDE 7

Infer depth from RGB image: Supervised part

using traditional CNN.

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-8
SLIDE 8

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-9
SLIDE 9

Compare performance with original paper

Method Error Accuracy (lower is better) (higher is better) rel log10 rms δ < 1.25 δ < 1.252 δ < 1.253 Our implementation 0.252 0.103 0.860 0.544 0.861 0.943 Original paper 0.230 0.095 0.824 0.614 0.883 0.971

Table: Sanity check (Bold is better)

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-10
SLIDE 10

second part: go further

Classification on RGBD images

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-11
SLIDE 11

build RGBD CIFAR dataset

32x32x3 400x400x3 400x400x1 32x32x4

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-12
SLIDE 12

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-13
SLIDE 13

architecture

32x32x4

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-14
SLIDE 14

R vs G vs B vs D: training time

20 40 60 80 100 120 140 160 180 Epoch 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 training accuracy Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-15
SLIDE 15

R vs G vs B vs D: testing time

20 40 60 80 100 120 140 160 180 Epoch 0.20 0.25 0.30 0.35 0.40 0.45 0.50 validation accuracy Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-16
SLIDE 16

RGBD vs RGB: training time

50 100 150 200 250 Epoch 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 training accuracy Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-17
SLIDE 17

RGBD vs RGB: testing time

50 100 150 200 250 Epoch 0.2 0.3 0.4 0.5 0.6 val accuracy Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-18
SLIDE 18

architecture

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-19
SLIDE 19

results

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-20
SLIDE 20
  • ur contribution

1 reproduce previous work on depth estimation 2 create the first RGBD CIFAR10 dataset 3 define a new metric for depth prediction problem 4 prove that depth channel has a better feature representation 5 show that training on RGBD images can somehow improve

accuracy

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition

slide-21
SLIDE 21

questions?2

2code, references, report and slides can be access here:

https://github.com/yihui-he/Depth-estimation-with-neural-network

Yihui He, Metehan Ozten Depth Prediction and RGBD Images for Recognition