Monocular Depth Estimation Using Atrous Convolutions Group 5 - - - PowerPoint PPT Presentation

monocular depth estimation using atrous convolutions
SMART_READER_LITE
LIVE PREVIEW

Monocular Depth Estimation Using Atrous Convolutions Group 5 - - - PowerPoint PPT Presentation

Monocular Depth Estimation Using Atrous Convolutions Group 5 - Faraz Saeedan Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019 Technische Universitt Darmstadt Introduction Experiments Results Discussion Conclusion 1 Fabian


slide-1
SLIDE 1

Monocular Depth Estimation Using Atrous Convolutions

Group 5 - Faraz Saeedan Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

slide-2
SLIDE 2

Technische Universität Darmstadt

Introduction Experiments Results Discussion Conclusion

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

1

slide-3
SLIDE 3

Technische Universität Darmstadt

Monocular Depth Estimation

◮ Monodepth: Estimate depth from a single image at test-time (instead of stereo pair) ◮ Existing approaches treat this as a supervised regression problem ◮ Godard et al., CVPR 2017: Depth estimation as a stereo reconstruction problem

Figure: Godard et al. 2017

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

2

slide-4
SLIDE 4

Technische Universität Darmstadt

Atrous Convolutions

Figure: Chen et al., 2017

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

3

slide-5
SLIDE 5

Technische Universität Darmstadt

Project Objective

◮ Get Monodepth baseline to run ◮ Design and implement encoder-decoder network using atrous convolutions ◮ Consider memory and runtime in architectural decisions ◮ Quantitative comparison of proposed architectures to baseline

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

4

slide-6
SLIDE 6

Technische Universität Darmstadt

Our Architecture with ASPP

1x1 Conv 3x3 Conv rate=6 3x3 Conv rate=12 3x3 Conv rate=18 Image Pooling

Atrous Spatial Pyramid Pooling

Block 1 Block 2 Block 3 Block 4 Output Stride 4 8 16 16 Conv1 + Pool1 Concat + 1x1 Conv

Prediction Input Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

5

slide-7
SLIDE 7

Technische Universität Darmstadt

Remember last time? Sky Artifacts

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

6

slide-8
SLIDE 8

Technische Universität Darmstadt

How we solved it: Reimplementation

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

7

slide-9
SLIDE 9

Technische Universität Darmstadt

How we solved it: Reimplementation

Differences of their implementation compared to standard ResNet: ◮ No batch-normalization ◮ Nearest-neighbor instead of bilinear interpolation ◮ Order of convolution strides in ResNet switched + Fixed bug in author’s implementation of loss function

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

7

slide-10
SLIDE 10

Technische Universität Darmstadt

New Disparity Maps

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

8

slide-11
SLIDE 11

Technische Universität Darmstadt

General Experimental Setup

◮ General setup

◮ Batch Size 16 ◮ Learning Rate 2e-4 ◮ 50 epochs

◮ KITTI for training

◮ Rescaled to 256 x 512 px

◮ KITTI Stereo 2015 for testing

◮ Sparse ground truth Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

9

slide-12
SLIDE 12

Technische Universität Darmstadt

Output Strides

Issues

Output stride 64 with Atrous rate > 1

Convolution Kernel

F e a t u r e s a f t e r E n c

  • d

e r

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

10

slide-13
SLIDE 13

Technische Universität Darmstadt

Output Strides

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

11

slide-14
SLIDE 14

Technische Universität Darmstadt

Output Strides

Stride

  • Abs. Rel

Runtime Memory Params (M) 64 0.1120 NaN 6115MiB 58.4 32 0.1048 NaN 8871MiB 58.4 16 0.1041 NaN 8883MiB 58.4 8 NaN NaN 10609MiB 58.4

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

11

slide-15
SLIDE 15

Technische Universität Darmstadt

ASPP vs. no ASPP (with Skip Connections)

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

12

slide-16
SLIDE 16

Technische Universität Darmstadt

ASPP vs. no ASPP (without Skip Connections)

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

13

slide-17
SLIDE 17

Technische Universität Darmstadt

Experiments: Atrous Rates

1x1 Conv 3x3 Conv rate=6 3x3 Conv rate=12 3x3 Conv rate=18 Image Pooling

Atrous Spatial Pyramid Pooling

0.095 0.100 0.105 0.110

Abs.Rel.

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

14

slide-18
SLIDE 18

Technische Universität Darmstadt

Experiments: Atrous Rates

1x1 Conv 3x3 Conv rate=6 3x3 Conv rate=12 3x3 Conv rate=18 Image Pooling

Atrous Spatial Pyramid Pooling

0.095 0.100 0.105 0.110

Abs.Rel.

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

14

slide-19
SLIDE 19

Technische Universität Darmstadt

Experiments: Different number of modules

1x1 Conv 3x3 Conv rate=1 3x3 Conv rate=1 Image Pooling

Atrous Spatial Pyramid Pooling

0.095 0.100 0.105 0.110

Abs.Rel.

Baseline 1 2 3 4 5

ASPP modules

10 20 30 40 50 60 70

Million Parameters

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

15

slide-20
SLIDE 20

Technische Universität Darmstadt

Experiments: Different number of modules

1x1 Conv 3x3 Conv rate=1 3x3 Conv rate=1 Image Pooling

Atrous Spatial Pyramid Pooling

0.095 0.100 0.105 0.110

Abs.Rel.

Baseline 1 2 3 4 5

ASPP modules

10 20 30 40 50 60 70

Million Parameters

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

15

slide-21
SLIDE 21

Technische Universität Darmstadt

Experiments: Atrous Convolutions in Encoder

Block 1 Block 2 Block 3 Block 4 Conv1 + Pool1 1x1 Conv 3x3 Conv rate=6 3x3 Conv rate=12 3x3 Conv rate=18 Image Pooling

Atrous Spatial Pyramid Pooling Input

No improvement with atrous convolutions in ResNet blocks

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

16

slide-22
SLIDE 22

Technische Universität Darmstadt

Experiments: ASPP in ResNet

Block 1 Block 2 Conv1 + Pool1

1x1 Conv 3x3 Conv rate=6 3x3 Conv rate=12 3x3 Conv rate=18 Image Pooling

Atrous Spatial Pyramid Pooling

Input

.....

No improvement with ASPP module between ResNet blocks

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

17

slide-23
SLIDE 23

Technische Universität Darmstadt

Final Results

Pretrained on CityScapes & finetuned on KITTI

Output-stride ASPP Abs.Rel. Sq.Rel. RMSE Log RMSE a1 a2 a3 Params (M) ∆ Abs. Rel. (%) 64 - Godard

  • 0.0970

0.8960 5.093 0.176 0.962 0.962 0.986 58.4 – 16 - Ours 1-1-1-1 0.0927 0.8132 4.865 0.168 0.888 0.967 0.987 58.4 4.43 32 - Ours 1 0.0941 0.8196 4.910 0.173 0.882 0.963 0.986 44.1 2.99 32 - Ours 1-2-3-4 0.0936 0.8281 4.941 0.172 0.884 0.963 0.987 58.4 3.50

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

18

slide-24
SLIDE 24

Technische Universität Darmstadt

Discussion

◮ Loss hyperparameters ◮ Learning rate ◮ Output stride 8 ◮ Decoder architecture ◮ Robustness of predictions

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

19

slide-25
SLIDE 25

Technische Universität Darmstadt

Recap

◮ Reproduced baseline ◮ Experiments

◮ Output stride ◮ Skip connections ◮ ASPP rates ◮ Number of ASPP modules Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

20

slide-26
SLIDE 26

Technische Universität Darmstadt

Conclusion

◮ Atrous convolutions . . .

◮ do not improve monocular depth estimation ◮ need a decreased output stride, which harms runtime and memory

◮ Channel reduction after encoder . . .

◮ decreases parameter count and improves runtime ◮ without losing predictive power Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

21

slide-27
SLIDE 27

Technische Universität Darmstadt

References

◮ C. Godard, O. Aodha and G. Brostow: Unsupervised Monocular Depth Estimation with Left- Right Consistency, CVPR 2017. ◮ L. Chen, Y. Zhu, G. Papandreou, F. Schroff, H. Adam: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. arXiv 2018. ◮ M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth, B. Schiele: The cityscapes dataset for semantic urban scene understanding. CVPR. 2016. ◮ M. Menze and A. Geiger: Object Scene Flow for Autonomous

  • Vehicles. CVPR. 2015.

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

22

slide-28
SLIDE 28

Technische Universität Darmstadt

Experiments: Atrous Rates (Visual Comparison)

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

23

slide-29
SLIDE 29

Technische Universität Darmstadt

Experiments: Atrous Rates (Visual Comparison)

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

24

slide-30
SLIDE 30

Technische Universität Darmstadt

Experiments: Atrous Rates (Visual Comparison)

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

25

slide-31
SLIDE 31

Technische Universität Darmstadt

Experiments: Atrous Rates (Visual Comparison)

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

26

slide-32
SLIDE 32

Technische Universität Darmstadt

Experiments: Atrous Rates (Squared relative error)

0.90 0.95 1.00 1.05 1.10

Sq.Rel.

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

27

slide-33
SLIDE 33

Technische Universität Darmstadt

Experiments: Atrous Rates (RMSE)

4.8 5.0 5.2 5.4 5.6

RMSE

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

28

slide-34
SLIDE 34

Technische Universität Darmstadt

Experiments: Atrous Rates (RMSE of log)

log.pdf log.pdf

0.17 0.18 0.19 0.20

RMSE log

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

29

slide-35
SLIDE 35

Technische Universität Darmstadt

Experiments: Atrous Rates (% inliers 1)

0.78 0.80 0.82 0.84 0.86 0.88 0.90

<1.25

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

30

slide-36
SLIDE 36

Technische Universität Darmstadt

Experiments: Atrous Rates (% inliers 2)

0.875 0.900 0.925 0.950 0.975 1.000

<1.25^2

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

31

slide-37
SLIDE 37

Technische Universität Darmstadt

Experiments: Atrous Rates (% inliers 3)

0.900 0.925 0.950 0.975 1.000 1.025

<1.25^3

stride 32 stride 16 Baseline 1,1,1,1 1,2,3,4 1,3,5,7

ASPP rates

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

32

slide-38
SLIDE 38

Technische Universität Darmstadt

Experiments: Atrous Rates (Squared relative error)

0.90 0.95 1.00 1.05 1.10

Sq.Rel.

Baseline 1 2 3 4 5

ASPP modules

10 20 30 40 50 60 70

Million Parameters

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

33

slide-39
SLIDE 39

Technische Universität Darmstadt

Experiments: Atrous Rates (RMSE)

4.8 5.0 5.2 5.4 5.6

RMSE

Baseline 1 2 3 4 5

ASPP modules

10 20 30 40 50 60 70

Million Parameters

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

34

slide-40
SLIDE 40

Technische Universität Darmstadt

Experiments: Atrous Rates (RMSE of log)

log.pdf log.pdf

0.17 0.18 0.19 0.20

RMSE log

Baseline 1 2 3 4 5

ASPP modules

10 20 30 40 50 60 70

Million Parameters

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

35

slide-41
SLIDE 41

Technische Universität Darmstadt

Experiments: Atrous Rates (% inliers 1)

figures/results/experiment2_d<125.pdf

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

36

slide-42
SLIDE 42

Technische Universität Darmstadt

Experiments: Atrous Rates (% inliers 2)

figures/results/experiment2_d<125^2.pdf

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

37

slide-43
SLIDE 43

Technische Universität Darmstadt

Experiments: Atrous Rates (% inliers 3)

figures/results/experiment2_d<125^3.pdf

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

38

slide-44
SLIDE 44

Technische Universität Darmstadt

Experiments: Output Stride Comparison

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

39

slide-45
SLIDE 45

Technische Universität Darmstadt

Experiments: Output Stride Comparison

Fabian Kessler, Dominik Straub, Steven Lang February 15, 2019

40