Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing - PowerPoint PPT Presentation

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo et.al. (UofT) Stereo Matching 1 / 6

Stereo Estimation Desired Properties: Good enough to detect obstacles precisely Fast: real time Robust to: ◮ Saturation ◮ Shadows ◮ Repetitive patterns ◮ Specularities ◮ etc Can we leverage deep learning to do stereo estimation? W. Luo et.al. (UofT) Stereo Matching 2 / 6

Current Deep Learning Approaches Current approaches use a siamese network Combine the two branches via concatenation follow by further processing Treat the problem as classification (i.e., given a left and right image patches, are they a true match?) [J. Zbontar and Y. LeCun, CVPR15] W. Luo et.al. (UofT) Stereo Matching 3 / 6

Current Deep Learning Approaches Current approaches use a siamese network Combine the two branches via concatenation follow by further processing Treat the problem as classification (i.e., given a left and right image patches, are they a true match?) ◮ Too slow: 1 minute of computation on the GPU for KITTI! ◮ Matching not great, as scores are not correlated for different disparities [J. Zbontar and Y. LeCun, CVPR15] W. Luo et.al. (UofT) Stereo Matching 3 / 6

Stereo Estimation We propose a siamese architecture with a simple product layer, which is much faster (i.e., less than 1s in GPU) Train the network with multi-class loss so that the scores are calibrated, incorporating context information p i ( y i ) ⊙ Inner product Patch representation Left image patches Right image patches (Architecture) (Learning) W. Luo et.al. (UofT) Stereo Matching 4 / 6

Quantitative Results on KITTI 2015 Our approach produces much more accurate matches, 2-orders of magnitude faster than competing approaches [Zbontar & LeCun, CVPR 2015] > 2 pixel > 3 pixel > 4 pixel > 5 pixel End-Point Runtime(s) Non-Occ All Non-Occ All Non-Occ All Non-Occ All Non-Occ All MC-CNN-acrt 15.20 16.83 12.45 14.12 11.04 12.72 10.13 11.80 4.01 px 4.66 px 22.76 Ours(37) 1.84 px 2.56 px 0.34 9.96 11.67 7.23 8.97 5.89 7.62 5.04 6.78 W. Luo et.al. (UofT) Stereo Matching 5 / 6

Quantitative Results on KITTI 2015 Our approach produces much more accurate matches, 2-orders of magnitude faster than competing approaches [Zbontar & LeCun, CVPR 2015] > 2 pixel > 3 pixel > 4 pixel > 5 pixel End-Point Runtime(s) Non-Occ All Non-Occ All Non-Occ All Non-Occ All Non-Occ All MC-CNN-acrt 15.20 16.83 12.45 14.12 11.04 12.72 10.13 11.80 4.01 px 4.66 px 22.76 Ours(37) 1.84 px 2.56 px 0.34 9.96 11.67 7.23 8.97 5.89 7.62 5.04 6.78 To be competitive this methods require cost-aggregation, semi-global matching follow by sophisticated post processing All/All All/Est Noc/All Noc/Est Runtime D1-bg D1-fg D1-all D1-bg D1-fg D1-all D1-bg D1-fg D1-all D1-bg D1-fg D1-all (s) MBM 4.69 13.05 6.08 4.69 13.05 6.08 4.33 12.12 5.61 4.33 12.12 5.61 0.13 SPS-St 3.84 12.67 5.31 3.84 12.67 5.31 3.50 11.61 4.84 3.50 11.61 4.84 2 MC-CNN 2.89 8.88 3.89 2.89 8.88 3.88 2.48 7.64 3.33 2.48 7.64 3.33 67 Displets v2 3.00 5.56 3.43 3.00 5.56 3.43 2.73 4.95 3.09 2.73 4.95 3.09 265 Ours(37) 3.73 8.58 4.54 3.73 8.58 4.54 3.32 7.44 4.00 3.32 7.44 4.00 1 W. Luo et.al. (UofT) Stereo Matching 5 / 6

Qualitative Results on KITTI 2015 Our code is available at: http://www.cs.toronto.edu/deepLowLevelVision/ W. Luo et.al. (UofT) Stereo Matching 6 / 6

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing - PowerPoint PPT Presentation

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo et.al. (UofT) Stereo Matching 1 / 6 Stereo Estimation Desired Properties: Good enough to detect obstacles precisely Fast: real time Robust to:

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Stereo Matching Wei-Chih Tu ( ) National Taiwan University Fall 2018 Stereo Matching

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images

Simple but Effective Tree Structures for Dynamic Programming-Based Stereo Matching Michael Bleyer

Stereo Matching Shao-Yi Chien Department of Electrical Engineering National Taiwan

Lecture 19: Motion Sparse stereo matching Indexing scenes Indexing scenes Tuesday, Nov

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

AST2016 Dang slides Data November 2016 CITATIONS READS 0 10 4 authors , including: Khanh N.

DiaSys: On-Chip Trace Analysis for Multi-Processor System-on-Chip Philipp Wagner, Thomas Wild,

Transactor-based debugging of massively parallel processor array architectures Markus Blocherer,

Speed And Accuracy Dilemma In NoC Simulation: What About Memory Impact? Manuel Selva Abdoulaye

E40M Instrumentation Amps and Noise M. Horowitz, J. Plummer, R. Howe 1 ECG Lab - Electrical

Towards Understanding the Importance of Noise in Training Neural Networks Mo Zhou , Tianyi Liu

Direct Electron Detectors not just the latest new toy: game

PCFGs: Parsing & Evaluation Deep Processing Techniques for NLP Ling 571 January 23, 2017

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing - PowerPoint PPT Presentation

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo et.al. (UofT) Stereo Matching 1 / 6 Stereo Estimation Desired Properties: Good enough to detect obstacles precisely Fast: real time Robust to:

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

CS 4495 Computer Vision Stereo: Disparity and Matching Aaron Bobick School of Interactive

Stereo Matching Wei-Chih Tu ( ) National Taiwan University Fall 2018 Stereo Matching

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images

Simple but Effective Tree Structures for Dynamic Programming-Based Stereo Matching Michael Bleyer

Stereo Matching Shao-Yi Chien Department of Electrical Engineering National Taiwan

Lecture 19: Motion Sparse stereo matching Indexing scenes Indexing scenes Tuesday, Nov

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

AST2016 Dang slides Data November 2016 CITATIONS READS 0 10 4 authors , including: Khanh N.

DiaSys: On-Chip Trace Analysis for Multi-Processor System-on-Chip Philipp Wagner, Thomas Wild,

Transactor-based debugging of massively parallel processor array architectures Markus Blocherer,

Speed And Accuracy Dilemma In NoC Simulation: What About Memory Impact? Manuel Selva Abdoulaye

E40M Instrumentation Amps and Noise M. Horowitz, J. Plummer, R. Howe 1 ECG Lab - Electrical

Towards Understanding the Importance of Noise in Training Neural Networks Mo Zhou , Tianyi Liu

Direct Electron Detectors not just the latest new toy: game

PCFGs: Parsing &amp; Evaluation Deep Processing Techniques for NLP Ling 571 January 23, 2017

PCFGs: Parsing & Evaluation Deep Processing Techniques for NLP Ling 571 January 23, 2017