

SLIDE 1

Depth from a Light Field Image with Learning-based Matching Costs

Finalist of the Depth Estimation Challenge at LF4CV & Submitted to IEEE TPAMI (under review) Hae-Gon Jeon¹, Jaesik Park², Gyeongmin Choe¹, Jinsun Park¹ Yunsu Bok³, Yu-Wing Tai⁴, In So Kweon¹ ¹KAIST ²Intel labs ³ETRI ⁴Tencent

SLIDE 2

Goal of the Proposed Method

Problem 1: Severe vignetting. Problem 2: Severe noise.

  • 1. Hard to find accurate correspondences under radiometric distortion and severe noise

 Use various hand-crafted matching costs

  • 2. Which one is the correct matching cost?

 Predict the correct matching cost using two random forests

  • 3. Does it work well on real-world light-field images?

 Generate a realistic dataset based on the imaging pipeline of the Lytro camera

SLIDE 3

Overview of the Proposed Method

  • 1. Realistic Light-Field Image Generation:

Emulating the imaging pipeline of the Lytro camera

  • 2. Making Cost Volumes using Phase Shift:

Overcoming the inherent degradation of light-field images caused by the lenslet array, using four matching costs (SAD, GRAD, Census, ZNCC)

  • 3. Random Forest 1 - Classification:

Selecting dominant matching costs

  • 4. Random Forest 2 - Regression:

Predicting a disparity value with sub-pixel precision

SLIDE 4

Data Generation - Vignetting Map

[Figure: a white-plane image; the vignetting map from averaged white-plane images; noise-free multi-view images]
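The vignetting-map step can be sketched in a few lines of NumPy (function names are hypothetical; assumes the white-plane captures are already available as float arrays):

```python
import numpy as np

def estimate_vignetting_map(white_images):
    """Average several white-plane captures and normalize by the
    brightest pixel, so the map is 1.0 where there is no falloff."""
    avg = np.mean(np.stack(white_images, axis=0), axis=0)
    return avg / avg.max()

def apply_vignetting(image, vmap):
    """Attenuate a clean (noise-free) view with the vignetting map."""
    return image * vmap
```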

SLIDE 5

Data Generation - Lenslet Image Generation

Apply the vignetting map to each sub-aperture image.

Extract a pixel from each sub-aperture image and aggregate these pixels into a lenslet.
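The pixel rearrangement can be sketched as follows. This is a simplified model with a rectangular U×V angular grid; the real Lytro sensor uses a hexagonal lenslet layout, which is omitted here:

```python
import numpy as np

def subapertures_to_lenslet(views):
    """views: (U, V, H, W) stack of sub-aperture images.
    Returns an (H*U, W*V) lenslet image in which the U x V block at
    spatial position (y, x) collects pixel (y, x) from every view."""
    U, V, H, W = views.shape
    # (U, V, H, W) -> (H, U, W, V): interleave angular and spatial axes,
    # then flatten so each lenslet block is contiguous.
    return views.transpose(2, 0, 3, 1).reshape(H * U, W * V)
```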

SLIDE 6

Data Generation - Add Noise

Noise level estimation for each color channel

[Plots: noise standard deviation vs. intensity for the Red, Green1, Green2, and Blue channels]

Convert the color image to a raw image

  • Y. Schechner et al., “Multiplexing for optimal lighting”, IEEE TPAMI 2007
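A simplified version of the noise step: per-channel Gaussian noise with standard deviations read off the estimated curves. The paper injects noise in the raw (mosaicked) domain, which this sketch omits:

```python
import numpy as np

def add_channel_noise(img, sigmas, seed=None):
    """img: (H, W, 3) float image in [0, 1]; sigmas: one standard
    deviation per color channel. Adds zero-mean Gaussian noise and
    clips back to the valid range."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, 1.0, size=img.shape) * np.asarray(sigmas)
    return np.clip(img + noise, 0.0, 1.0)
```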
SLIDE 7

Data Generation - Realistic Sub-aperture Image Generation

Demosaic the noisy raw image.

Rearrange the pixels of each lenslet into each sub-aperture image.

SLIDE 8

Training Set http://hci-lightfield.iwr.uni-heidelberg.de/

Scenes and disparity ranges: Antinous [-3.3, 2.8], Boardgames [-1.8, 1.6], Dishes [-3.1, 3.5], Greek [-3.5, 3.1], Kitchen [-1.6, 1.8], Medieval2 [-1.7, 2.0], Museum [-1.5, 1.3], Pens [-1.7, 2.0], Pillows [-1.7, 1.8], Platonic [-1.7, 1.5], Rosemary [-1.8, 1.8], Table [-2.0, 1.6], Tomb [-1.5, 1.9], Tower [-3.6, 3.5], Town [-1.6, 1.6], Vinyl [-1.6, 1.2]

SLIDE 9

Cost Volumes - Phase Shift

[Figure: sub-aperture images; flipping adjacent views]

Very narrow baseline: physically 0.45 mm, disparities within 1 px

Averbuch and Keller, “A unified approach to FFT based image registration”, IEEE TIP 2003

Phase shift achieves 1/100-pixel precision

[Figure: crops comparing Original, Bilinear, Bicubic, and Phase-shift interpolation]

SLIDE 10

Cost Volumes - Phase Shift

[Figure: disparity maps vs. GT for Bilinear, Bicubic, and Phase-shift interpolation at 0.2% and 1% noise; error rates: 16.2%, 15.35%, 9.88%, 9.03%, 8.73%, 6.38%]

Jeon et al., “Accurate Depth Map Estimation from a Lenslet Light Field Camera”, CVPR 2015

SLIDE 11

Cost Volumes - Matching Costs

Sum of Absolute Differences (SAD): robust to image noise; acts as an averaging filter
Zero-mean Normalized Cross-Correlation (ZNCC): compensates for differences in both gain and offset
Census Transform (Census): tolerates radiometric distortions
Sum of Gradient Differences (GRAD): synergy with other matching costs by imposing higher weights at edge boundaries

  • H. Hirschmuller and D. Scharstein, “Evaluation of stereo matching costs on images with radiometric differences,” IEEE TPAMI 2009.
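Patch-level versions of the four costs can be sketched as below (window aggregation and the phase-shifted cost volumes are omitted):

```python
import numpy as np

def sad(p, q):                       # Sum of Absolute Differences
    return np.abs(p - q).sum()

def grad(p, q):                      # Sum of Gradient Differences
    gpy, gpx = np.gradient(p)
    gqy, gqx = np.gradient(q)
    return np.abs(gpx - gqx).sum() + np.abs(gpy - gqy).sum()

def zncc(p, q):                      # Zero-mean Normalized Cross-Correlation
    pz, qz = p - p.mean(), q - q.mean()
    denom = np.linalg.norm(pz) * np.linalg.norm(qz) + 1e-12
    return (pz * qz).sum() / denom   # 1.0 = perfect match

def census_cost(p, q):               # Hamming distance of census signatures
    def census(w):
        return (w > w[w.shape[0] // 2, w.shape[1] // 2]).ravel()
    return np.count_nonzero(census(p) != census(q))
```

Note how ZNCC is unchanged under gain/offset transforms and Census under any monotone intensity change, which is exactly why they survive radiometric distortion.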

SLIDE 12

Cost Volumes - Matching Group 1

[Diagram: a matching cost g(reference view, target view) is evaluated over the sub-aperture images at each depth label, producing a cost volume]
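Building one cost-volume slice per depth label can be sketched as follows (integer `np.roll` shifts stand in for the paper's sub-pixel phase shifts, and plain absolute difference stands in for the four matching costs):

```python
import numpy as np

def build_cost_volume(ref, tgt, labels):
    """Per-pixel absolute-difference cost between the reference view
    and the target view shifted by each candidate disparity label.
    Returns an (L, H, W) volume, one slice per label."""
    vol = np.empty((len(labels),) + ref.shape)
    for i, d in enumerate(labels):
        vol[i] = np.abs(ref - np.roll(tgt, d, axis=1))
    return vol
```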

SLIDE 13

Cost Volumes - Matching Group 2

[Diagram: the same matching cost g(reference view, target view) evaluated over a second grouping of sub-aperture images, producing another cost volume]

SLIDE 14

Cost Volumes - Computed Cost Volumes

[Table: one cost volume per (matching group, matching cost) pair: SAD, ZNCC, Census, GRAD]

SLIDE 15

Cost Volumes - Computed Cost Volumes

Disparities from each cost volume via Winner-Takes-All
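Winner-Takes-All is simply a per-pixel argmin over the label axis of a cost volume:

```python
import numpy as np

def winner_takes_all(vol, labels):
    """vol: (L, H, W) cost volume; returns the (H, W) map of the
    disparity label with the lowest cost at each pixel."""
    return np.asarray(labels)[np.argmin(vol, axis=0)]
```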

SLIDE 16

Cost Volumes - Computed Cost Volumes

Vectorizing the estimated depth labels together with the ground-truth depth label

[Figure: per-pixel labels from the fused cost volumes SAD+GRAD, GRAD+Census, and Census+SAD with blending weight β ∈ [0, 1.0], stacked into multiple disparity hypotheses alongside the ground truth]

Campbell et al., “Using multiple hypotheses to improve depth-maps for multi-view stereo”, ECCV 2008
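One way to read this step: fuse pairs of cost volumes with a blending weight β, take the WTA label for each β, and stack the labels into a per-pixel feature vector. A sketch under that reading:

```python
import numpy as np

def hypothesis_vectors(vol_a, vol_b, betas):
    """Blend two (L, H, W) cost volumes (e.g. SAD and GRAD) for each
    beta in [0, 1], take the per-pixel winner-takes-all label index,
    and stack the labels into one feature vector per pixel."""
    feats = [np.argmin((1.0 - b) * vol_a + b * vol_b, axis=0) for b in betas]
    return np.stack(feats, axis=-1)  # shape (H, W, len(betas))
```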

SLIDE 17

Cost Volumes - Computed Cost Volumes

[Figure: a second example of vectorizing the estimated depth labels with the ground-truth label into multiple disparity hypotheses]

SLIDE 18

Training a random forest

[Diagram: the per-pixel hypothesis vectors r are fed into Random Forest 1 for classification]

SLIDE 19

Random Forest1 - Classification

[Bar chart: permutation importance of the matching costs q1–q11, grouped into Matching Groups 1–4]

Retrieving a set of important matching costs using the permutation importance measure

[L. Breiman, “Random forests,” Machine Learning, 2001]

+ Removes unnecessary matching costs
+ Enables a better prediction model
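Breiman's permutation importance is model-agnostic: shuffle one feature column at a time and measure how much the score drops. A dependency-free sketch (the paper measures importance inside a trained random forest; here `score_fn` stands in for any fitted model's accuracy):

```python
import numpy as np

def permutation_importance(score_fn, X, y, n_repeats=5, seed=0):
    """Importance of feature j = average drop in score after randomly
    permuting column j of X, leaving everything else fixed."""
    rng = np.random.default_rng(seed)
    base = score_fn(X, y)
    imp = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        drops = []
        for _ in range(n_repeats):
            Xp = X.copy()
            rng.shuffle(Xp[:, j])   # destroy only feature j
            drops.append(base - score_fn(Xp, y))
        imp[j] = np.mean(drops)
    return imp
```

Features whose permutation barely changes the score (importance near zero) are the "unnecessary matching costs" that can be dropped before training the second forest.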
SLIDE 20

Random Forest2 - Regression

[Diagram: the selected matching costs in the feature vector r are the input of Random Forest 2]

Random Forest 2 performs regression, estimating a disparity value with sub-pixel precision.

Results are compared against SAD+GRAD [H.-G. Jeon et al., IEEE CVPR 2015] with a weighted median filter [Z. Ma et al., IEEE ICCV 2013].
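The second forest regresses a continuous disparity from the per-pixel feature vector. As a dependency-free stand-in for a random-forest regressor, a tiny k-NN regressor illustrates the same input/output contract (feature vector in, continuous disparity out):

```python
import numpy as np

def knn_regress(train_X, train_y, query, k=5):
    """Predict a continuous disparity for one feature vector as the
    mean target of its k nearest training vectors."""
    dist = np.linalg.norm(train_X - query, axis=1)
    nearest = np.argsort(dist)[:k]
    return float(train_y[nearest].mean())
```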

SLIDE 21

Benchmark: Bad Pixel Ratio (>0.07 px) & Mean Squared Error

[Tables: benchmark rankings by bad pixel ratio and by mean squared error, as of 2017.05.23]
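Both benchmark metrics are one-liners:

```python
import numpy as np

def bad_pixel_ratio(est, gt, thresh=0.07):
    """Fraction of pixels whose absolute disparity error exceeds thresh."""
    return float(np.mean(np.abs(est - gt) > thresh))

def mean_squared_error(est, gt):
    """Mean of squared per-pixel disparity errors."""
    return float(np.mean((est - gt) ** 2))
```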

SLIDE 22

Evaluation Results - Stratified

[Figures: ground truth (GT), estimated disparity, and error map]

SLIDE 23

[Figures: ground truth (GT), estimated disparity, and error map]

Evaluation Results - Training

Most errors occur at depth boundaries

SLIDE 24

[Figures: ground truth (GT), estimated disparity, and error map]

Evaluation Results - Test

SLIDE 25

Real World Examples – Lytro Illum

[Figures comparing: Wanner and Goldluecke, IEEE TPAMI 14; Yu et al., ICCV 13; Jeon et al., CVPR 15; Williem et al., CVPR 16; Wang et al., IEEE TPAMI 16; Tao et al., IEEE TPAMI 17; Proposed]

SLIDE 26

Real World Examples – Lytro Illum

[Figures comparing: Wanner and Goldluecke, IEEE TPAMI 14; Yu et al., ICCV 13; Jeon et al., CVPR 15; Williem et al., CVPR 16; Wang et al., IEEE TPAMI 16; Tao et al., IEEE TPAMI 17; Proposed]

SLIDE 27

Conclusion

Pros:

  • Accurate disparity estimation
  • Handles the narrow-baseline problem
  • Robust to image noise
  • Applicable to real-world light-field images

Cons:

  • Heavy computational burden
  • Disparity errors at depth discontinuities still need to be reduced
  • Textureless regions require better handling

Contributions:

  • Analysis of the problems of depth estimation using light-field cameras
  • Data augmentation that simulates a pipeline of a hand-held light-field camera
  • Pixel-wise disparity value prediction using two random forests
[Figure: a 3D mesh and a 3D-printed object produced from the estimated depth]
SLIDE 28

Data Generation - Add Noise

[Figure: results without an augmented training set, with a training set augmented with Gaussian noise, and with the fully augmented training set]