Fast Multiple-baseline Stereo with Occlusion Marc-Antoine Drouin - PowerPoint PPT Presentation

Fast Multiple-baseline Stereo with Occlusion Marc-Antoine Drouin Martin Trudeau S´ ebastien Roy { drouim,trudeaum,roys } @iro.umontreal.ca June 2005

Overview Introduction. • Previous Works. • Observation. • Our Algorithm. • Experimental Results. • Conclusion. • 1

Dense Stereo Right Depth map Left Y X Near Far 2 cameras • For each pixel in the left image we try to find the corresponding pixel in the right image. • The resulting displacement for that pixel (disparity) re- lates to the distance between the object and the reference camera. 2

Dense Stereo � E ( f ) = e ( p , f ( p )) + smoothing . p ∈P � �� likelihood 2 cameras • P : set of reference pixels. • f : disparity map. • Hypothesis : for each reference pixel corresponds a sup- porting pixel. 3

Camera Configuration top right ref left bottom • 5 cameras in cross configuration. • Disparity map is computed for the central camera. • In red, examples of occlusion. 4

Disparity and Visibility Maps top right ref left bottom Reference camera • One disparity for each pixel. • One visibility mask for each pixel. • i.e. mask (0 , 0 , 0 , 1) . 5

Multi-camera and Occlusion � E ( f, g ) = e ( p , f ( p ) , g ( p )) + smoothing . p ∈P with g ( p ) = V ( p | f ( p ) , f ) ∀ p ∈ P • f disparity map. • g visibility mask map. 6

✎ ✔ ✕ ✕ ✕ ✕ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✕ ✔ ✔ ✔ ✔ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✕ ✕ ✓ ✖ ✗ ✗ ✗ ✗ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✕ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✖ ✕ ✕ ✕ ✕ ✓ ✓ ✗ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✑ ✏ ✏ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✏ ✑ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✗ ✗ ✎ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✛ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✚ ✛ ✛ ✚ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✢ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✜ ✚ ✚ ✗ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✘ ✗ ✗ ✗ ✗ ✗ ✗ ✗ ✗ ✗ ✗ ✗ ✗ ✘ ✘ ✚ ✙ ✚ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✘ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✙ ✘ ✘ ✘ ✘ ✎ ✎ ✍ ✄ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ✄ ✄ ✄ ✄ ☎ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ☎ ☎ ✄ ✆ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ☎ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ☎ ☎ ☎ ☎ ✄ ✄ ✍ � ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ � � � ✁ � � � � � � � � � � � � � � ✁ ✁ ✄ ✂ ✄ ✄ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✁ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✁ ✁ ✁ ✝ ✝ ✝ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☛ ✌ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ✡ ✡ ☞ ✌ ✡ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✌ ✍ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✝ ✡ ✡ ✞ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✞ ✞ ✟ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✝ ✝ ✝ ✝ ✡ ✟ ✟ ✠ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✠ ✠ ✠ ✟ ✠ ✠ ✠ ✟ ✟ ✟ ✟ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ Occlusion • We can pre-compute a sub-set M h of plausible masks. • Some masks are improbable. • Some masks are very probable. plausible Nakamura96 less plausible 7

Nakamura96, Park97, Kang01 and Besnerais04 g ∗ f ( p ) = arg min e ( p , f ( p ) , m ) w ( m ) m ∈M h then, � E ( f, g ∗ e ( p , f ( p ) , g ∗ f ) = f ( p )) + smoothing p ∈P Occlusion • Hypothesis : photo-consistency ⇒ correct visibility. • Visibility is heuristic. 8

Occlusion Zones in Stereo Cumulative histogram of likelihood term • Black : non-occluded pixels. • Red :occluded pixels. 1 1 1 1 0.8 0.8 0.8 0.8 0.6 0.6 0.6 0.6 0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.2 10 20 30 40 50 60 70 80 10 20 30 40 50 60 70 80 10 20 30 40 50 60 70 80 10 20 30 40 50 60 70 80 Tsukuba Ground truth Venus Ground truth Sawtooth Ground truth Map Ground truth 1 1 1 1 0.8 0.8 0.8 0.8 0.6 0.6 0.6 0.6 0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.2 10 20 30 40 50 60 70 80 10 20 30 40 50 60 70 80 10 20 30 40 50 60 70 80 10 20 30 40 50 60 70 80 Tsukuba Direct search Venus Direct search Sawtooth Direct search Map Direct search • Photo-consistency �⇒ geo-consistency. • Show the limitation of heuristic approaches. 9

Geo-consistency All masks are consistent with the scene geometry. g ( p ) ≤ V ( p | f ( p ) , f ) ∀ p ∈ P Nakamura96 • Using an occluded camera ⇒ important artifact. • Not using a visible camera ⇒ no impact. 10

Kolmogorov02, Faugeras98 and Drouin05 � E ( f, g ) = e ( p , f ( p ) , g ( p )) + smoothing p ∈P with g ( p ) ≤ V ( p | f ( p ) , f ) ∀ p ∈ P Occlusion • Kolmogorov : jumps from one geo-consistent configuration to another. • Faugeras : level set (continuous framework). • Drouin : starts from a non geo-consistent solution and converges to one which is. • One common feature : hard to solve. 11

Disparities and Occlusions camera camera d i d i d j d j x j x i x j x i continuous not continuous Continuous representation • Occlusion : x i + d i ≥ x j + d j Discontinuous representation • Occlusion : x i + d i = x j + d j 12

Disparities and Occlusions camera camera d i d i d j d j x j x i x j x i continuous not continuous Continuous representation • Occlusion occurs when 0 ≤ k<j ( k + d k ) ≥ j + d j max • Occlusion at j depends on visibility at < j . • Efficiently computed. 13

✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ � ✁ � � � � � � � � Dynamic Programming disparity d+2 d+1 d d−1 d−2 d−3 pixel i−5 i−4 i−3 i−2 i−1 i 14

Visibility Masks Top unknown visibility Left Right unknown visibility known visibility Bottom known visibility to be minimized already minimized Occlusion • Cameras can be split in 2 sets C G and C H . • 2 sets of masks are build M G and M H . • Sets depend on the order in which lines are processed. 15

Visibility Masks Top unknown visibility Left Right unknown visibility known visibility Bottom known visibility to be minimized Masks already minimized M g = { (0, 1 ,0, 0 ),(0, 0 ,0, 1 ),(0, 1 ,0, 1 ) } M h = { (1, 0 ,0, 0 ),(0, 0 ,1, 0 ) } Occlusion • Camera order (left, right, top, bottom). • In bold : cameras belonging to C g . 16

Energy Function � E ( f, g ) = e ( p , f ( p ) , g ( p )) + smoothing p ∈P with  a mask in M g if a camera in C g is visible  g ( p ) = arg min e ( p , f ( p ) , m ) otherwise  m ∈M h Configuration of low energy. 17

Disparity and Visibility smoothing Disparity map Visibility map • Difference of depth between two neighbor pixels. • Change in the set of masks ( M h and M g ). • Smoothing function may have any shape. 18

2 Steps Smoothing Active Smoothing Passive Smoothing Iterative Dynamic Programming (Leung04) 19

Experimental Results Tsukuba Head and Lamp • 384 × 288 with 16 disparity steps. • 5 images in cross shape configuration were used. 20

Experimental Results | f ( p ) − f T ( p ) | > 1 • An error of 1 could be the result of discretization. • Standard metric. 21

Experimental Results Algorithms Error Ours + IDP (16 iterations) 1.57% Ours + IDP (4 iterations) 1.67% Nakamura96+ Graph Cut 1.77% Ours + IDP (1 iteration) 1.82% Kolmogorov02 2.30% Nakamura96 + IDP (12 iterations) 2.35% Drouin05 +BNV 2.46% 22

Experimental Results Middlebury sequence • 334 × 383 with 20 disparity steps. • 6 scenes with 7 images each in single baseline configuration were used. 23

Experimental Results Middlebury sequence algorithms barn1 barn2 bull poster venus sawtooth average Graph Cut (no occlusion) 3.5 % 3.1 % 0.7 % 3.7 % 3.4 % 3.3% 3.0% IDP (no occlusion) 3.0 % 4.9% 1.2% 6.0 % 5.8% 3.7% 4.1% Drouin05 +Graph Cut 0.8 % 0.6 % 0.4 % 1.1 % 2.4 % 1.1 % 1.3% Nakamura96 + Graph Cut 1.4 % 1.5 % 0.9 % 1.1 % 4.0 % 1.5% 1.7% Ours +IDP 0.7 % 3.9 % 0.8 % 4.0 % 5.3% 1.0 % 2.6% Nakamura96 + IDP 1.6 % 6.0 % 1.9 % 4.5 % 7.4% 2.2 % 3.9% • The camera configuration is not favorable to our ap- proach. 24

Experimental Results Tsukuba sequence • 320 × 240 with about 24 disparity steps. • 5 images in cross shape configuration were used. 25

Conclusion Summary • Hybrid between geo-consistent and heuristic approaches. • Fast and can easily be parallelized. • Code can be download from : www.iro.umontreal.ca/~drouim/ Future work • Generalizing to arbitrary camera configurations. Wish list • Designing an hardware implementation in FPGA. 26

Fast Multiple-baseline Stereo with Occlusion Marc-Antoine Drouin - PowerPoint PPT Presentation

Fast Multiple-baseline Stereo with Occlusion Marc-Antoine Drouin Martin Trudeau S ebastien Roy { drouim,trudeaum,roys } @iro.umontreal.ca June 2005 Overview Introduction. Previous Works. Observation. Our Algorithm.

Chronic Total Occlusion of the Chronic Total Occlusion of the Coronary Artery Coronary Artery

Employing Dynamic Employing Dynamic Transparency for 3D Occlusion Transparency for 3D Occlusion

3D Photography: Stereo Matching Kevin Kser, Marc Pollefeys Spring 2012

3D Vision: Stereo Marc Pollefeys, Torsten Sattler Spring 2016

Today Recap: epipolar constraint Stereo image rectification Stereo: Stereo

Computer Graphics - Advanced Rasterization - Stefan Lemme Recap: occlusion query Occlusion

Towards Deep Multi-View Stereo Silvano Galliani October 2, 2017 1 / 40 Towards Deep Multi-View

White Paper Runs Name Description baseline2018a Baseline Project-official baseline (official

Stereo Matching 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What is stereo

Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2.

ACUTE ARTERIAL Vascular Surgery Conference OCCLUSION Michael Lebow, MD ACUTE ARTERIAL OCCLUSION

Image Space Occlusion Culling Image Space Occlusion Culling 1 Hudson et al, SoCG 97 Occluder A

Efficient Deep Learning for Stereo Matching Wenjie Luo, Alex Schwing and Raquel Urtasun W. Luo

New Results from STEREO: a Search for a Sterile Neutrino 1eV at Short Baseline Laura Bernard

Latest results of the STEREO Experiment a Search for a Sterile Neutrino ~1eV at Short Baseline

Technical Baseline Management Technical Baseline Management September 30, 2003 Pat Hascall LAT

3D Photography: Stereo Vision Kalin Kolev, Marc Pollefeys Spring 2013

People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth

Developing Objects Segregation capabilities and the notion of Object Containment from unlabeled

Point-Based Global Illumination for Movie Production Per Christensen Pixar Animation Studios

Synthetic Occlusion Augmentation wit ith Volumetric Heatmaps fo for r 3D Human Pose Esti

Feature Representation in Person Re-identification Hong Chang Institute of Computing Technology

Learning Optical Flow with Limited Data Jia Xu ( ) T encent AI Lab 2019-03-14 1

Depth from Stereo Sanja Fidler CSC420: Intro to Image Understanding 1 / 12 Depth from Two