Flexible priors for robust This subtitle is 20 points dense stereo - - PowerPoint PPT Presentation

flexible priors for robust
SMART_READER_LITE
LIVE PREVIEW

Flexible priors for robust This subtitle is 20 points dense stereo - - PowerPoint PPT Presentation

Edit this text to create a Heading Flexible priors for robust This subtitle is 20 points dense stereo matching Bullets are blue They have 110% line spacing, 2 points before & after Longer bullets in the form of a paragraph are


slide-1
SLIDE 1

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Flexible priors for robust dense stereo matching

Sudipta N. Sinha

Microsoft Research

Lines, Planes and Manhattan Models for 3-D Mapping Workshop, IROS 2017

slide-2
SLIDE 2

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Much progress in 20+ years ..

Passive Stereo Matching

[Okutomi et al. 1996] [Taniai et al. 2017] [Bleyer & Gelautz 2004]

slide-3
SLIDE 3

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Reflections, transparency ▪ Thin structures ▪ Untextured regions ▪ Outdoor weather ▪ low light, rain, snow .. ▪ High resolution, real-time computation ▪ Imperfect calibration

Still challenging …

Nair+ ICCV 2015

slide-4
SLIDE 4

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Priors add robustness in presence of noise, ambiguity ▪ How to incorporate priors? ▪ Traditional tools (MRF inference; energy minimization) ▪ Solve stereo in conjunction with other tasks ▪ Supervised learning (deep learning) ▪ Adding structure (or constraints) to model ▪ Priors must be adaptive, to be useful

Incorporating priors and constraints

slide-5
SLIDE 5

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Taxonomy

Specific scene ▪ Strong bias ▪ Constraints, ▪ Structure General scene ▪ Low bias ▪ Soft prior ▪ Few assumptions Stereo (+ Segmentation, Flow, VO, Semantics) Stereo (coarse) Stereo (accurate)

slide-6
SLIDE 6

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Stereo (+ Segmentation, Flow, VO, Semantics) Specific scene General scene Stereo (coarse) Stereo (accurate) ▪ Man-made scenes ▪ Image-based rendering

slide-7
SLIDE 7

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Stereo matching with planar priors

[Sinha, Steedly and Szeliski 2009]

MRF optimization

slide-8
SLIDE 8

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Stereo matching with planar priors

slide-9
SLIDE 9

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

IBR using piecewise planar depth maps

slide-10
SLIDE 10

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

IBR using piecewise planar depth maps

slide-11
SLIDE 11

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Scenes with Reflections and Transparency

Reflective, glossy and transparent surfaces Stereo is not a solved problem!

slide-12
SLIDE 12

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Scenes with Reflections and Transparency

▪ Model such scenes with two layers; reflections have depth ▪ Two-layer depth maps; piecewise planarity

Input Image Depth Map (layer1) Depth Map (layer2)

[Sinha+ 2012]

slide-13
SLIDE 13

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Scenes with Reflections and Transparency

Decompose images (color) given the two depth layers

Two-layer stereo matching

slide-14
SLIDE 14

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Scenes with Reflections and Transparency

slide-15
SLIDE 15

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Scenes with Reflections and Transparency

▪ Two-layer Piecewise Planar Stereo

▪ MRF Labels: ▪ Single Planes ▪ Pair of Planes

▪ Energy minimization via graph cuts

Reflectivity Map

(opaque) (opaque + reflective)

Labeling

slide-16
SLIDE 16

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Scenes with Reflections and Transparency

▪ Two-layer Piecewise Planar Stereo

▪ MRF Labels: ▪ Single Planes ▪ Pair of Planes

▪ Energy minimization via graph cuts

Reflectivity Map

(opaque) (opaque + reflective)

Labeling

▪ BUT, recovered reflectivity mask often inaccurate ▪ Still, many open issues in reflection modeling

slide-17
SLIDE 17

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Stereo (+ Segmentation) Flow, VO, Semantics) Specific scene General scene Stereo (coarse) Stereo (accurate) ▪ Image-based rendering ▪ Image editing

slide-18
SLIDE 18

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Tackle general scenes ▪ Local planes as primitives ▪ Estimate pixel-to-plane labeling ▪ Learn surface appearance (color) models

Stereo matching with planar priors

Semi-global stereo (SGM) Find planes Plane label map Depth map

[Kowdle, Sinha and Szeliski 2012]

slide-19
SLIDE 19

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Iterate between:

  • Segmentation (graph cuts)
  • Learning color distributions (like GrabCut [Rother+ 2004])

Stereo matching with planar priors

SGM stereo Only stereo cues Appearance + stereo cues

[Kowdle, Sinha and Szeliski 2012]

slide-20
SLIDE 20

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Interactive, 3D immersive viewer for photos

Microsoft Photosynth2 (2012)

▪ MSR SpinMovies

prototype

slide-21
SLIDE 21

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Automatic Multi-view Object Segmentation

structure from motion (SfM) reconstruction

[Kowdle, Sinha and Szeliski 2012]

slide-22
SLIDE 22

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Automatic Multi-view Object Segmentation

structure from motion (SfM) reconstruction

[Kowdle, Sinha and Szeliski 2012]

slide-23
SLIDE 23

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Automatic Multi-view Object Segmentation

structure from motion (SfM) reconstruction

[Kowdle, Sinha and Szeliski 2012]

slide-24
SLIDE 24

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Automatic Multi-view Object Segmentation

structure from motion (SfM) reconstruction

[Kowdle, Sinha and Szeliski 2012]

slide-25
SLIDE 25

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Automatic Multi-view Object Segmentation

structure from motion (SfM) reconstruction

[Kowdle, Sinha and Szeliski 2012]

slide-26
SLIDE 26

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Automatic Multi-view Object Segmentation

structure from motion (SfM) reconstruction

[Kowdle, Sinha and Szeliski 2012]

slide-27
SLIDE 27

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Accurate depth discontinuities and surface normals ▪ more critical than accurate depth ▪ Planes (surfaces) as MRF labels instead of disparities. ▪ Strong prior ▪ Recovering planes (various ways) ▪ Restricted label set enforces geometric constraints

▪ planarity, parallelism, two-layers ..

Piecewise planar priors / IBR

slide-28
SLIDE 28

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Limitations of strong priors

▪ Strong bias; Underlying 3d shape lacks detail

▪ Common cause of failure

▪ Proposal generator could miss some surfaces!

▪ Data-augmentation for deep learning

▪ Currently based on image processing + CG rendering ▪ Image-based rendering could also be effective !

Piecewise planar priors / IBR

slide-29
SLIDE 29

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Stereo (+ Segmentation) Flow, VO, Semantics) Specific scene General scene Stereo (coarse) Stereo (accurate) ▪ Dense 3D Reconstruction ▪ High-resolution stereo pairs

slide-30
SLIDE 30

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Middlebury Teddy (2005) KITTI (2012)

Disney (Kim+ 2013) 10—20 Mpixels.

Middlebury v3 (2016) 5-6 Mpixels.

slide-31
SLIDE 31

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Semi Global Matching [Hirschmüller 2005]

  • Most successful practical stereo method
  • Used in photogrammetry, assisted driving, robotics, …
slide-32
SLIDE 32

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Approximates 2D MRF using 1D optimization along 8 cardinal directions

  • related to BP, TRW-S [Drory et al. 2014]

Semi Global Matching [Hirschmüller 2005]

slide-33
SLIDE 33

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Approximates 2D MRF using 1D optimization along 8 cardinal directions

  • related to BP, TRW-S [Drory et al. 2014]

Semi Global Matching [Hirschmüller 2005]

▪ Evaluates the whole DSI ▪ Inefficient for high-resolution images

slide-34
SLIDE 34

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Solve many local plane sweep stereo (LPS) problems ▪ Generates surface proposals; fuse them into a disparity map

Local Plane Sweep (LPS) Stereo

[Sinha+ 2014]

slide-35
SLIDE 35

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Local Plane Sweep (LPS) Stereo

[Sinha+ 2014]

slide-36
SLIDE 36

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Local Plane Sweep (LPS) Stereo

disparity range (pixels) time (seconds)

[Sinha+ 2014]

slide-37
SLIDE 37

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Local Plane Sweep (LPS) Stereo

disparity range (pixels) time (seconds)

▪ more accurate than SGM; faster ▪ 2nd fastest after ELAS (Geiger+2010) ▪ BUT, plane fitting can be unreliable; can give errors

[Sinha+ 2014]

slide-38
SLIDE 38

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Approximates 2D MRF using 1D optimization along 8 cardinal directions

Semi Global Matching [Hirschmüller 2005]

▪ Fronto parallel bias ▪ Inaccurate on slanted untextured surfaces

slide-39
SLIDE 39

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Left image GT disparities

SGM @ quarter resolution SGM @ full resolution (6 MP)

slide-40
SLIDE 40

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: SGM with orientation priors

▪ What if we knew the surface slant? ▪ Replace fronto-parallel bias with bias parallel to surface

[Scharstein, Taniai, Sinha, 3DV 2017]

Idea:

▪ Rasterize disparity surface prior (at arbitrary depth) ▪ Adjust V(d, d’) to follow discrete disparity “steps”

slide-41
SLIDE 41

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: 2D orientation priors

slide-42
SLIDE 42

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: 3D orientation priors

Jump locations vary with disparity

slide-43
SLIDE 43

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: Where do we get priors?

Matched features + triangulation

Matched features + plane fitting

Low-res matching + plane fitting

Ground truth oracle

Semantic analysis

Manhattan-world assumptions

slide-44
SLIDE 44

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: Results

slide-45
SLIDE 45

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: Results

slide-46
SLIDE 46

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

SGM-P: Results

▪ Huge performance gains for slanted untextured scenes ▪ Soft constraint, doesn’t hurt accuracy – Good!

slide-47
SLIDE 47

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Taxonomy

Stereo (+ Segmentation, Flow, VO, Semantics) Specific scene General scene Stereo (coarse) Stereo (accurate)

slide-48
SLIDE 48

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Semantic Correspondence +

Co-Segmentation (2016)

Joint estimation of …

▪ Stereo + Segmentation (2011) ▪ Scene Flow (2017)

+ Ego-motion + Motion Segmentation

slide-49
SLIDE 49

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Joint recovery of disparity and segmentation in both views ▪ Model: Objects in scene, each have

▪ Color model (GMM) ▪ Surface model (plane + parallax)

▪ Assumptions:

▪ Color distribution is compact ▪ Object surface close to a 3D plane

Object Stereo [Bleyer+ 2011]

slide-50
SLIDE 50

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Minimize: 𝐹 𝐸, 𝑃 = 𝐹𝑞ℎ𝑝𝑢𝑝 𝐸 + 𝐹𝑑𝑝𝑚𝑝𝑠 𝑃 + 𝐹𝑡𝑛−𝐸 𝐸 + 𝐹𝑡𝑛−𝑃 𝑃 + 𝐹𝑛𝑒𝑚 𝐸, 𝑃 + …

Object Stereo [Bleyer+ 2011]

slide-51
SLIDE 51

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Proposal generation ▪ Merge proposals optimally

  • MRF Fusion moves (non-submodular graph cuts)
  • Quadratic Pseudo Boolean Optimization (QPBO)

Object Stereo [Bleyer+ 2011]

Current solution Proposal Fusion result

Minimize: 𝐹 𝐸, 𝑃 = 𝐹𝑞ℎ𝑝𝑢𝑝 𝐸 + 𝐹𝑑𝑝𝑚𝑝𝑠 𝑃 + 𝐹𝑡𝑛−𝐸 𝐸 + 𝐹𝑡𝑛−𝑃 𝑃 + 𝐹𝑛𝑒𝑚 𝐸, 𝑃 + …

slide-52
SLIDE 52

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

▪ Input: Images with semantically related objects (different instances) ▪ Output: Common object regions and dense correspondence

associated with these regions.

Joint Correspondence and Co-segmentation

[Taniai, Sinha, Sato 2016]

Mask + Flow

Source Target

Warped Source Image

slide-53
SLIDE 53

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Joint Correspondence and Co-segmentation

[Taniai, Sinha, Sato 2016]

Input

Ground Truth OURS SIFT Flow DSP

Warped results

Target

slide-54
SLIDE 54

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Joint Correspondence and Co-segmentation

[Taniai, Sinha, Sato 2016]

OURS OURS (Single) Faktor + Irani Joulin+

Segmentation Accuracy

Flow Accuracy

OURS OURS (Single) SIFT Flow DSP DFF

slide-55
SLIDE 55

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Joint Correspondence and Co-segmentation

[Taniai, Sinha, Sato 2016]

OURS OURS (Single) Faktor + Irani Joulin+

Segmentation Accuracy

Flow Accuracy

OURS OURS (Single) SIFT Flow DSP DFF

▪ Joint approach improves accuracy on both tasks ▪ Outperforms methods specifically designed for each task

slide-56
SLIDE 56

56

Optical Flow Disparity Map Moving object segmentation Left Right

Input: Stereo Video

Output Fast Multi-frame Stereo Scene Flow with Motion Segmentation Taniai, Sinha, Sato 2017

slide-57
SLIDE 57

57 Visual

  • dometry

Initial motion segmentation Optical flow

𝐽𝑢

0, 𝐽𝑢+1

Frig Rigid flow ෩ S Initial segmentation Epipolar stereo

𝐽𝑢±1

0,1 , 𝐽𝑢 0, 𝐽𝑢 1

Flow fusion Fnon Non-rigid flow

𝐽𝑢

0, 𝐽𝑢+1

+ ෪ D + 𝐐, D + 𝐐

𝐐 Ego-motion D Disparity

+ ෩ S

Binocular stereo

𝐽𝑢

0, 𝐽𝑢 1

෪ D

  • Init. disparity

𝐽𝑢±1

0,1 , 𝐽𝑢 0, 𝐽𝑢 1

𝐽𝑢

0, 𝐽𝑢+1

+Frig, Fnon

Flow

Input

Fast Multi-frame Stereo Scene Flow with Motion Segmentation Taniai, Sinha, Sato 2017

Final Segmentation

slide-58
SLIDE 58

58

KITTI 2015 Scene Flow Benchmark (Nov 2016)

200 road scenes with multiple moving objects

Fast Multi-frame Stereo Scene Flow with Motion Segmentation Taniai, Sinha, Sato 2017

slide-59
SLIDE 59

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Summary

▪ Piecewise planar stereo

▪ Coarse, simplified depth maps ▪ Good for Image-based rendering ▪ Planes (surfaces) as primitives ▪ How about higher-level primitives?

▪ 3D CAD models ▪ 3D shape templates

slide-60
SLIDE 60

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Summary

▪ Extending SGM

▪ Reducing the search space (local plane sweeps) ▪ Soft orientation prior

▪ Merit of joint formulations for complementary tasks

▪ Stereo + segmentation ▪ Semantic correspondence ▪ Scene fow from stereoscopic video

slide-61
SLIDE 61

Edit this text to create a Heading

▪ This subtitle is 20 points ▪ Bullets are blue ▪ They have 110% line spacing, 2 points before & after ▪ Longer bullets in the form of a paragraph are harder to

read if there is insufficient line spacing. This is the maximum recommended number of lines per slide (seven).

▪ Sub bullets look like this

Collaborators

Rick Szeliski Facebook Tatsunori Taniai RIKEN, Tokyo Daniel Scharstein, Middlebury College Michael Goesele TU Darmstadt Drew Steedly Microsoft Adarsh Kowdle PerceptiveIO Michael Bleyer Microsoft Johannes Kopf Facebook Yoichi Sato

  • Univ. of Tokyo

Pushmeet Kohli DeepMind Carsten Rother TU Dresden