RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor - PowerPoint PPT Presentation

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1 , Michael Krainin 1 , Evan Herbst 1 , Hao Du 3 , Marvin Cheng 1 , Xiaofeng Ren 2 , and Dieter Fox 1,2 1 University of Washington Computer Science & Engineering 2 Intel Labs Seattle (now ISTC at UW) 3 Google 1

The Kinect 2

PrimeSense Technology Red Green Blue Depth 3

RGB-D Data 4

The Goal Align the “frames” from a Kinect to create a single 3D map (or model) of the environment Like this… 5

Related Work  SLAM  [Davison et al, PAMI 2007] (monocular)  [Konolige et al, IJR 2010] (stereo)  [Pollefeys et al, IJCV 2007] (multi-view stereo)  [Borrmann et al, RAS 2008] (3D laser)  [May et al, JFR 2009] (ToF sensor)  Loop Closure Detection  [Nister et al, CVPR 2006]  [Paul et al, ICRA 2010]  Photo collections  [Snavely et al, SIGGRAPH 2006]  [Furukawa et al, ICCV 2009] 7

System Overview 1. Frame-to-frame alignment 2. Global Optimization (Loop Closure) 3. Map representation 8

RANSAC (Random Sample Consensus)  Visual features (from image) in 3D (from depth)  Figure out how the camera moved by matching these feature 9

What is RANSAC?  For each feature point, find the most similar descriptor in the other frame  Find largest set of consistent matches  Move the new frame to align these matches 10

Alignment (RANSAC) 11

RANSAC Details  Feature Detector / Descriptor Options  SIFT (SiftGPU)  SURF  FAST Detector / Calonder Descriptor  (All available in OpenCV)  Matching:  L2 descriptor distance  Either SIFT style matching or window matching 12

RANSAC Failure Cases  Low light  Lack of visual “texture” or features  Kinect still provides depth or “shape” information 13

ICP (Iterative Closest Point)  Iterative Closest Point (ICP) uses shape to align frames  Does not require the RGB image  Does need a good initial “guess”  Repeat the following two steps:  For each point in cloud 1, find the closest point in cloud 2  Compute the transformation that best aligns this set of corresponding pairs 14

ICP Failure Cases  Not enough distinctive shape  Don’t have a close enough initial “guess”  Here the shape is basically a simple plane… 16

Joint Optimization (RGBD-ICP) 17

Optimal Transformation 18

Optimal Transformation SCARY MATH!?!? 19

Two-Stage Alternative 20

Loop Closure  Sequential alignments accumulate error  Revisiting a previous location results in an inconsistent map 21

Loop Closure Detection  Detect by running RANSAC against previous frames  Pre-filter options (for efficiency):  Only a subset of frames ( keyframes )  Only keyframes with similar estimated 3D pose  Place recognition using vocabulary tree  Post-filter (avoid false positives)  Estimate maximum expected drift and reject detections changing pose too greatly 23

Loop Closure Correction (TORO)  TORO [Grisetti 2007]:  Constraints between camera locations in pose graph  Maximum likelihood global camera poses 24

Loop Closure Correction (SBA)  Minimize reprojection error of features 25

Comparison (TORO) 26

Comparison (SBA) 27

A Second Comparison TORO SBA 28

Resulting Map 30

Map Representation: Surfels  Surface Elements [Pfister 2000, Weise 2009, Krainin 2010]  Circular surface patches  Accumulate color / orientation / size information  Incremental, independent updates  Incorporate occlusion reasoning  750 million points reduced to 9 million surfels 31

Experiments  Reprojection error is better for RANSAC:  Errors for variations of the algorithm:  Timing for variations of the algorithm: 33

Experiments: Overlay 1 34

Experiments: Overlay 2 35

Application: Measurements 36

Application: Quadrocopter  Collaboration with Albert Huang, Abe Bacharach, and Nicholas Roy from MIT 37

Occupancy Map 42

Application: Interactive Mapping  Allow anyone to construct maps with a Kinect  Uses for these maps  Localization  Measurements  Remodeling  Buy new furniture  Video game levels??? 47

Conclusion  Kinect-style depth cameras have recently become available as consumer products  RGB-D Mapping can generate rich 3D maps using these cameras  RGBD-ICP combines visual and shape information for robust frame-to-frame alignment  Global consistency achieved via loop closure detection and optimization (RANSAC, TORO, SBA)  Surfels provide a compact map representation  ROS + OpenCV are powerful tools to enable these applications 49

Open Questions  Which are the best features to use?  How to find more loop closure constraints between frames?  What is the right representation (point clouds, surfels, meshes, volumetric, geometric primitives, objects)?  How to generate increasingly photorealistic maps?  Autonomous exploration for map completeness?  Can we use these rich 3D maps for semantic mapping? 50

Links  www.cs.washington.edu/robotics/projects/rgbd-3d- mapping/  www.ros.org  The following have nice ROS integration but also work separately:  http://opencv.willowgarage.com/wiki/  http://www.pointclouds.org/  peter@cs.washington.edu 51

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor - PowerPoint PPT Presentation

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1 , Michael Krainin 1 , Evan Herbst 1 , Hao Du 3 , Marvin Cheng 1 , Xiaofeng Ren 2 , and Dieter Fox 1,2 1 University of Washington Computer Science &

RGB-D Mapping Overview CSE 571 Robotics Map RGB-D Mapping `` University of Washington Dieter

Cameras Cameras Good cameras need focusing, cheap ones dont Good cameras need focusing,

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1 ,

RGBD Tutorial 14210240041 Gu Pan Image RGB YUV Lab Depth Image RGB image Depth image Each pixel in

Mapping and Modeling with RGB-D Cameras University of Washington Dieter Fox 1 Outline

RGB ar chitect s RGB ar chitect s RGB ar chitect s Concepts behind Blended Learning

Correcting Image Defects Chaiwoot Boonyasiriwat October 30, 2020 RGB Color Space Most

Texture and other Mappings Texture Mapping Texture Mapping Bump Mapping Bump Mapping

CSE-571 Robotics Mapping Sparse landmarks RGB / Depth Maps Problems in Mapping Occupancy

Tsinghua University Monocular Depth-Pose Prediction [R, t] Depth and Pose RGB PoseNet

Range gated cameras technology and its applications Friday, October 10, 14 Range gated cameras

Cameras Cameras photographs, you should: photographs, you should: increase the diameter of the

Holistic Scene Understanding for 3D Object Detection with RGB-D cameras Dahua Lin, Sanja Fidler,

for each dst in my.out_edges if dst.depth > my.depth+1 then dst.depth = my.depth+1

Image Warping Image Mapping Image Mapping - Examples Forward Mapping Forward Mapping -

TEXTURE MAPPING 1 OUTLINE Introduce Mapping Methods Texture Mapping Environment

Basic properties of Toeplitz and Hankel operators in non-algebraic setting Karol Lenik Pozna

A Generating Equation for Integrable Charges Till Bargheer, Niklas Beisert, Florian Loebbert

Supersymmetric field theories Antoine Van Proeyen Antoine Van Proeyen KU Leuven Summer school

RADIATIVE and NONLEPTONIC HYPERON DECAYS in BROKEN SU(3) P. enczykowski Institute of Nuclear

Compact structures of interstellar Compact structures of interstellar plasma in the Galaxy

Installation Installation Procedures Procedures for Clusters for Clusters PART 1 Cluster

Explosive Dark Matter Production of Heavy Elements in Compact Stars Joseph Bramante Queens

Reference informa9on Consult reference for complete informa9on!

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor - PowerPoint PPT Presentation

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1 , Michael Krainin 1 , Evan Herbst 1 , Hao Du 3 , Marvin Cheng 1 , Xiaofeng Ren 2 , and Dieter Fox 1,2 1 University of Washington Computer Science &

RGB-D Mapping Overview CSE 571 Robotics Map RGB-D Mapping `` University of Washington Dieter

Cameras Cameras Good cameras need focusing, cheap ones dont Good cameras need focusing,

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments Peter Henry 1 ,

RGBD Tutorial 14210240041 Gu Pan Image RGB YUV Lab Depth Image RGB image Depth image Each pixel in

Mapping and Modeling with RGB-D Cameras University of Washington Dieter Fox 1 Outline

RGB ar chitect s RGB ar chitect s RGB ar chitect s Concepts behind Blended Learning

Correcting Image Defects Chaiwoot Boonyasiriwat October 30, 2020 RGB Color Space Most

Texture and other Mappings Texture Mapping Texture Mapping Bump Mapping Bump Mapping

CSE-571 Robotics Mapping Sparse landmarks RGB / Depth Maps Problems in Mapping Occupancy

Tsinghua University Monocular Depth-Pose Prediction [R, t] Depth and Pose RGB PoseNet

Range gated cameras technology and its applications Friday, October 10, 14 Range gated cameras

Cameras Cameras photographs, you should: photographs, you should: increase the diameter of the

Holistic Scene Understanding for 3D Object Detection with RGB-D cameras Dahua Lin, Sanja Fidler,

for each dst in my.out_edges if dst.depth &gt; my.depth+1 then dst.depth = my.depth+1

Image Warping Image Mapping Image Mapping - Examples Forward Mapping Forward Mapping -

TEXTURE MAPPING 1 OUTLINE Introduce Mapping Methods Texture Mapping Environment

Basic properties of Toeplitz and Hankel operators in non-algebraic setting Karol Lenik Pozna

A Generating Equation for Integrable Charges Till Bargheer, Niklas Beisert, Florian Loebbert

Supersymmetric field theories Antoine Van Proeyen Antoine Van Proeyen KU Leuven Summer school

RADIATIVE and NONLEPTONIC HYPERON DECAYS in BROKEN SU(3) P. enczykowski Institute of Nuclear

Compact structures of interstellar Compact structures of interstellar plasma in the Galaxy

Installation Installation Procedures Procedures for Clusters for Clusters PART 1 Cluster

Explosive Dark Matter Production of Heavy Elements in Compact Stars Joseph Bramante Queens

Reference informa9on Consult reference for complete informa9on!

for each dst in my.out_edges if dst.depth > my.depth+1 then dst.depth = my.depth+1