Deep Hough Voting for 3D Object Detection in Point Clouds
Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas; The IEEE International Conference on Computer Vision (ICCV), 2019
Jan Bayer, Computational Robotics Laboratory, Czech Technical University
Introduction
- Goal: detect object classes and bounding boxes from 3D point clouds
- Input: Uncolored 3D point clouds
– Robust to illumination changes
- Contribution
– A reformulation of Hough voting in the context of deep learning through an end-to-end differentiable architecture
– State-of-the-art 3D object detection performance on SUN RGB-D and ScanNet
– An in-depth analysis of the importance of voting for 3D object detection in point clouds
Deep Hough Voting for 3D Object Detection in Point Clouds, Qi et al. ICCV 2019
3D object detection methods
- Extended 2D-based detectors to 3D
– 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans, Hou et al. CVPR 2019
– Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images, Song et al. CVPR 2016
– 3D CNN detectors → high cost of 3D convolutions
- Projection to 2D bird’s eye view images
– Multi-View 3D Object Detection Network for Autonomous Driving, Chen et al. CVPR 2017
– Designed for outdoor LIDAR data
- 2D-based detectors, projection to point cloud
– Frustum PointNets for 3D Object Detection from RGB-D Data, Qi et al. CVPR 2018
– 2D-Driven 3D Object Detection in RGB-D Images, Lahoud et al. CVPR 2017
– Strictly dependent on the 2D detector
– 2D object detection quickly reduces the search space
Extended 2D-based detectors to 3D
- 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans, Hou et al. CVPR 2019.
– Fuses 2D RGB input features with 3D scan geometry features
Projection to 2D bird’s eye view images
MV3D: Multi-view 3d object detection network for autonomous driving, Chen et al. CVPR 2017
- Data from a front RGB camera and LIDAR → 3 views are used to generate 2D features
- Fused features are used to jointly predict the object class and regress an oriented 3D box
2D-based detectors, projection to point cloud
- F-PointNet: Frustum PointNets for 3D Object Detection from RGB-D Data, Qi et al. CVPR 2018
- 2D CNN object detector to propose 2D regions and classify their content
- Similar architecture to the older 2D-driven approach (Lahoud et al.)
VoteNet
PointNet
- PointNet
– PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, Qi et al. CoRR 2016
– For an input point cloud of size N:
- Generates N local features (one for each input point)
- Generates a single global feature
- Processing the combination of local and global features → classification and 3D scene segmentation
- PointNet++
– PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space, Qi et al. CoRR 2017
– Improves PointNet's recognition of fine-grained patterns and segmentation of complex scenes
– For N input points, generates M feature points for classification; segmentation requires upsampling to provide information for all the input points
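As a toy illustration of the PointNet idea above (a shared per-point MLP followed by a symmetric max pooling), here is a minimal numpy sketch; the layer sizes and random weights are illustrative, not the trained network:

```python
import numpy as np

rng = np.random.default_rng(0)

def shared_mlp(points, w1, w2):
    """Apply the same two-layer MLP to every point (weights shared across points)."""
    h = np.maximum(points @ w1, 0.0)   # ReLU hidden layer, shape (N, 64)
    return np.maximum(h @ w2, 0.0)     # per-point local feature, shape (N, C)

N, C = 8, 32
xyz = rng.normal(size=(N, 3))          # toy point cloud
w1 = 0.1 * rng.normal(size=(3, 64))    # random (untrained) weights, for illustration
w2 = 0.1 * rng.normal(size=(64, C))

local_feats = shared_mlp(xyz, w1, w2)  # N local features, one per input point
global_feat = local_feats.max(axis=0)  # symmetric max pooling -> single global feature
# Concatenating local + global features feeds the segmentation-style head
per_point = np.concatenate([local_feats, np.tile(global_feat, (N, 1))], axis=1)

# The global feature is invariant to the ordering of the input points
perm = rng.permutation(N)
assert np.allclose(shared_mlp(xyz[perm], w1, w2).max(axis=0), global_feat)
```

The max pooling is what makes the network a set function: any permutation of the input points yields the same global feature.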
VoteNet – point cloud feature learning backbone
- 4 set abstraction layers
– Subsampling: 2048, 1024, 512, 256 points
- 2 upsampling layers
– Upsampling to 1024 points with C = 256 feature channels
– Features on the input points are interpolated to the output points (weighted average of the 3 nearest input-point features)
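The 3-nearest-neighbour feature interpolation used by the upsampling layers can be sketched in numpy as follows (an approximation with my own function and variable names, not the authors' code):

```python
import numpy as np

def interpolate_features(src_xyz, src_feat, dst_xyz, k=3, eps=1e-8):
    """Inverse-distance weighted average of the k nearest source features."""
    # Pairwise distances between destination and source points, shape (M, N)
    d = np.linalg.norm(dst_xyz[:, None, :] - src_xyz[None, :, :], axis=-1)
    idx = np.argsort(d, axis=1)[:, :k]                 # k nearest sources per dst point
    nd = np.take_along_axis(d, idx, axis=1)            # their distances
    w = 1.0 / (nd + eps)                               # closer points weigh more
    w /= w.sum(axis=1, keepdims=True)                  # normalize weights per dst point
    return (src_feat[idx] * w[..., None]).sum(axis=1)  # interpolated features (M, C)

src_xyz = np.array([[0., 0., 0.], [1., 0., 0.], [0., 1., 0.], [0., 0., 1.]])
src_feat = np.eye(4)                  # one distinctive feature per source point
dst_xyz = np.array([[0., 0., 0.]])    # coincides with the first source point
out = interpolate_features(src_xyz, src_feat, dst_xyz)
# out[0] is dominated by the feature of the coincident source point
```

A destination point that coincides with a source point receives (almost exactly) that point's feature, since its inverse-distance weight dominates.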
Hough voting
- Deep Learning of Local RGB-D Patches for 3D Object Detection and 6D Pose Estimation, Kehl et al. ECCV 2016
- Sampling the image generates patches
- A network regresses features used for k-NN search
- A codebook contains pre-computed associations between features and 6D object poses
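The codebook scheme of Kehl et al. can be caricatured as a k-NN lookup over stored feature/pose pairs; this is a hypothetical numpy sketch, not their implementation:

```python
import numpy as np

def codebook_lookup(query_feat, codebook_feats, codebook_poses, k=3):
    """Return the k stored poses whose descriptors are nearest to the query feature."""
    d = np.linalg.norm(codebook_feats - query_feat, axis=1)  # distance to every entry
    nearest = np.argsort(d)[:k]                              # indices of k best matches
    return codebook_poses[nearest]                           # candidate 6D poses

feats = np.eye(3)                         # toy descriptors, one per codebook entry
poses = np.arange(18.0).reshape(3, 6)     # a made-up 6D pose per entry
candidates = codebook_lookup(np.array([0.9, 0.1, 0.0]), feats, poses, k=2)
```

This is the kind of explicit nearest-neighbour search that VoteNet's learned voting replaces.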
VoteNet - voting
– A deep network generates votes directly from the input features
- More efficient than k-NN lookups
- An MLP with fully connected layers
- For each seed, one vote is generated independently of the others
– A vote is the 3D offset of the object center relative to the feature position
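A minimal sketch of such a voting module, assuming a single shared fully connected layer with random (untrained) weights; the names and layer sizes are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)

def vote(seed_xyz, seed_feat, w1, w2):
    """Each seed independently regresses a 3D offset toward an object center,
    plus a feature residual; the vote position is seed position + offset."""
    h = np.maximum(seed_feat @ w1, 0.0)        # shared fully connected layer + ReLU
    out = h @ w2                               # (M, 3 + C): offset and residual
    offset, feat_res = out[:, :3], out[:, 3:]
    return seed_xyz + offset, seed_feat + feat_res

M, C, H = 6, 16, 32
seed_xyz = rng.normal(size=(M, 3))             # toy seed positions
seed_feat = rng.normal(size=(M, C))            # toy seed features
w1 = 0.1 * rng.normal(size=(C, H))             # random weights, for illustration
w2 = 0.1 * rng.normal(size=(H, 3 + C))
vote_xyz, vote_feat = vote(seed_xyz, seed_feat, w1, w2)

# Votes are per-seed: voting on a subset gives the same result for those seeds
vx_sub, _ = vote(seed_xyz[:3], seed_feat[:3], w1, w2)
assert np.allclose(vx_sub, vote_xyz[:3])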
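```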
VoteNet – object proposal and classification
- Sampling and grouping
– Votes are divided into K clusters by spatial clustering
- Classification, object location, and boundaries
– A PointNet-like network aggregates the votes in order to generate object proposals
– Output: a set of object proposals, each with
- Objectness score
- Bounding box parameters (center, heading, scale)
- Semantic classification score
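The sampling-and-grouping step can be approximated with farthest point sampling plus radius grouping, as in this simplified numpy stand-in for the paper's implementation:

```python
import numpy as np

def farthest_point_sample(xyz, k):
    """Greedily pick k well-spread points to serve as cluster centers."""
    chosen = [0]
    dist = np.full(len(xyz), np.inf)
    for _ in range(k - 1):
        # Track each point's distance to its nearest already-chosen center
        dist = np.minimum(dist, np.linalg.norm(xyz - xyz[chosen[-1]], axis=1))
        chosen.append(int(np.argmax(dist)))    # next center: the farthest point
    return np.array(chosen)

def group_votes(votes, center_idx, radius):
    """Collect, for each center, the indices of votes lying within `radius`."""
    centers = votes[center_idx]
    d = np.linalg.norm(votes[None, :, :] - centers[:, None, :], axis=-1)
    return [np.nonzero(row <= radius)[0] for row in d]

# Two toy vote blobs, as if two objects attracted votes to their centers
votes = np.array([[0., 0., 0.], [0.1, 0., 0.], [0., 0.1, 0.],
                  [5., 0., 0.], [5.1, 0., 0.], [5., 0.1, 0.]])
centers = farthest_point_sample(votes, 2)       # one center lands in each blob
clusters = group_votes(votes, centers, radius=0.5)
```

Each cluster then goes to the proposal network, which outputs one object proposal per cluster.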
Indoor evaluation datasets: description
- ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes, Dai et al. CVPR 2017
– Available at: http://www.scan-net.org/ScanNet/
– RGB-D data from real-world environments
– 2.5 million views, 1513 scans, 707 spaces
– Annotated with 3D camera poses, surface reconstructions, CAD models, and instance-level semantic segmentations
- SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite, Song et al. CVPR 2015
– Sensors: Asus Xtion, Intel RealSense, Microsoft Kinect
– Available at: http://rgbd.cs.princeton.edu/
– 10,335 RGB-D images (including NYU, B3DO, SUN3D)
– 64,595 annotated 3D bounding boxes
– 800 object categories, 47 scene categories
– Built using SIFT + RANSAC + point-to-plane ICP
Object detection results on SUN RGB-D set
- Evaluation metric: mean Average Precision (mAP)
– Intersection over Union (IoU) for thresholding correctly matched objects
- 5000 RGB-D training images with amodal oriented 3D bounding boxes for 37 object categories.
- The VoteNet model is 4x smaller than the F-PointNet model
- Evaluated with 3D IoU threshold 0.25
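For intuition, 3D IoU for axis-aligned boxes can be computed as below; note that the paper evaluates oriented boxes, so this numpy sketch is a simplification:

```python
import numpy as np

def iou_3d(box_a, box_b):
    """IoU of two axis-aligned 3D boxes, each (xmin, ymin, zmin, xmax, ymax, zmax)."""
    lo = np.maximum(box_a[:3], box_b[:3])          # intersection lower corner
    hi = np.minimum(box_a[3:], box_b[3:])          # intersection upper corner
    inter = np.prod(np.clip(hi - lo, 0.0, None))   # zero if the boxes are disjoint
    vol_a = np.prod(box_a[3:] - box_a[:3])
    vol_b = np.prod(box_b[3:] - box_b[:3])
    return inter / (vol_a + vol_b - inter)         # intersection / union

a = np.array([0., 0., 0., 2., 2., 2.])
b = np.array([1., 0., 0., 3., 2., 2.])   # half-overlapping copy of a
```

A detection counts as correct when its IoU with a ground-truth box of the same class exceeds the threshold (0.25 here); mAP is then averaged over categories.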
Object detection results - SUN RGB-D dataset
Object detection results on ScanNetV2 set
- 1200 training examples, hundreds of rooms, 18 object categories
- VoteNet used uncolored point clouds, while the other methods also used color
Comparing VoteNet with no-vote baseline
- BoxNet
– VoteNet: bounding boxes are estimated from vote clusters
– BoxNet: proposes boxes directly from seed points on object surfaces
- In sparse 3D point clouds, some points are far from the object centroids
– Direct proposal lowers their confidence
– Voting allows hypotheses generated from these points to be reinforced (if their votes are close to the centroids)
- Seeds that, if sampled, would generate accurate bounding boxes (ScanNet scene)
Voting: further analysis
- Average object-center distance
- Comparing VoteNet with no-vote baseline
- Voting accuracy gain of VoteNet
- ScanNet scene with votes coming from object points