volumetric and multi view cnns for object classification
play

Volumetric and Multi-View CNNs for Object Classification on 3D Data - PowerPoint PPT Presentation

Volumetric and Multi-View CNNs for Object Classification on 3D Data Charles R. Qi*, Hao Su*, Matthias Niener, Angela Dai, MengyuanYan, Leonidas J.Guibas Rich Applications of 3D Augmented Robot Reality Perception 3D Representations for


  1. Volumetric and Multi-View CNNs for Object Classification on 3D Data Charles R. Qi*, Hao Su*, Matthias Nießner, Angela Dai, MengyuanYan, Leonidas J.Guibas

  2. Rich Applications of 3D Augmented Robot Reality Perception

  3. 3D Representations for Generic Object Classification Volumetric Multi-Views 3DShapeNets by Z. Wu et MVCNN by H. Su et al. al. CVPR 15 ICCV 15 VoxNet by D. Maturana et DeepPano by B. Shi et al. al. IEEE/RSJ 15 IEEE/SPL 15

  4. Volumetric CNNs Revisited Volumetric CNNs 3DShapeNets by Z. Wu et al. CVPR 15

  5. Multi-View CNNs Revisited Multi-View CNNs MVCNN by H. Su et al. ICCV 15

  6. Shape Classification Results Revisited 95 90.1% 90 85 77.3% 80 75 70 3DShapeNets MVCNN Wu et al. Su et al.

  7. Shape Classification Results Revisited 95 90.1% 90 85 77.3% 80 Big gap between 75 volumetric and multi-view 70 based methods Why? 3DShapeNets MVCNN Wu et al. Su et al.

  8. Cause 1: Architecture and Engineering LeNet, 1998 AlexNet, 2012

  9. Cause 1: Architecture and Engineering LeNet, 1998 3DShapeNets, 2015 AlexNet, 2012

  10. Cause 2: Resolution Multi-View CNNs MVCNN Su et al. 224x224 Images

  11. Cause 2: Resolution Multi-View CNNs Volumetric CNNs MVCNN Su et al. 3DShapeNets Wu et al. 30x30x30 Volumes 224x224 Images

  12. Diagnosis of Causes: Variable Control • Same resolution, study architectures • Same architecture, look into resolutions

  13. Sphere Rendering Occupancy Grid Image Polygon Mesh 30x30x30 224x224

  14. Sphere Rendering Same “3D Resolution” Occupancy Grid Image Polygon Mesh 30x30x30 224x224

  15. Investigation into Architecture Multi-View Different 3D CNN Image CNN Architecture Same 3D Resolution (30x30x30) Sphere Rendering Occupancy Grid Images Volumes

  16. CNNs with Same 3D Resolution Inputs 88 Shape Classification Accuracy 86 84 82 80 78 76 74 72 MVCNN with Sphere 3DShapeNets Rendering Images Wu et al.

  17. Novel 3D CNN Architectures  3D NIN with Subvolume Supervision Push Harder for Learning Better!

  18. Novel 3D CNN Architectures  Anisotropic Probing Network

  19. Results of Our Novel 3D CNNs 88 Shape Classification Accuracy 86 84 82 80 78 76 74 72 MVCNN with 3DShapeNets Ours 3D CNN Sphere Rendering Wu et al. Images

  20. Results of Our Novel 3D CNNs Closed the Gap under same 3D Resolution 88 Shape Classification Accuracy 86 84 82 80 78 76 74 72 MVCNN with 3DShapeNets Ours 3D CNN Sphere Rendering Wu et al. Images

  21. Investigation into Resolution Multi-View Multi-View Same 3D CNN Image CNN Image CNN Architecture Different 3D Resolution Standard Rendering Sphere Rendering 30x30x30 Images Images Volume

  22. Performance Trend wrt 3D Resolution 94 92 Accuracy (%) 90 88 86 MVCNN-Sphere 84 82 0 50 100 150 200 250 3D Resolution

  23. Performance Trend wrt 3D Resolution 94 92 Accuracy (%) 90 88 86 MVCNN-Sphere 84 Our 3D CNN 82 0 50 100 150 200 250 3D Resolution

  24. Generalization to Real Scans Shape retrieval on scan data Real Scan Dataset 243 objects 12 categories

  25. Volumetric and Multi-View CNNs for Object Classification on 3D Data Code and Data Available Online! http://graphics.stanford.edu/projects/3dcnn/ Welcome to Our Poster #38!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend