Sparse 3D Convolutional Neural Networks for Large-Scale Shape Retrieval
Alexandr Notchenko, Ermek Kapushev, Evgeny Burnaev {avnotchenko,kapushev,burnaevevgeny}@gmail.com & 3D Deep Learning Workshop at NIPS 2016
Sparse 3D Convolutional Neural Networks for Large-Scale Shape - - PowerPoint PPT Presentation
Sparse 3D Convolutional Neural Networks for Large-Scale Shape Retrieval Alexandr Notchenko , Ermek Kapushev, Evgeny Burnaev {avnotchenko,kapushev,burnaevevgeny}@gmail.com & 3D Deep Learning Workshop at NIPS 2016 3D Shape representations
Alexandr Notchenko, Ermek Kapushev, Evgeny Burnaev {avnotchenko,kapushev,burnaevevgeny}@gmail.com & 3D Deep Learning Workshop at NIPS 2016
Regular size, good to go in CNN Irregular size, not clear how to use in NN
Regular size, good to go in CNN Irregular size, not clear how to use in NN Not really 3D, 2D CNNs are powerful enough already
Mean sparsity for all classes of ModelNet40 train dataset at voxel resolution 40 equal to 5.5%.
http://www2.warwick.ac.uk/fac/sci/statistics/staff/academic-research/graham/bmvc.pdf
formerly: Associate Professor at Warwick University now at Facebook AI Research, Paris Lab
Precomputed feature vector of dataset. (Vcar , Vperson ,...) Vplane - feature vector
Sparse3DCNN
Query Retrieved items Cosine distance
Constant Learning Rate = 0.002 Can finish learning when all samples
Optimisation algorithm: Nesterov Accelerated Gradient with momentum = 0.99 Can finish learning when all samples
method Classification Retrieval AUC Retrieval mAP 3DShapeNet 77.32% 49.94% 49.23% MVCNN 90.10%
3DSCNN 90.3% 47.30% 45.16% S3DCNN + triplet
46.71%
Algorithm ModelNet40 Classification ModelNet40 Retrieval (mAP) Geometry Image [13] 83.9% 51.3% Set-convolution [11] 90% 3D-GAN [10] 83.3% VRN Ensemble [9] 95.54% FusionNet [7] 90.8% Pairwise [6] 90.7% MVCNN [3] 90.1% 79.5% GIFT [5] 83.10% 81.94% VoxNet [2] 83% DeepPano [4] 77.63% 76.81% 3DShapeNets [1] 77% 49.2%
[1] Z. Wu, S. Song, A. Khosla, F. Yu, L. Zhang, X. Tang and J. Xiao. 3D ShapeNets: A Deep Representation for Volumetric Shapes. CVPR2015. [2] D. Maturana and S. Scherer. VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition. IROS2015. [3] H. Su, S. Maji, E. Kalogerakis, E. Learned-Miller. Multi-view Convolutional Neural Networks for 3D Shape Recognition. ICCV2015. [4] B Shi, S Bai, Z Zhou, X Bai. DeepPano: Deep Panoramic Representation for 3-D Shape
[5] Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, Longin Jan Latecki. GIFT: A Real-time and Scalable 3D Shape Search Engine. CVPR 2016. [6] Edward Johns, Stefan Leutenegger and Andrew J. Davison. Pairwise Decomposition of Image Sequences for Active Multi-View Recognition CVPR 2016. [7] Vishakh Hegde, Reza Zadeh 3D Object Classification Using Multiple Data Representations. [8] Nima Sedaghat, Mohammadreza Zolfaghari, Thomas Brox Orientation-boosted Voxel Nets for 3D Object Recognition. [9] Andrew Brock, Theodore Lim, J.M. Ritchie, Nick Weston Generative and Discriminative Voxel Modeling with Convolutional Neural Networks. [10] Jiajun Wu, Chengkai Zhang, Tianfan Xue, William T. Freeman, Joshua B. Tenenbaum. Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial
[11] Siamak Ravanbakhsh, Jeff Schneider, Barnabas Poczos. Deep Learning with sets and point clouds [12] A. Garcia-Garcia, F. Gomez-Donoso†, J. Garcia-Rodriguez, S. Orts-Escolano, M. Cazorla, J. Azorin-Lopez PointNet: A 3D Convolutional Neural Network for Real-Time Object Class Recognition [13] Ayan Sinha, Jing Bai, Karthik Ramani Deep Learning 3D Shape Surfaces Using Geometry Images ECCV 2016
Alexandr Notchenko, Ermek Kapushev, Evgeny Burnaev