Evgeny Burnaev
Skoltech, ADASE group
Advanced Approaches to Object Recognition and 3D Model Construction from Heterogeneous Data
Joint with Alexander Notchenko
Advanced Approaches to Object Recognition and 3D Model Construction - - PowerPoint PPT Presentation
Advanced Approaches to Object Recognition and 3D Model Construction from Heterogeneous Data Evgeny Burnaev Skoltech, ADASE group Joint with Alexander Notchenko Supervised Deep Learning data Type Supervision 2D Image classification, Class
Joint with Alexander Notchenko
2
Supervision
Class label, object detection box, segmentation contours
3
4
5
6
7
8
9
In Intel el R Real ealSen ense S e Ser eries es As Asus Xt Xtion Pr Pro Mi Micr crosoft K t Kinect v ect v2 St Structure Se Sensor
10
1 1
Me Meth thod Pros (+) +) Co Cons (-) Ma Many 2D pr projections su sust stain su surface texture, The There is a lot
L method hods Re Redundant representation, vu vulnerable to optic illusi sions Vo Voxels si simple, can be sp sparse se, has s vo volumetric properties lo losin ing surface propertie ies Po Point Clo Cloud Ca Can be sparse lo losin ing surface propertie ies an and volumet etric c proper erties es 2. 2.5D 5D im images Ch Cheap measurement devic ices, se sense ses s depth se self occlusi sion of bodies s in a sc scene, a lot of Noise se in me measureme ments
12
1 3
14
Alexandr Notchenko, Ermek Kapushev, Evgeny Burnaev
15
% of their time searching for the right information
Massive and complex CAD models are usually di disorde derly archived in enterprises, which makes design reuse a difficult task
16
17
Precomputed feature vector
(Vcar , Vperson ,...) Vplane - feature vector of plane Query Retrieved Items Cosine distance
Sp Sparse 3D 3D CNN CNN
18
19
20
Autonomous Vehicles AR Robotics
21
22
approximate DR 3D to 2D approximate DR 3D to 2.5D
Domain projection
Learnable Projection (3D Scanner output)
Conceptual / Physical
Heterogeneous Data
Object and relationship params
Physical dynamics Noise
Physical Representation: Shape, Albedo, Location Conditional generators of
Furniture generator Humans generator
...
“Semantic Structure with parameters”
...
24
25
26
PointNet PointNet++ Multi-view Stereo Machine Efficient Point Cloud Generation
2 7
Qi, Charles R., et al. "Pointnet: Deep learning on point sets for 3d classification and segmentation." arXiv preprint arXiv:1612.00593 (2016).
Feature representation
Fully-Connected Deep Neural Networks Column-wise Maximum operation
28
3D Reconstruction loss Point Cloud Parser (RNN + PointNet) reconstruction (3D CNNs, Graph NN - approximate renderer) Classification loss (CrossEntropy) Feature representation Prior on objects and their relative positions
29
3D Reconstruction loss Point Cloud Parser (RNN + PointNet) reconstruction (3D CNNs, Graph NN - approximate renderer) Classification loss (CrossEntropy) Feature representation Prior on objects and their relative positions
Sub-task #2 Sub-task #3 Sub-task #1
30
Φ - is some bijection from one point cloud to another
Earth Mover's Distance Chamfer distance
31
Requirement - generator have to generate a closed mesh, e.g. (|V| + |F| - |E| = 2)
3 2
Joint probability for sets of objects:
3 3
34
Size: 766 Gigabyte! http://buildingparser.stanford.edu/dataset.html
3 5
Size: 1.3 Terabyte!
3 6
3 7