Deformation-Aware 3D Model Embedding and Retrieval
Mikaela Uy 1 Jingwei Huang 1 Minhyuk Sung 2 Tolga Birdal 1 Leo Guibas 1
Stanford University 1 Adobe Research 2
Motivation
(a) Real Scan (b) CAD Model (c) Overlay
Photo taken from [1].
[1] End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans. Avetisyan et al., ICCV 2019.
Retrieve: Query Model → Closest Model. Chamfer Distance: 4.45×10^[?]
Retrieve, then deform: Query Model → Ours Retrieved (Chamfer Distance: 7.09×10^[?] ↑) → Ours Deformed (Chamfer Distance: 1.71×10^[?] ↓)
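The Chamfer distance quoted on these slides can be illustrated with a small sketch. It assumes one common convention (mean squared nearest-neighbour distance, summed over both directions); the slides do not state which convention the talk uses, and the helper names `chamfer_distance` and `retrieve_closest` are hypothetical.

```python
def chamfer_distance(a, b):
    """Symmetric Chamfer distance between two point sets (lists of 3D tuples).

    Assumed convention: mean squared nearest-neighbour distance, summed over
    both directions. Other conventions (unsquared, max instead of mean) exist.
    """
    def one_way(src, dst):
        total = 0.0
        for p in src:
            # squared distance from p to its nearest neighbour in dst
            total += min(sum((pi - qi) ** 2 for pi, qi in zip(p, q)) for q in dst)
        return total / len(src)

    return one_way(a, b) + one_way(b, a)


def retrieve_closest(query, database):
    """Return (index, distance) of the database model closest to the query."""
    distances = [chamfer_distance(query, model) for model in database]
    best = min(range(len(database)), key=distances.__getitem__)
    return best, distances[best]
```

This brute-force version is quadratic in the number of points, which is fine for a toy example but would normally be replaced by a spatial index for real point clouds.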
Input (3D Model / Scan / Image) → Retrieval from 3D Model Database (TurboSquid, 3D Warehouse) → Retrieved Model → Deform → Deformed Model
Deform $\mathcal{E}$: Source $\mathbf{t}$ + Target $\mathbf{u}$ → Deformed Source $\mathcal{E}(\mathbf{t}; \mathbf{u})$
Introduce constraints/regularizations in the deformation, preventing a perfect fit to the target.
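Source $\mathbf{t} \in \mathbb{R}^{N \times 3}$, Target $\mathbf{u} \in \mathbb{R}^{N \times 3}$
→ PointNet Encoder (shared) → MLPs (shared)
→ $\mathcal{F}(\mathbf{t}) \in \mathbb{R}^{k}$, $\mathcal{F}(\mathbf{u}) \in \mathbb{R}^{k}$, $\mathcal{H}(\mathbf{t}) \in \mathbb{T}_{+}^{k}$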
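Target $\mathbf{u}$, Candidate Sources $\mathbf{Y}_{\mathbf{u}}$, Positives $\mathbf{Q}_{\mathbf{u}}$, Negatives $\mathbf{O}_{\mathbf{u}}$
$\mathbf{Q}_{\mathbf{u}} = \{\mathbf{t} \in \mathbf{Y}_{\mathbf{u}} \mid f(\mathbf{t}, \mathbf{u}) \le \tau_{Q}\}$
The loss combines terms over the negatives $\mathbf{o} \in \mathbf{O}_{\mathbf{u}}$ and over the positives $\mathbf{q} \in \mathbf{Q}_{\mathbf{u}}$, the latter in $\varepsilon(\mathbf{q}; \mathbf{u})$.
We precompute the fitting gap ($f$).
[2] FaceNet: A Unified Embedding for Face Recognition and Clustering. Schroff et al., CVPR 2015.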
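As a rough illustration of splitting candidate sources by their precomputed fitting gaps, the sketch below thresholds the gaps into positives and negatives and then evaluates a FaceNet-style triplet hinge [2]. The negative threshold `tau_neg`, the `margin` value, and both function names are assumptions made for this example; the exact loss shown in the talk is not reproduced here.

```python
def split_candidates(fitting_gaps, tau_pos, tau_neg):
    """Split candidate indices by fitting gap.

    Positives have gap <= tau_pos (as on the slide); negatives are assumed
    here to have gap > tau_neg, which is a hypothetical second threshold.
    """
    positives = [i for i, gap in enumerate(fitting_gaps) if gap <= tau_pos]
    negatives = [i for i, gap in enumerate(fitting_gaps) if gap > tau_neg]
    return positives, negatives


def triplet_margin_loss(distances, positives, negatives, margin):
    """FaceNet-style hinge averaged over all (positive, negative) pairs.

    distances[i] is the embedding distance from candidate i to the target.
    """
    terms = [
        max(0.0, distances[p] - distances[n] + margin)
        for p in positives
        for n in negatives
    ]
    return sum(terms) / len(terms) if terms else 0.0
```

Candidates whose gap falls between the two thresholds are simply left out of both sets in this sketch.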
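Embedding distance from the network outputs $\mathcal{F}(\mathbf{u}), \mathcal{F}(\mathbf{t}) \in \mathbb{R}^{k}$ and $\mathcal{H}(\mathbf{t}) \in \mathbb{T}_{+}^{k}$ (PNet encoders followed by MLPs):
$\varepsilon(\mathbf{t}; \mathbf{u}) = (\mathcal{F}(\mathbf{u}) - \mathcal{F}(\mathbf{t}))^{\top} \mathcal{H}(\mathbf{t}) (\mathcal{F}(\mathbf{u}) - \mathcal{F}(\mathbf{t}))$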
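To make the role of the source-dependent matrix concrete, here is a minimal sketch of the quadratic form above in plain Python. The embeddings and matrices are made-up numbers, and `embedding_distance` is a hypothetical helper name; in the talk these quantities come from the network.

```python
def embedding_distance(f_t, f_u, h_t):
    """Quadratic form (F(u) - F(t))^T H(t) (F(u) - F(t)).

    f_t, f_u: k-dimensional embeddings (lists of floats) of source and target.
    h_t: k x k matrix (list of rows) attached to the source t.
    """
    k = len(f_t)
    diff = [u - t for t, u in zip(f_t, f_u)]
    # matrix-vector product H(t) @ diff
    h_diff = [sum(h_t[i][j] * diff[j] for j in range(k)) for i in range(k)]
    return sum(d * hd for d, hd in zip(diff, h_diff))


# Made-up example: two shapes a and b with different matrices.
f_a, h_a = [0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]]
f_b, h_b = [1.0, 2.0], [[4.0, 0.0], [0.0, 1.0]]

dist_a_as_source = embedding_distance(f_a, f_b, h_a)  # uses H(a)
dist_b_as_source = embedding_distance(f_b, f_a, h_b)  # uses H(b)
```

Because each shape carries its own matrix, swapping the roles of source and target generally changes the value, whereas with the identity matrix the form reduces to the squared Euclidean distance and is symmetric.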
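Following Stochastic Neighbor Embedding [3], the fitting gaps of the candidate sources $\mathbf{Y}_{\mathbf{u}}$ of a target $\mathbf{u}$ are converted to probabilities:
$p(\mathbf{t}; \mathbf{u}) = \frac{\exp(-f^{2}(\mathbf{t}; \mathbf{u}) / 2\tau^{2})}{\sum_{\mathbf{t}' \in \mathbf{Y}_{\mathbf{u}}} \exp(-f^{2}(\mathbf{t}'; \mathbf{u}) / 2\tau^{2})}$
We precompute the fitting gap ($f$).
[3] Stochastic Neighbor Embedding. Hinton et al., NeurIPS 2002.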
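The conversion of fitting gaps into a probability distribution over candidate sources can be sketched as follows. This uses the standard Stochastic Neighbor Embedding kernel [3] (squared value over twice a squared bandwidth); the function name `gap_probabilities` and the bandwidth value are assumptions for illustration, not values from the talk.

```python
import math


def gap_probabilities(fitting_gaps, tau):
    """SNE-style probabilities over candidate sources.

    Each gap f is mapped to exp(-f^2 / (2 * tau^2)) and the results are
    normalized to sum to one, so smaller gaps receive larger probability.
    """
    weights = [math.exp(-(gap ** 2) / (2.0 * tau ** 2)) for gap in fitting_gaps]
    total = sum(weights)
    return [w / total for w in weights]


# Example with three candidate sources; the first fits the target best.
probs = gap_probabilities([0.5, 1.0, 2.0], tau=1.0)
```

A smaller bandwidth concentrates the probability mass on the best-fitting candidates, while a larger one flattens the distribution.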
Chamfer Distance Before Deformation: $e(\mathbf{t}, \mathbf{u})$
Chamfer Distance After Deformation: $f(\mathbf{t}, \mathbf{u}) = e(\mathcal{E}(\mathbf{t}; \mathbf{u}), \mathbf{u})$
$\mathcal{H}$ is fixed to identity: symmetric embedding distance.
$\mathcal{H}$ is source-dependent: asymmetric embedding distance.
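The relation between the distance before deformation and the fitting gap after deformation can be tried out on a toy case. The sketch below swaps in a deliberately simple constrained deformation (per-axis bounding-box scaling), which is not the deformation used in the talk; `deform_bbox_scale` and `fitting_gap` are hypothetical names, and the Chamfer convention is the same assumed one (mean squared nearest-neighbour distance, both directions).

```python
def chamfer_distance(a, b):
    """Mean squared nearest-neighbour distance, summed over both directions."""
    def one_way(src, dst):
        return sum(
            min(sum((pi - qi) ** 2 for pi, qi in zip(p, q)) for q in dst)
            for p in src
        ) / len(src)
    return one_way(a, b) + one_way(b, a)


def deform_bbox_scale(source, target):
    """Hypothetical constrained deformation E(t; u): scale and translate each
    axis so the source bounding box matches the target bounding box."""
    dims = range(len(source[0]))
    s_min = [min(p[d] for p in source) for d in dims]
    s_max = [max(p[d] for p in source) for d in dims]
    t_min = [min(p[d] for p in target) for d in dims]
    t_max = [max(p[d] for p in target) for d in dims]
    deformed = []
    for p in source:
        q = []
        for d in dims:
            extent = s_max[d] - s_min[d]
            # leave degenerate (flat) axes unscaled
            scale = (t_max[d] - t_min[d]) / extent if extent > 0 else 1.0
            q.append(t_min[d] + (p[d] - s_min[d]) * scale)
        deformed.append(tuple(q))
    return deformed


def fitting_gap(source, target, deform=deform_bbox_scale):
    """f(t, u) = e(E(t; u), u): Chamfer distance after deforming the source."""
    return chamfer_distance(deform(source, target), target)


# Unit square (source) vs. a 2 x 1 rectangle with an extra centre point (target).
square = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0)]
rect = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0), (0.0, 1.0, 0.0), (2.0, 1.0, 0.0),
        (1.0, 0.5, 0.0)]
before = chamfer_distance(square, rect)
after = fitting_gap(square, rect)
```

In this toy case the constrained deformation reduces the distance but cannot reach zero, because scaling alone cannot create the extra centre point of the target.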
[4] ShapeNet: An Information-Rich 3D Model Repository. Chang et al., arXiv 2015.
[5] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Qi et al., CVPR 2017.
[6] As-rigid-as-possible surface modeling. Sorkine et al., SGP 2007.
Mean Chamfer Distance (×10^[?]) for the best of the top 3 retrieval (* = smallest, † = second smallest)
Before Deformation (B.D.): $e(\mathbf{t}, \mathbf{u})$
After Deformation (A.D.): $f(\mathbf{t}, \mathbf{u}) = e(\mathcal{E}(\mathbf{t}; \mathbf{u}), \mathbf{u})$

Method         B.D.      A.D.
Ranked CD      3.025*    1.104
AE             3.188†    1.116
CD-Margin      3.321     1.168
CD-Reg         5.057     2.108
Symm-Margin    3.537     1.092
Symm-Reg       4.649     1.657
Ours-Margin    3.587     1.076†
Ours-Reg       3.650     0.984*

CD-*, Symm-* and Ours-*: PointNet encoder is used for the embedding space.
*-Margin: Margin-loss is used. *-Reg: Reg-loss is used.
Slide annotations for these methods: Fitting gap, Egocentric distance field.
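The "best of the top 3 retrieval" score in the table can be sketched as follows: rank the candidates by the learned embedding distance, keep the top k, and report the smallest Chamfer distance among them, averaged over queries. The function names and the toy numbers are assumptions for illustration; they are not the talk's evaluation code or data.

```python
def best_of_top_k(embedding_distances, chamfer_distances, k=3):
    """Smallest Chamfer distance among the k candidates ranked first by
    embedding distance (smaller embedding distance = better rank)."""
    order = sorted(range(len(embedding_distances)),
                   key=embedding_distances.__getitem__)
    return min(chamfer_distances[i] for i in order[:k])


def mean_best_of_top_k(all_embedding_distances, all_chamfer_distances, k=3):
    """Average of best_of_top_k over all queries."""
    scores = [
        best_of_top_k(emb, cd, k)
        for emb, cd in zip(all_embedding_distances, all_chamfer_distances)
    ]
    return sum(scores) / len(scores)


# Two toy queries with four database candidates each.
emb = [[0.9, 0.1, 0.5, 0.3], [0.2, 0.4, 0.6, 0.8]]
cd = [[0.7, 2.0, 1.5, 1.1], [3.0, 0.5, 2.0, 0.1]]
score = mean_best_of_top_k(emb, cd, k=3)
```

Passing the Chamfer distances before deformation gives a B.D.-style score, and passing the fitting gaps after deformation gives an A.D.-style score.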
Method         Ranking ↓    Recall ↑
Ranked CD      12.32        51.20
AE             12.10        52.15
CD-Margin      14.27        48.06
CD-Reg         39.97        21.02
Symm-Margin    10.61        57.50
Symm-Reg       28.33        38.64
Ours-Margin    9.34         60.94
Ours-Reg       7.06         70.36
Qualitative results (Chair, Sofa, Table, Car): for each Query, the Retrieved and Deformed models of Ranked CD, AE, CD Margin, Ours Margin and Ours Reg.
Results on scan input: for each Input Scan, the Retrieved and Deformed models of AE, Ours Margin, Ours Reg and Human.
Results on image input: Input Image, P2M++ Pred [7], Ours Retrieved, Ours Deformed, GT Shape.
[7] Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation. Wen et al., ICCV 2019.