Deformation-Aware 3D Model Embedding and Retrieval Mikaela Uy 1 - - PowerPoint PPT Presentation

deformation aware 3d model embedding and retrieval
SMART_READER_LITE
LIVE PREVIEW

Deformation-Aware 3D Model Embedding and Retrieval Mikaela Uy 1 - - PowerPoint PPT Presentation

Stanford University 1 Adobe Research 2 Deformation-Aware 3D Model Embedding and Retrieval Mikaela Uy 1 Jingwei Huang 1 Minhyuk Sung 2 Tolga Birdal 1 Leo Guibas 1 Motivation (a) Real Scan (b) CAD Model (c) Overlay Photo taken from [1] 2 [1]


slide-1
SLIDE 1

Deformation-Aware 3D Model Embedding and Retrieval

Mikaela Uy 1 Jingwei Huang 1 Minhyuk Sung 2 Tolga Birdal 1 Leo Guibas 1

Stanford University 1 Adobe Research 2

slide-2
SLIDE 2

Motivation

2 Photo taken from [1]

[1] End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans. Avetisyan et. al., ICCV 2019.

(a) Real Scan (c) Overlay (b) CAD Model

slide-3
SLIDE 3

Goal

Retrieve Query Model Closest Model Chamfer Distance: 4.45×10!"

3

slide-4
SLIDE 4

Goal

Query Model Ours Retrieved Ours Deformed Deform Retrieve Chamfer Distance: 7.09×10!" ↑ Chamfer Distance: 1.71×10!" ↓

4

slide-5
SLIDE 5

Problem

TurboSquid

Input

3D Model Scan Image

/ / 3D Model Database Retrieval Deform

Retrieved Model Deformed Model

3D Warehouse 5

slide-6
SLIDE 6

Fitting Gap

  • Deformations introduce

constraints/regularizations that ensure plausible variations without losing the original CAD model features.

  • Fitting gap 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮):

Fitting distance (𝑒) after deforming a database shape (𝐭) to the query (𝐮) using deformation function 𝒠.

6

=

Deform 𝓔 Introduce constraints/regularizations preventing the perfect fitting.

Source 𝐭 Target 𝐮 Deformed Source 𝓔 𝐭; 𝐮

slide-7
SLIDE 7
  • Fitting gap measures the distance in

the real space.

  • Properties of fitting gap:
  • 1. (Non-negativity) 𝑓𝒠 𝐭, 𝐮 ≥ 0
  • 2. (Identity) 𝑓𝒠 𝐮, 𝐮 = 0
  • 3. (Asymmetry) 𝑓𝒠 𝐭, 𝐮 ≠ 𝑓𝒠 𝐮, 𝐭

Not a metric!

7

Properties of Fitting Gap

= =

Deform Deform

slide-8
SLIDE 8

8

Egocentric Distance Field 𝒣(𝐭)

Query Egocentric Distance Field

  • 𝒣 is source-dependent.
  • 𝒣 is represented with a

positive semi-definite matrix. 𝒣 𝐭 ∈ 𝒯# (𝒣 𝐭 ≽ 0)

slide-9
SLIDE 9

Properties

  • 1. (Non-negativity) 𝜀 𝐭, 𝐮 ≥ 0
  • 2. (Identity) 𝜀 𝐮, 𝐮 = 0
  • 3. (Asymmetry)

𝜀 𝐭, 𝐮 = (ℱ 𝐮 − ℱ 𝐭 ))𝒣(𝐭)(ℱ 𝐮 − ℱ 𝐭 ) 𝜀 𝐮, 𝐭 = (ℱ 𝐭 − ℱ 𝐮 ))𝒣(𝐮)(ℱ 𝐭 − ℱ 𝐮 ) 𝜀 𝐭, 𝐮 ≠ 𝜀 𝐮, 𝐭

9

Distance in Embedding Space 𝜀

𝜀 𝐭; 𝐮 = (ℱ 𝐮 − ℱ 𝐭 ))𝒣(𝐭)(ℱ 𝐮 − ℱ 𝐭 )

slide-10
SLIDE 10

10

Deformation-Aware Embedding

Query Egocentric Distance Field

PointNet Encoder PointNet Encoder shared

MLPs MLPs MLPs

shared

𝑡 ∈ ℝ#×% 𝑢 ∈ ℝ#×% ℱ(𝒖) ∈ ℝ& ℱ(𝒕) ∈ ℝ& 𝒣(𝒕) ∈ 𝕋'

&

𝑓𝒠 𝐭, 𝐮 ~ 𝜀 𝐭; 𝐮 = (ℱ 𝐮 − ℱ 𝐭 ))𝒣(𝐭)(ℱ 𝐮 − ℱ 𝐭 )

slide-11
SLIDE 11

Network Training

  • Margin-loss-based approach

11

P

𝐮 = {𝐭 ∈ 𝐘𝐮|𝑓𝒠 𝐭, 𝐮 ≤ 𝜏7}

N𝐮 = {𝐭 ∈ 𝐘𝐮|𝑓𝒠 𝐭, 𝐮 > 𝜏8} 9

𝐨∈:!

[max

𝐪∈<𝐮 𝜀 𝐪; 𝐮

− 𝜀 𝐨; 𝐮 ) + 𝑛 =

We precompute the fitting gap (𝑓𝒠).

… … … 𝐘𝐮 𝐐𝐮 𝐎𝐮 𝐮

[2] FaceNet: A Unified Embedding for Face Recognition and Clustering. Schroff et. al., CVPR 2015

Target

Candidate Sources Positives Negatives

PNet PNet MLPs MLPs MLPs ℱ(𝒖) ∈ ℝ! ℱ(𝒕) ∈ ℝ! 𝒣(𝒕) ∈ 𝕋"

!

𝜀 𝐭; 𝐮 = (ℱ 𝐮 − ℱ 𝐭 )!𝒣(𝐭)(ℱ 𝐮 − ℱ 𝐭 )

slide-12
SLIDE 12

Network Training

  • Regression-based approach

12

𝑞 𝐭; 𝐮 = exp(−𝑓𝒠 𝐭; 𝐮 /2𝜏A

B)

∑𝐭,∈𝐘𝐮

, exp(−𝑓𝒠 𝐭D; 𝐮 /2𝜏A

B)

̂ 𝑞 𝐭; 𝐮 =

E- 𝐭;𝐮 ∑𝐭,∈𝐘𝐮

, E- 𝐭;𝐮

PNet PNet MLPs MLPs MLPs ℱ(𝒖) ∈ ℝ! ℱ(𝒕) ∈ ℝ! 𝒣(𝒕) ∈ 𝕋"

!

𝜀 𝐭; 𝐮 = (ℱ 𝐮 − ℱ 𝐭 )!𝒣(𝐭)(ℱ 𝐮 − ℱ 𝐭 )

1 |X𝐮

D| 9 𝐭∈H!

,

| ̂ 𝑞 𝐭; 𝐮 − 𝑞 𝐭; 𝐮 |

[3] Stochastic Neighbor Embedding. Hinton et. al., NeurIPS 2002.

We precompute the fitting gap (𝑓𝒠).

𝐘𝐮 … … 𝐮 Target

Candidate Sources

slide-13
SLIDE 13

Summary

  • 1. Fitting gap
  • 2. Egocentric distance

field 𝒣(𝐭)

  • 3. Training approaches:
  • Margin-loss-based
  • Regression-based

13 Deform

=

𝑒( , ) 𝑒( , )

Chamfer Distance Before Deformation 𝒆(𝐭, 𝐮) Chamfer Distance After Deformation 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) 𝓗 is fixed to identity. Symmetric embedding distance 𝓗 is source-dependent. Asymmetric embedding distance

OURS

slide-14
SLIDE 14

Implementation Details

  • Training data: ShapeNet (5 categories)
  • Backbone architecture: PointNet (sampling points over the meshes)
  • Deformation function 𝒠 : Simplified as-rigid-as-possible (ARAP)

14

[4] ShapeNet: An Information-Rich 3D Model Repository. Chang et. al., arXiv 2015. [5] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Qi et. al., CVPR 2017. [6] As-rigid-as-possible surface modeling. Sorkine et. al., SGP 2007.

slide-15
SLIDE 15

Quantitative Results

15

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮)

slide-16
SLIDE 16

Quantitative Results

16

Ranked by Chamfer Distance (Ranked CD)

  • Select the shape

with smallest B.D.

  • No embedding

space.

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-17
SLIDE 17

Quantitative Results

17

Autoencoder (AE)

  • PointNet

autoencoder for reconstruction.

  • Use the bottleneck

layer as the embedding space.

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-18
SLIDE 18

Quantitative Results

18

CD-Margin

Margin-loss is used.

(PointNet encoder is used for the embedding space.)

𝑒( , )

Fitting gap Egocentric distance field

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-19
SLIDE 19

Quantitative Results

19

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 (PointNet encoder is used for the embedding space.)

CD-Reg

Reg-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-20
SLIDE 20

Quantitative Results

20

(PointNet encoder is used for the embedding space.)

Symm-Margin

Margin-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 Symm-Margin 3.537 1.092 Symm-Reg 4.649 1.657

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-21
SLIDE 21

Quantitative Results

21

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 Symm-Margin 3.537 1.092 Symm-Reg 4.649 1.657 (PointNet encoder is used for the embedding space.)

Symm-Reg

Reg-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-22
SLIDE 22

Quantitative Results

22

(PointNet encoder is used for the embedding space.)

Ours-Margin

Margin-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 Symm-Margin 3.537 1.092 Symm-Reg 4.649 1.657 Ours-Margin 3.587 1.076 Ours-Reg 3.650 0.984

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-23
SLIDE 23

Quantitative Results

23

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 Symm-Margin 3.537 1.092 Symm-Reg 4.649 1.657 Ours-Margin 3.587 1.076 Ours-Reg 3.650 0.984 (PointNet encoder is used for the embedding space.)

Ours-Reg

Reg-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-24
SLIDE 24

Quantitative Results

24

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 Symm-Margin 3.537 1.092 Symm-Reg 4.649 1.657 Ours-Margin 3.587 1.076 Ours-Reg 3.650 0.984 (PointNet encoder is used for the embedding space.)

Ours-Reg

Reg-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-25
SLIDE 25

Quantitative Results

25

Before Deformation (B.D.) 𝒆(𝐭, 𝐮) After Deformation (A.D.) 𝑓𝒠 𝐭, 𝐮 = 𝒆(𝓔 𝐭; 𝐮 , 𝐮) Ranked CD 3.025 1.104 AE 3.188 1.116 CD-Margin 3.321 1.168 CD-Reg 5.057 2.108 Symm-Margin 3.537 1.092 Symm-Reg 4.649 1.657 Ours-Margin 3.587 1.076 Ours-Reg 3.650 0.984 (PointNet encoder is used for the embedding space.)

Ours-Reg

Reg-loss is used.

𝑒( , )

Fitting gap Egocentric distance field

(Mean Chamfer Distance ×10"# for the best of the top 3 retrieval) bold = smallest, underline = second smallest

slide-26
SLIDE 26

Quantitative Results

26

Ranking ↓ Recall ↑ Ranked CD 12.32 51.20 AE 12.10 52.15 CD-Margin 14.27 48.06 CD-Reg 39.97 21.02 Symm-Margin 10.61 57.50 Symm-Reg 28.33 38.64 Ours-Margin 9.34 60.94 Ours-Reg 7.06 70.36

slide-27
SLIDE 27

Qualitative Results

AE CD Margin Ours Reg Ours Margin Ranked CD Deformed Retrieved Chair Query Sofa Deformed Retrieved AE CD Margin Ours Reg Ours Margin Ranked CD Table

27

Car

slide-28
SLIDE 28

Qualitative Results

AE CD Margin Ours Reg Ours Margin Ranked CD Deformed Retrieved Chair Query Sofa Deformed Retrieved AE CD Margin Ours Reg Ours Margin Ranked CD Table

28

Car

slide-29
SLIDE 29

Qualitative Results

AE CD Margin Ours Reg Ours Margin Ranked CD Deformed Retrieved Chair Query Sofa Deformed Retrieved AE CD Margin Ours Reg Ours Margin Ranked CD Table

29

Car

slide-30
SLIDE 30

Scan2CAD

AE Ours Reg Ours Margin Human Deformed Retrieved Input Scan Deformed Retrieved Input Scan

30

slide-31
SLIDE 31

Image2CAD

31

Input Image P2M++ Pred Ours Retrieved Ours Deformed GT Shape

[7] Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation. Wen et. al., ICCV 2019.

slide-32
SLIDE 32

https://deformscan2cad.github.io/

Visit our project page!

Deformation-Aware 3D Model Embedding and Retrieval

32