PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes Rundi Wu - - PowerPoint PPT Presentation

pq net a generative part seq2seq network for 3d shapes
SMART_READER_LITE
LIVE PREVIEW

PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes Rundi Wu - - PowerPoint PPT Presentation

PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes Rundi Wu Yixin Zhuang Kai Xu Hao Zhang Baoquan Chen 1,4 1 2 3,4 1,4 1 Center on Frontiers of Computing Studies, Peking University National University of Defense Technology


slide-1
SLIDE 1

PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes

Rundi Wu Yixin Zhuang Kai Xu Hao Zhang Baoquan Chen

Center on Frontiers of Computing Studies, Peking University National University of Defense Technology Simon Fraser University AICFVE, Beijing Film Academy

1 2 3 4 1,4 1 2 3,4 1,4 1
slide-2
SLIDE 2

3D shape generation

Voxel grid Point cloud Implicit function Mesh

  • 1. J. Wu, C. Zhang, T. Xue, B. Freeman, and J. Tenenbaum. Learning a probabilistic latent space of object shapes via 3d generative-adversarial modeling. In Advances in Neural In- formation Processing Systems, pages 82–90, 2016.
  • 2. G. Yang, X. Huang, Z. Hao, M.-Y. Liu, S. Belongie, and B. Hariharan. Pointflow: 3d point cloud generation with con- tinuous normalizing flows. 2019 IEEE International Conference on Computer Vision (ICCV).
  • 3. T. Groueix, M. Fisher, V. G. Kim, B. C. Russell, and M. Aubry. A papier-maˆche ́ approach to learning 3d surface generation. In Proc. CVPR, pages 216–224, 2018.
  • 4. J.J.Park,P.Florence,J.Straub,R.Newcombe,andS.Love- grove. DeepSDF: Learning continuous signed distance func- tions for shape representation. In CVPR, 2019.
2

[3DGAN, NIPS 2016] [Pointflow, ICCV 2019] [AtlasNet, CVPR 2018] [DeepSDF, CVPR 2019]

slide-3
SLIDE 3

Structural 3D shape generation

[GRASS, SIG 2017] [G2L, SIGA 2018] [StructureNet, SIGA 2019]

  • 1. J. Li, K. Xu, S. Chaudhuri, E. Yumer, H. Zhang, and L. Guibas. Grass: Generative recursive autoencoders for shape structures. ACM Trans. on Graph. (SIGGRAPH), 2017.
  • 2. C. Zou, E. Yumer, J. Yang, D. Ceylan, and D. Hoiem. 3D- PRNN: Generating shape primitives with recurrent neural networks. 2017 IEEE International Conference on Computer Vision (ICCV), Oct 2017.
  • 3. H. Wang, N. Schor, R. Hu, H. Huang, D. Cohen-Or, and H. Huang. Global-to-local generative model for 3d shapes. ACM Transactions on Graphics (Proc. SIGGRAPH ASIA), 37(6):214:1214:10, 2018.
  • 4. K. Mo, P. Guerrero, L. Yi, H. Su, P. Wonka, N. Mitra, and L. J. Guibas. Structurenet: Hierarchical graph networks for 3d shape generation. ACM Trans. on Graph. (SIGGRAPH Asia), 2019.
1 3 4

[3D-PRNN, ICCV 2017]

2 3
slide-4
SLIDE 4

Shape structure presentations

hierarchical part organization linear part order linear string of words phrases nested in phrases

≈ ≈

“ ”

4
slide-5
SLIDE 5
  • Our network, PQ-NET, learns 3D shape representation via sequential part assembly

Generate as a sequence

Random Noise

Input Sequential Generation

RGB Image Partial Shape Depth Map

Z 5
slide-6
SLIDE 6

Method

Inverse Order E D Stop Sign Part Box Parameter Part Geometry Feature Number of parts (one hot) Apply transformation E CNN encoder D Implicit decoder

a) Part Geometry Encoding b) Sequential Part Assembly and Generation

E E D D E D (x, y, z) GRU GRU GRU GRU GRU GRU GRU GRU GRU hz Initial vector
  • a. Apply IM-NET to encode each scaled part’s geometry
  • b. Model sequential part assembly using a Sequence-to-Sequence Auto-encoder (Seq2Seq AE)
6
slide-7
SLIDE 7 E CNN encoder D Implicit decoder E D (x, y, z) 7

Method - Part geometry encoding

Similar architecture as IM-NET :

  • a CNN encoder maps 64^3 voxelized part to 128D vector
  • a MLP decoder that predicts the occupancy of a given point

e

d

P p

ground truth signed function A set of sampled points from P

  • 1. Z. Chen and H. Zhang. Learning implicit fields for generative shape modeling. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
1
slide-8
SLIDE 8

Method - Seq2Seq AE

Part Box Parameter : 6D, position + size Part Geometry Feature : latent vector encoded by IM-NET Stacked GRU Cell Number of parts in one-hot representation

hr hr

1

hr

2

h0 h1 h2

Part 1 Part 2 Part k Part k Part k-1 Part 1 Inverse Order

Encoder :

  • a bidirectional stacked RNN to encode part sequence
8
slide-9
SLIDE 9

Method - Seq2Seq AE

Structure Geometry

I0 hS hS

1

hS

2

hG hG

1

hG

2

Decoder :

  • a stacked RNN to predict geometry and structure feature separately
9

Part Box Parameter : 6D, position + size Part Geometry Feature : latent vector to be decoded by IM-NET GRU Cell Stop sign: a confidence value between 0~1

I0

Initial input: zero vector

slide-10
SLIDE 10

Method - Seq2Seq AE

10

Training losses

  • MSE loss on the reconstruction of geometry feature and structure feature
  • Binary Cross Entropy loss on the stop sign predicted by decoder
Inverse Order GRU GRU GRU GRU GRU GRU GRU GRU GRU hz Initial vector Stop Sign Part Box Parameter Part Geometry Feature Number of parts (one hot)
slide-11
SLIDE 11

Results : shape auto-encoding

a) Ground Truth b) IM-NET-256 c) Ours-256 11
slide-12
SLIDE 12

Results : shape generation

a) Ours b) IM-NET c) StructureNet 12
slide-13
SLIDE 13

Results : shape generation

13
slide-14
SLIDE 14

Results : latent space interpolation

14
slide-15
SLIDE 15

Results : single view reconstruction

a) Input image d) Ground Truth c) Ours b) IM-NET 15
slide-16
SLIDE 16

Results : comparison to 3D-PRNN

a) Input Depth Map b) 3D-PRNN c) Ours d) GT
  • Shape reconstruction from single depth image
  • Compare on two orders: (A) PartNet default (B) enforced top-down
16
slide-17
SLIDE 17

Results : applications

Input Order Output Order Output Shape
  • Order denosing and part correspondence
  • Re-train the model the correct the input order
17
slide-18
SLIDE 18

Results : applications

  • Partial shape completion
  • Re-train the model to reconstruct from partial shape input
Partial Input Output Sequence Final Shape 18
slide-19
SLIDE 19

Limitation

  • PQ-NET do not produce part relations
  • Comparing to prior works that seek to hierarchical representation
  • The order of parts could affect the performance
  • A consistent part order over the dataset is required
19
slide-20
SLIDE 20

Thanks!

Code and data: https://github.com/ChrisWu1997/PQ-NET

20