An Introduction to Holistic 3D Reconstruction Yi Ma EECS - - PowerPoint PPT Presentation

an introduction to holistic 3d reconstruction
SMART_READER_LITE
LIVE PREVIEW

An Introduction to Holistic 3D Reconstruction Yi Ma EECS - - PowerPoint PPT Presentation

An Introduction to Holistic 3D Reconstruction Yi Ma EECS Department, UC Berkeley 1 What is 3D Reconstruction? What is its shape? 2 Image Source: Internet Applications of 3D Reconstruction 3 Image Source: Internet Traditional 3D


slide-1
SLIDE 1

An Introduction to Holistic 3D Reconstruction

1

Yi Ma EECS Department, UC Berkeley

slide-2
SLIDE 2

What is 3D Reconstruction?

What is its shape?

Image Source: Internet

2

slide-3
SLIDE 3

Applications of 3D Reconstruction

Image Source: Internet

3

slide-4
SLIDE 4

Traditional 3D Reconstruction Pipeline

Feature Extraction & Matching Multiview Geometry Point Cloud

Image Source: Internet

4

slide-5
SLIDE 5

Are Point Clouds Universal Representation?

A Gear Wheel Scanned by eviXscan 3D Pro Streets Scanned by Velodyne Lidar

Image source: “Diagnostics of machine parts by means of reverse engineering procedures” Image source: Velodyne website

5

Very Difficult to Storing, Computing, Editing, Visualizing, Interact, and Interpret.

slide-6
SLIDE 6

Limitation of Traditional 3D Reconstruction

Textureless Scenes Medium/Large Baseline (Correspondence Fail) Reflection/Transparency Repetitive Patterns

Image source: Internet

6

Multiple Moving Objects

slide-7
SLIDE 7

Data-Driven Learning-Based Approaches

Depth Map Regression

Li, Z., & Snavely, N. (2018)

3D Instance Segmentation

Mousavian, A., et al. (2019)

Voxel Generation

Song, S., et al. (2017)

Pose Estimation

Kehl, Wadim., et al. (2017)

Mesh Generation

Groueix, T., et al. (2018)

Layout Prediction

Zou, C., et al. (2018)

Plane Detection

Liu, C., et al. (2018)

7

slide-8
SLIDE 8
  • Recently research [1] suggests deep encoder-decoder

networks do not perform reconstruction but classification. For data-driven based depth recovery, DNN is not better (or even worse) than nearest neighbors (NN).

Equivalent to Image Classification?

Ground Truth OGN Matryoshka Clustering AtlasNet Retrieval Oracle NN [1]. Tatarchenko, Maxim, et al. “What Do Single-view 3D Reconstruction Networks Learn?.” arXiv preprint arXiv:1905.03678 (2019).

8

slide-9
SLIDE 9

Our World is Full of Structures

  • Man-made environments are rich of structural regularities
  • straight lines
  • smooth curves
  • parallelism
  • orthogonality
  • Symmetry
  • Building code

& grammar…

Image source: Internet

9

slide-10
SLIDE 10

Importance of Structure in Human 3D Perception

10

  • Structure: spatial relationships among multiple points, lines, patches, etc.
  • Human perceives 3D space by recognizing many types of structure in the

scene

Lee et al. Geometric Reasoning for Single Image Structure Recovery. CVPR 2009.

slide-11
SLIDE 11

Exploring Structures for 3D Reconstruction

11

LOCAL: face-edge-vertex graph, smooth curves & surfaces SEMI-GLOBAL: symmetry, parallelism & orthogonality GLOBAL: shape grammar

Image Source: Chen et al., 2007; Pauly et al., 2008 Image Source: https://stuckeman.psu.edu/adapting-modern-architecture-local-context

slide-12
SLIDE 12

Ex Expl ploring ng Local Str truc uctur tures -- What Does a 2D Line Drawing Tell Us about the 3D Geometry?

“Given a single picture … we usually have definite ideas about the 3-D shapes of objects. To do this we need to use assumptions about the world and the image formation process, since there exist a large number of shapes which can produce the same picture.”

  • - Takeo Kanade, 1981

12

[Sinha and Adelson 1993]

slide-13
SLIDE 13

Ex Expl ploring ng Local Str truc uctur tures – Some History of Single Line Drawing Interpretation

  • Prototype-based interpretation: Roberts (1965), Falk (1972), Grape

(1973)

  • Polyhedrons: Huffman (1971), Clowes (1971), Mackworth (1973),

Sugihara (1978), Whiteley (1979, 1982), Draper (1981), Shapira (1985), …

  • Curved objects: Turner (1974), Shapira and Freeman (1979), Lee et al.

(1985), Malik (1987) …

  • Paper-made objects (Origami): Kanade (1980, 1981)
  • Dynamic scenes: Asada et al. (1984)
  • ……

13

slide-14
SLIDE 14

Ex Expl ploring ng Local Str truc uctur tures – Line Labeling [Huffman- Clowes, 1971]

  • Every line in natural pictures of polyhedron objects should have

exactly one of the four labels

  • Convex (+), concave (-), or occluding (à, ß)

14

slide-15
SLIDE 15

Ex Expl ploring ng Local Str truc uctur tures – Junction Dictionary and Consistent Labeling [Huffman-Clowes, 1971]

  • 12 valid configurations for

trihedral vertex

  • L-, Y-, W-types
  • Represents just 11.5% of all possible

configurations

  • T-junction occurs when an edge
  • ccludes another partially.
  • Does not correspond to a three-

dimensional vertex.

15

slide-16
SLIDE 16

Ex Expl ploring ng Local Str truc uctur tures – Junction Dictionary and Consistent Labeling [Huffman-Clowes, 1971]

  • 12 valid configurations for

trihedral vertex

  • L-, Y-, W-types
  • Represents just 11.5% of all possible

configurations

  • T-junction occurs when an edge
  • ccludes another partially.
  • Does not correspond to a three-

dimensional vertex.

16

L L L L W W W W Y T

slide-17
SLIDE 17

Ex Expl ploring ng Local Str truc uctur tures – A linear Algebra Approach to 3D Reconstruction [Sugihara, 1982]

  • Consider a picture which is obtained as

the orthographic projection of the

  • bject
  • i-th vertex: (xi, yi, zi)
  • j-th face: (aj, bj, cj)
  • 3D reconstruction can be formulated as

estimating the unknowns zi, aj, bj, cj…

17

slide-18
SLIDE 18

Ex Expl ploring ng Local Str truc uctur tures – A linear Algebra Approach to 3D Reconstruction [Sugihara, 1982]

Line label assignments to the picture provide two forms of constraints:

  • 1. Vertex i should be on the j-th face:
  • 2. Vertex t should be nearer than the k-th face

18

slide-19
SLIDE 19

Ex Expl ploring ng Local Str truc uctur tures – A linear Algebra Approach to 3D Reconstruction [Sugihara, 1982]

Theorem: A labeled line drawing represents a polyhedral scene if and only if the linear system has a solution.

19

In practice, there are usually infinite number of solutions …

slide-20
SLIDE 20

Ex Expl ploring ng Local Str truc uctur tures – Shape Recovery via Optimization

  • To resolve ambiguity, one option is to use additional cues such as

shading [Sugihara, 1986]:

20

Lambertian surface with light source direction l:

slide-21
SLIDE 21

Ex Expl ploring ng Local Str truc uctur tures – Shape Recovery via Optimization

  • Another option is to invoke additional structural priors, such as

smoothness and regularity:

21

“To interpret a polygon in the image, we try to find a configuration of the vertices in space that makes the three-dimensional figure as regular as possible. Regularity might be measured in a variety of ways … we prefer local features which are more likely to survive occlusion.”

  • - Barrow and Tenenbaum, 1981

e.g., f(w) = “sum of the squares of angles of faces”

slide-22
SLIDE 22

Ex Expl ploring ng Local Str truc uctur tures – Some History of Additional Cues in Single Line Drawing Interpretation

  • Light intensity: Horn (1975), Woodham (1977, 1981), Ikeuchi (1981)

Coleman and Jain (1982)

  • Apparent distortion of known patterns: Kender (1979), Kanade

(1981), Ikeuchi (1984)

  • Surface contour: Stevens (1981), Barrow and Tenenbaum (1981),

Marr (1982)

  • Texture: Bajcsy and Lieberman (1976), Witkin (1981)
  • Vanishing points: Nakatani and Kitahashi (1984)
  • ……

22

slide-23
SLIDE 23

Exploring Structures for 3D Reconstruction

23

LOCAL: face-edge-vertex graph, smooth curves & surfaces SEMI-GLOBAL: symmetry, parallelism & orthogonality GLOBAL: shape grammar

Image Source: Chen et al., 2007; Pauly et al., 2008 Image Source: https://stuckeman.psu.edu/adapting-modern-architecture-local-context

slide-24
SLIDE 24

Sym ymmetry y Structures

24

Symmetry captures almost all “regularities”.

slide-25
SLIDE 25

Sym ymmetry y Structures – Hidden Images from Rotation

25

slide-26
SLIDE 26

Sym ymmetry y Structures – Hidden Images from Reflection

26

slide-27
SLIDE 27

Sym ymmetry y Structures – Hidden Images from Translation

27

slide-28
SLIDE 28

Sym ymmetric Structure & Group

28

  • Definition. A set of 3-D features S is called a symmetric structure if there exists a

non-trivial subgroup G of E(3) that acts on it such that for every g in G, the map is an (isometric) automorphism of S. We say the structure S has a group symmetry G.

slide-29
SLIDE 29

Sym ymmetry y Structures – Hidden Multiple Views

29

slide-30
SLIDE 30

Sym ymmetry y Structures – Hidden Multiple Views

30

Solving g0 from Lyapunov equations: with gi’ and gi known.

slide-31
SLIDE 31

Sym ymmetry y Structures

31

3-D reconstruction with symmetry is simple, accurate and robust!

slide-32
SLIDE 32

Sym ymmetry y Structures – Hidden Images in Each View

32

1 2 3 4 (3) (4) (2) (1)

Symmetry on object Virtual camera-camera

slide-33
SLIDE 33

Sym ymmetry y Structures – Reflective Homography

33

1 2 3 4 (3) (4) (2) (1)

2 pairs of symmetric points Decompose H to obtain (R’, T’, N) and T0 Reflective homography Solve Lyapunov equation to obtain R0.

slide-34
SLIDE 34

Sym ymmetry y Structures – Alignment of Different Objects

34

?

slide-35
SLIDE 35

Sym ymmetry y Structures – Scale Correction

35

? For a point p on the intersection line

slide-36
SLIDE 36

Sym ymmetry y Structures – Alignment of Different Views

36

z z z

slide-37
SLIDE 37

Sym ymmetry y Structures – Scale Correction

37

For any image x1 in the first view, its corresponding image in the second view is:

slide-38
SLIDE 38

Sym ymmetry y Structures – Alignment of Multiple Views

38

Method is object-centered and baseline-independent.

slide-39
SLIDE 39

Sym ymmetry y Structures – Experiment Results

39

slide-40
SLIDE 40

Sym ymmetry y Structures – Experiment Results

40

slide-41
SLIDE 41

Sym ymmetry y Structures – Image Transfer

41

slide-42
SLIDE 42

Sym ymmetry y Structures – Camera Pose

42

slide-43
SLIDE 43

Sym ymmetry y Structures – Full 3D Model

43

slide-44
SLIDE 44

Reference o

  • n M

Multiview G Geometry o

  • f J

Junctions, L Lines, P Planes, a and S Symmetries

44

An Invitation to 3D Vision, Yi Ma, S. Soatto, J. Kosecka, and S. Sastry Springer, 2004.

. . .

slide-45
SLIDE 45

Exploring Structures for 3D Reconstruction

45

LOCAL: face-edge-vertex graph, smooth curves & surfaces SEMI-GLOBAL: symmetry, parallelism & orthogonality GLOBAL: shape grammar

Image Source: Chen et al., 2007; Pauly et al., 2008 Image Source: https://stuckeman.psu.edu/adapting-modern-architecture-local-context

slide-46
SLIDE 46

Expl Exploring ng Globa bal Struc uctur ures s – Shape Grammar

46

slide-47
SLIDE 47
slide-48
SLIDE 48

Summary

  • Here we outline the rest of the tutorial.
  • A holistic 3D reconstruction pipeline consists of three main steps:
  • 1. Structure type identification, i.e., what types of structure are there in the

scene?

  • 2. Structure instance identification, i.e., where are the instances of such

structure in the image?

  • 3. Structure-based 3D reconstruction, i.e., how can we infer the 3D geometry

from the detected structure instances?

48

Structure Type Identification Structure Instance Detection Structure-based 3D Reconstruction Input Image(s)

We have focused on Step 3 so far. The rest of the tutorial will discuss Steps 1 and 2.

slide-49
SLIDE 49

Example Problems in Structure Type Identification

  • Local structures
  • Are the objects polyhedrons, smooth/curved surfaces, piece-wise planar, or

some combination of those?

  • Semi-global structures
  • What types of symmetry are there?
  • Manhattan world? Atlanta world? Something else?
  • Global structures (shape grammar):
  • What rules are used (known as inverse procedural modeling)?

49

slide-50
SLIDE 50

Example Problems in Structure Instance Detection

  • Local structures:
  • Build the face-edge-vexter graph, i.e., via junction detection, line detection,

face identification, etc.

  • Estimate the parameters of the geometric primitives involved
  • Semi-global structures:
  • Symmetry detection
  • Vanishing point detection
  • Global structures:
  • Procedural reconstruction

50