Neural l Renderin ing Prof. Leal-Taix and Prof. Niessner 1 - - PowerPoint PPT Presentation

neural l renderin ing
SMART_READER_LITE
LIVE PREVIEW

Neural l Renderin ing Prof. Leal-Taix and Prof. Niessner 1 - - PowerPoint PPT Presentation

Neural l Renderin ing Prof. Leal-Taix and Prof. Niessner 1 Renderin ing Camera Def. - Intrinsics - Often: 3D Scene: - focal length - Material - principal point) - Lighting - Geometry (incl. animation) Camera View Point -


slide-1
SLIDE 1

Neural l Renderin ing

1

  • Prof. Leal-Taixé and Prof. Niessner
slide-2
SLIDE 2

Renderin ing

2

3D Scene:

  • Material
  • Lighting
  • Geometry

(incl. animation) Camera View Point

  • Extrinsics
  • 6 DoF (3rot, 3trans)

Camera Def.

  • Intrinsics
  • Often:
  • focal length
  • principal point)
  • Prof. Leal-Taixé and Prof. Niessner
slide-3
SLIDE 3

Photo-reali listic Im Image Synthesis is

The Rendering Equation [Kajiya 86]

3

  • Prof. Leal-Taixé and Prof. Niessner
slide-4
SLIDE 4

Need 3D Content fo for r Renderi ring

Textures Material & Lighting Geometry

4

  • Prof. Leal-Taixé and Prof. Niessner
slide-5
SLIDE 5

Computer Vis isio ion fo for r Reconstructio ion

ICCV’09 [Agarwal et al.]: Building Rome in a Day

5

  • Prof. Leal-Taixé and Prof. Niessner
slide-6
SLIDE 6

Computer Graphics

3D Dig igit itization

Computer Vision

6

  • Prof. Leal-Taixé and Prof. Niessner
slide-7
SLIDE 7

Tra raditi itional Gra raphic ics vs Deep Learn rnin ing

3D Model + Textures + Shading -> Synthetic Image Star Wars Rogue One

Discriminator loss Generator loss

[Karras et al. 18]

Generative Adversarial Networks

7

  • Prof. Leal-Taixé and Prof. Niessner
slide-8
SLIDE 8

Id Idea of f Neural Renderin ing

Novel View point synthesis:

8

  • Prof. Leal-Taixé and Prof. Niessner

Neural Network

  • > Encodes entire

scene description, lighting, materials, etc. 6 DoF Camera Pose / View Point

slide-9
SLIDE 9

Neural Renderin ing wit ith Pix ix2Pix ix

Ground truth for training

  • Pose + Target Image (e.g., observed from real world)
  • Constrain with re-rendering loss

Testing

  • Given unseen pose, generate image
  • Prof. Leal-Taixé and Prof. Niessner

9

slide-10
SLIDE 10

Neural Renderin ing wit ith Pix ix2Pix ix

10

  • Prof. Leal-Taixé and Prof. Niessner
slide-11
SLIDE 11

Other r Neural Renderin ing

  • Conditioned on Faces (Deep Video Portraits)
  • Conditioned on Human Skeleton (Everybody Dance Now)
  • Prof. Leal-Taixé and Prof. Niessner

11

slide-12
SLIDE 12

Neural Renderin ing wit ith Pix ix2Pix ix

12

  • Prof. Leal-Taixé and Prof. Niessner
slide-13
SLIDE 13

Deep Voxels

[Sitzmann et al. CVPR’19] Deep Voxels

13

  • Prof. Leal-Taixé and Prof. Niessner
slide-14
SLIDE 14

Deep Voxels

  • Main idea for video generation:

– Why learn 3D operations with 2D Convs !?!? – We know how 3D transformations work

  • E.g., 6 DoF rigid pose [

[ R | t | t ]

– Incorporate these into the architectures

  • Need to be differentiable!

– Example application: novel view point synthesis

  • Given rigid pose, generate image for that view

[Sitzmann et al. CVPR’19] Deep Voxels

14

  • Prof. Leal-Taixé and Prof. Niessner
slide-15
SLIDE 15

Deep Voxels

2D U-Net

Renderin g Lifting Layer 2D 3D

2D U-Net

2D Feature Extraction

Source View R, t

Projection Layer 3D 2D

Output Source

Target View R, t

3D U-Net

3D Features

Simplified overview for novel view synthesis [Sitzmann et al. CVPR’19] Deep Voxels

15

  • Prof. Leal-Taixé and Prof. Niessner
slide-16
SLIDE 16

Deep Voxels

[Sitzmann et al. CVPR’19] Deep Voxels

16

  • Prof. Leal-Taixé and Prof. Niessner
slide-17
SLIDE 17

Deep Voxels

Issue: we don’t know the depth for the target!

  • > Per-pixel softmax along the ray
  • > Network learns the depth

Occlusion Network:

[Sitzmann et al. CVPR’19] Deep Voxels

17

  • Prof. Leal-Taixé and Prof. Niessner
slide-18
SLIDE 18

Deep Voxels

[Sitzmann et al. ’18] Deep Voxels

18

  • Prof. Leal-Taixé and Prof. Niessner
slide-19
SLIDE 19

Deep Voxels

[Sitzmann et al. ’18] Deep Voxels

19

  • Prof. Leal-Taixé and Prof. Niessner
slide-20
SLIDE 20

Deep Voxels: : In Insights

  • Lifting from 2D to 3D works great

– No need to take specific care for temp. coherency!

  • All 3D operations are differentiable
  • Currently, only for novel view-point synthesis

– I.e., cGAN for new pose in a given scene

  • But: limited resolution due to dense 3D voxel grid

[Sitzmann et al. ’18] Deep Voxels

20

  • Prof. Leal-Taixé and Prof. Niessner
slide-21
SLIDE 21

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric

Both Scene Representation and Differentiable Renderer often adapted from traditional computer graphics.

Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

21

  • Prof. Leal-Taixé and Prof. Niessner
slide-22
SLIDE 22

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

22

  • Prof. Leal-Taixé and Prof. Niessner
slide-23
SLIDE 23

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

23

  • Prof. Leal-Taixé and Prof. Niessner
slide-24
SLIDE 24

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

24

  • Prof. Leal-Taixé and Prof. Niessner
slide-25
SLIDE 25

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

25

  • Prof. Leal-Taixé and Prof. Niessner
slide-26
SLIDE 26

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3)

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

26

  • Prof. Leal-Taixé and Prof. Niessner
slide-27
SLIDE 27

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3)

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

27

  • Prof. Leal-Taixé and Prof. Niessner
slide-28
SLIDE 28

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3) High quality Requires good SFM No compact representation

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

28

  • Prof. Leal-Taixé and Prof. Niessner
slide-29
SLIDE 29

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3) High quality Requires good SFM No compact representation

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

29

  • Prof. Leal-Taixé and Prof. Niessner
slide-30
SLIDE 30

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3) High quality Requires good SFM No compact representation High quality Requires good SFM

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

30

  • Prof. Leal-Taixé and Prof. Niessner
slide-31
SLIDE 31

Im Importing 3D stru ructure fr from CG

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Traced Volumetric Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3) High quality Requires good SFM No compact representation High quality Requires good SFM

Pros Pros Cons Cons Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

31

  • Prof. Leal-Taixé and Prof. Niessner
slide-32
SLIDE 32

Scene Repre resentatio ion Network rks Sitz itzmann et al., l., Neurip rips 2019

Scene Scene Representa tion tion Renderer

ℝ3՜ ℝ𝑜 ReLU MLP Generalized (learned) sphere-tracing

Generalizati

  • n
  • n

Hypernetwork

Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

32

  • Prof. Leal-Taixé and Prof. Niessner
slide-33
SLIDE 33

Scene Repre resentatio ion Network rks Sitz itzmann et al., l., Neurip rips 2019

Full 3D Reconstruction from single image!

Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

33

  • Prof. Leal-Taixé and Prof. Niessner
slide-34
SLIDE 34

NERF: : Neura ral Radia iance Fie ield lds Mild ildenhall et al., l., arX rXiv iv 2020

Scene Scene Representa tion tion Renderer

ℝ6՜ ℝ3 ReLU MLP + Positional Encoding View Direction Volumetric, stratified sampling

Generalizati

  • n
  • n

None.

Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

34

  • Prof. Leal-Taixé and Prof. Niessner
slide-35
SLIDE 35

NERF: : Neura ral Radia iance Fie ield lds Mild ildenhall et al., l., arX rXiv iv 2020

Photorealistic, including view-dependence! (~100 images)

Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

35

  • Prof. Leal-Taixé and Prof. Niessner
slide-36
SLIDE 36

Require rements

Scene Scene Representa tion tion

Multi-Plane Images Voxelgrids Image-based Point Clouds Implicit Function

Rendere rer

(Alpha) compositing Volumetric Ray-based Rasterization Splatting Sphere-Tracing Volumetric

Pros Pros Cons Cons

Fast rendering High quality Generalizes Only 2.5D Size “True 3D” High quality No reconstruction priors Memory O(n3) High quality Requires good SFM No compact representation High quality Requires good SFM High quality May generalize! Expensive rendering, training

Slides: Vincent Sitzmann (Eurographics State-of-the-art on Neural Rendering)

36

  • Prof. Leal-Taixé and Prof. Niessner
slide-37
SLIDE 37

Neura ral Texture res: : Feature res on 3D Mesh

37

  • Prof. Leal-Taixé and Prof. Niessner
slide-38
SLIDE 38

Siggraph’19 [Thies et al.]: Neural Textures 3D Geometry Neural Texture

Neura ral Texture res: : Feature res on 3D Mesh

38

  • Prof. Leal-Taixé and Prof. Niessner
slide-39
SLIDE 39

UV-Map Sampled Texture Siggraph’19 [Thies et al.]: Neural Textures 3D Geometry

Rendering 3D 2D

View R, t

Neural Texture

Neura ral Texture res: : Feature res on 3D Mesh

39

  • Prof. Leal-Taixé and Prof. Niessner
slide-40
SLIDE 40

UV-Map Renderer Output Image Sampled Texture Siggraph’19 [Thies et al.]: Neural Textures 3D Geometry

Rendering 3D 2D

View R, t

Neural Texture

Neura ral Texture res: : Feature res on 3D Mesh

40

  • Prof. Leal-Taixé and Prof. Niessner
slide-41
SLIDE 41

Defe ferred Neural Rendering

Albedo Depth Normal Lighting

Deferred Renderer Handcrafted ”Feature Maps“

Siggraph’19 [Thies et al.]: Neural Textures

41

  • Prof. Leal-Taixé and Prof. Niessner
slide-42
SLIDE 42

Defe ferred Neural Rendering

Albedo Depth Normal Lighting

Deferred Renderer Handcrafted ”Feature Maps“

Learned

Neural

Siggraph’19 [Thies et al.]: Neural Textures

42

  • Prof. Leal-Taixé and Prof. Niessner
slide-43
SLIDE 43

UV-Map Renderer Output Image Sampled Texture 3D Geometry

Rendering 3D 2D

View R, t

Neural Texture Siggraph’19 [Thies et al.]: Neural Textures

Defe ferred Neural Rendering

43

  • Prof. Leal-Taixé and Prof. Niessner
slide-44
SLIDE 44

Neura ral Texture res: : Feature res on 3D Mesh

Siggraph’19 [Thies et al.]: Neural Textures

44

  • Prof. Leal-Taixé and Prof. Niessner
slide-45
SLIDE 45

Input UV-Map Ours

Novel Vie iew-Poin int Synthesis is

Siggraph’19 [Thies et al.]: Neural Textures

45

  • Prof. Leal-Taixé and Prof. Niessner
slide-46
SLIDE 46

Ground Truth Ours

Novel Vie iew-Poin int Synthesis is

Siggraph’19 [Thies et al.]: Neural Textures

46

  • Prof. Leal-Taixé and Prof. Niessner
slide-47
SLIDE 47

Geometry Editing Input Sequence

Scene Edit itin ing

Siggraph’19 [Thies et al.]: Neural Textures

47

  • Prof. Leal-Taixé and Prof. Niessner
slide-48
SLIDE 48

Scene Edit itin ing

Siggraph’19 [Thies et al.]: Neural Textures

48

  • Prof. Leal-Taixé and Prof. Niessner
slide-49
SLIDE 49

Scene Edit itin ing

Siggraph’19 [Thies et al.]: Neural Textures

49

  • Prof. Leal-Taixé and Prof. Niessner
slide-50
SLIDE 50

Facia ial Anim imation

Siggraph’19 [Thies et al.]: Neural Textures

50

  • Prof. Leal-Taixé and Prof. Niessner
slide-51
SLIDE 51

Facia ial Anim imation

Siggraph’19 [Thies et al.]: Neural Textures

51

  • Prof. Leal-Taixé and Prof. Niessner
slide-52
SLIDE 52

Facia ial Anim imation

Siggraph’19 [Thies et al.]: Neural Textures

52

  • Prof. Leal-Taixé and Prof. Niessner
slide-53
SLIDE 53

Facia ial Anim imation

Siggraph’19 [Thies et al.]: Neural Textures

53

  • Prof. Leal-Taixé and Prof. Niessner
slide-54
SLIDE 54

Facia ial Anim imation

Siggraph’19 [Thies et al.]: Neural Textures

54

  • Prof. Leal-Taixé and Prof. Niessner
slide-55
SLIDE 55

Defe ferred Neural Rendering

Siggraph’19 [Thies et al.]: Neural Textures

56

  • Prof. Leal-Taixé and Prof. Niessner
slide-56
SLIDE 56

Defe ferred Neural Rendering

Siggraph’19 [Thies et al.]: Neural Textures

57

  • Prof. Leal-Taixé and Prof. Niessner
slide-57
SLIDE 57

Neura ral Voic ice Puppetry ry

58

  • Prof. Leal-Taixé and Prof. Niessner
slide-58
SLIDE 58

Neural Voic ice Puppetry

59

  • Prof. Leal-Taixé and Prof. Niessner
slide-59
SLIDE 59

Neural Voic ice Puppetry

60

  • Prof. Leal-Taixé and Prof. Niessner
slide-60
SLIDE 60

Neural Voic ice Puppetry

[Hannun et al.] DeepSpeech RNN

Output of the RNN of DeepSpeech:

  • Logits of alphabet (|alphabet|=29)

We use a time window (n=16)

61

  • Prof. Leal-Taixé and Prof. Niessner
slide-61
SLIDE 61

Neural Voic ice Puppetry

62

  • Prof. Leal-Taixé and Prof. Niessner
slide-62
SLIDE 62

Neural Voic ice Puppetry

63

  • Prof. Leal-Taixé and Prof. Niessner
slide-63
SLIDE 63

Neural Voic ice Puppetry

Person-specific Blendshape Expression Model

64

  • Prof. Leal-Taixé and Prof. Niessner
slide-64
SLIDE 64

Neural Voic ice Puppetry

Audio2Expression Training

65

  • Prof. Leal-Taixé and Prof. Niessner
slide-65
SLIDE 65

Neural Voic ice Puppetry

Hundreds of commentator videos available

  • - all with ‘neutral’ talking style --

66

  • Prof. Leal-Taixé and Prof. Niessner
slide-66
SLIDE 66

Neural Voic ice Puppetry

Flame Model Basel Model

with pose

67

  • Prof. Leal-Taixé and Prof. Niessner
slide-67
SLIDE 67

Neural Voic ice Puppetry

68

  • Prof. Leal-Taixé and Prof. Niessner
slide-68
SLIDE 68

Neural Voic ice Puppetry

69

  • Prof. Leal-Taixé and Prof. Niessner
slide-69
SLIDE 69

Neural Voic ice Puppetry

70

  • Prof. Leal-Taixé and Prof. Niessner
slide-70
SLIDE 70

Neural Voic ice Puppetry

71

  • Prof. Leal-Taixé and Prof. Niessner
slide-71
SLIDE 71

Big ig Open Challenges

72

  • Prof. Leal-Taixé and Prof. Niessner
slide-72
SLIDE 72

Big ig Open Challenges

Photo-realistic Reconstruction

73

  • Prof. Leal-Taixé and Prof. Niessner
slide-73
SLIDE 73

Big ig Open Challenges: How much can AI I do?

Siggraph’19 [Thies et al.]: Neural Textures

74

  • Prof. Leal-Taixé and Prof. Niessner
slide-74
SLIDE 74

Big ig Open Challenges: : 3D in in Networks

Why learn 3D operations, such as transformations?

  • > differentiate known operators

Capsule networks are motivated by inverse graphics [Sabour et al. 17]

75

  • Prof. Leal-Taixé and Prof. Niessner
slide-75
SLIDE 75
slide-76
SLIDE 76

See you next week 

77

  • Prof. Leal-Taixé and Prof. Niessner
slide-77
SLIDE 77

Some Extra ra Sli lides:

  • Prof. Leal-Taixé and Prof. Niessner

78

slide-78
SLIDE 78

Neural Voic ice Puppetry

79

  • Prof. Leal-Taixé and Prof. Niessner
slide-79
SLIDE 79

Neural Voic ice Puppetry

80

  • Prof. Leal-Taixé and Prof. Niessner
slide-80
SLIDE 80

Neural Voic ice Puppetry

81

  • Prof. Leal-Taixé and Prof. Niessner
slide-81
SLIDE 81

Facia ial Reenactment

82

Dense Sparse

  • Prof. Leal-Taixé and Prof. Niessner
slide-82
SLIDE 82

Neura ral Renderi ring and Reenactment of

  • f

Human Actor r Vid ideos

[Liu et al. 19] Neural Rendering and Reenactment of Human Actor Videos

83

  • Prof. Leal-Taixé and Prof. Niessner
slide-83
SLIDE 83

84

Body Reenactment

Dense Sparse

  • Prof. Leal-Taixé and Prof. Niessner
slide-84
SLIDE 84

85

Open Challenges

  • Motion Capturing
  • Person-specific Motions/Expressions
  • Temporal Stability
  • Image Quality
  • Prof. Leal-Taixé and Prof. Niessner