Viruual Content Generation via Deep Learning Yinda Zhang Research - - PowerPoint PPT Presentation

viruual content generation via deep learning
SMART_READER_LITE
LIVE PREVIEW

Viruual Content Generation via Deep Learning Yinda Zhang Research - - PowerPoint PPT Presentation

Viruual Content Generation via Deep Learning Yinda Zhang Research Scientist @ Google yindaz@google.com, www.zhangyinda.com GAMES Webinar A bit about me... A bit about me... 3D Scene Understanding Data-Driven Approach Shape Analysis Depth


slide-1
SLIDE 1

Viruual Content Generation via Deep Learning

Yinda Zhang GAMES Webinar

Research Scientist @ Google yindaz@google.com, www.zhangyinda.com

slide-2
SLIDE 2

A bit about me...

slide-3
SLIDE 3

A bit about me...

3D Scene Understanding Data-Driven Approach Shape Analysis

slide-4
SLIDE 4

Image Depth Image Depth

Pixel4 Rear Facing Camera: htups://ai.googleblog.com/2019/12/improvements-to-porurait-mode-on-google.html Pixel4 Front Facing Camera: htups://ai.googleblog.com/2020/04/udepth-real-time-3d-depth-sensing-on.html Zhang et.al., Du2Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels, arXiv:2003.14299

Depth Sensing on Device

slide-5
SLIDE 5

Augmented Reality

slide-6
SLIDE 6
slide-7
SLIDE 7

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-8
SLIDE 8

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-9
SLIDE 9

3D Shape Generation from a Single Image

[1] Choy et.al., 3dr2n2: A unifjed approach for single and multi-view 3d object reconstruction, ECCV 2016. [2] Fan et.al., A point set generation network for 3d object reconstruction from a single image, CVPR 2017. [3] Lorensen et.al., Marching cubes: A high resolution 3d surgace construction algorithm, SIGGRAPH 1987. [4] Bernardini et.al., The ball-pivoting algorithm for surgace reconstruction, IEEE Trans. Vis. Comput. Graph 1999.

Input Volumes [1, 3] Point Cloud [2, 4] Mesh (Ours) Mesh + Texture (Ours)

slide-10
SLIDE 10

Pixel2Mesh

slide-11
SLIDE 11 [1] Choy et.al., 3dr2n2: A unifjed approach for single and multi-view 3d object reconstruction, ECCV 2016. [2] Fan et.al., A point set generation network for 3d object reconstruction from a single image, CVPR 2017. [3] Lorensen et.al., Marching cubes: A high resolution 3d surgace construction algorithm, SIGGRAPH 1987. [4] Bernardini et.al., The ball-pivoting algorithm for surgace reconstruction, IEEE Trans. Vis. Comput. Graph 1999. [5] Kato et.al., Neural 3d mesh renderer, CVPR 2018. [6] Groueix et.al., Atlasnet: A papier-maˆche´ approach to learning 3d surgace generation, CVPR 2018. [7] Mescheder et.al., Occupancy networks: Learning 3d reconstruction in function space, CVPR 2019.

Input Volumes [1, 3] Point Cloud [2, 4] Mesh [5] Mesh [6] Implicit [7] Mesh (Ours) GT

slide-12
SLIDE 12

Pixel2Mesh

slide-13
SLIDE 13
slide-14
SLIDE 14

htups://walsvid.github.io/Pixel2MeshPlusPlus/

slide-15
SLIDE 15

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-16
SLIDE 16

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-17
SLIDE 17

Shape Manipulation -- Pose Transfer

slide-18
SLIDE 18

htups://jiashunwang.github.io/Neural-Pose-Transfer/

slide-19
SLIDE 19

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-20
SLIDE 20

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-21
SLIDE 21

Neural Rendering

Rely on texture map. Rely on volume. Rely on implicit representation.

Thies et.al., Deferred Neural Rendering: Image Synthesis using Neural Textures, SIGGRAPH 2019. Lombardi et.al., Neural Volumes: Learning Dynamic Renderable Volumes from Images, SIGGRAPH 2019. Mildenhall et.al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, arXiv:2003.08934

For Large Scale Scene

slide-22
SLIDE 22

Neural Rendering

Rely on texture map. Rely on volume. Rely on point cloud. Rely on implicit representation.

Thies et.al., Deferred Neural Rendering: Image Synthesis using Neural Textures, SIGGRAPH 2019. Lombardi et.al., Neural Volumes: Learning Dynamic Renderable Volumes from Images, SIGGRAPH 2019. Mildenhall et.al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, arXiv:2003.08934
slide-23
SLIDE 23

Neural Point Cloud Rendering

RGB-D Scans Point Cloud Rendering

slide-24
SLIDE 24
slide-25
SLIDE 25

htups://daipengwa.github.io/NeuralPointCloudRendering_ProjectPage/

slide-26
SLIDE 26

Neural Rendering

Rely on texture map. Rely on volume. Rely on point cloud. Rely on implicit representation.

Thies et.al., Deferred Neural Rendering: Image Synthesis using Neural Textures, SIGGRAPH 2019. Lombardi et.al., Neural Volumes: Learning Dynamic Renderable Volumes from Images, SIGGRAPH 2019. Mildenhall et.al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, arXiv:2003.08934
slide-27
SLIDE 27

Neural Rendering

Rely on texture map. Rely on volume. Rely on point cloud. Rely on implicit representation.

Thies et.al., Deferred Neural Rendering: Image Synthesis using Neural Textures, SIGGRAPH 2019. Lombardi et.al., Neural Volumes: Learning Dynamic Renderable Volumes from Images, SIGGRAPH 2019. Mildenhall et.al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, arXiv:2003.08934
slide-28
SLIDE 28

Implicit Function for 3D

Park et.al., DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation, CVPR 2019. Mescheder et.al., Occupancy Networks: Learning 3D Reconstruction in Function Space, CVPR 2019.
slide-29
SLIDE 29

Render Implicit Function

Haru et.al., Sphere tracing: A geometric method for the antialiased ray tracing of implicit surgaces. The Visual Computer 1996.
slide-30
SLIDE 30
  • Extremely time consuming

○ Too many queries for the rendering process. ○ Unroll multiple times for backpropagation. ○ Differentiable.

Render Deep Implicit Function

Coarse-to-fjne Aggressive Marching Converge Criteria

slide-31
SLIDE 31

Render Deep Implicit Function

slide-32
SLIDE 32

Render Deep Implicit Function

slide-33
SLIDE 33

htup://b1ueber2y.me/projects/DIST-Renderer/

slide-34
SLIDE 34

A pipeline to generate viruual content...

Generate 3D Manipulate 3D Render 3D

Saito et.al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, ICCV 2019. Wang et.al., Neural Pose Transfer by Spatially Adaptive Instance Normalization, CVPR 2020. Guo et.al., The Relightables: Volumetric Pergormance Capture of Humans with Realistic Relighting, SIGGRAPH 2019.
slide-35
SLIDE 35

Thanks!