1 Prior Work: Depth from Defocus Prior Work: Modifying Cameras - PDF document

Light fields Light Fields: From Shape Recovery to Sparse Reconstruction Ravi Ramamoorthi University of California, San Diego Refocusing Viewpoint Change L4CV Keynote, Jul 26, 2017 Consumer light field cameras Outline § Motivation § Light Fields for Passive 3D Capture § Specular Objects and SVBRDF Invariants Lytro (first generation) Pelican Light § Sparse Light Field Interpolation, Reconstruction § Insights and Future Work RayTrix Lytro Illum Goal: Passive easy-to-use 3D Prior Work: Depth from Stereo Real-World Scene High Quality Depth Estimation Stanford Multi-Camera Array Stereo Pair: Scharstein et al. 2002, Min et al. 2013, … Multi-view Stereo: Okutomi and Kanade 1993, Li et al. 2010, … Pros: + Robust in most cases + One time setup + Baseline modifiable Light-field Camera Cons: - Multiple cameras needed - Difficult setup (image rectification) - Baseline dependent - Relies on correspondence (brighter means closer to the camera) Tao et al. 13, 14,15 1

Prior Work: Depth from Defocus Prior Work: Modifying Cameras DSLR with a focusing mechanism DSLR with a focusing mechanism Masks: Liang 2008, Levin 2010, … Depth from defocus: Klarquist 1995, Schechner 2000, … Pros: + Robust in most cases Pros: + Aperture modifiable + Robust in most cases + One camera solution + Aperture modifiable + One camera solution Cons: - Some require multiple captures Cons: - Masks? - Difficult to obtain image (multi-exposures) - How to add masks? - Aperture size dependent - Relies on defocus Novelty : The Four Cues Novelty : The Four Cues INPUT: OUTPUT: INPUT: OUTPUT: Light-field Image High quality depth map Light-field Image High quality depth map Core Depth Estimation Depth from Correspondence and Defocus (Tao 13) Novelty : The Four Cues Novelty : The Four Cues INPUT: OUTPUT: INPUT: OUTPUT: Light-field Image High quality depth map Light-field Image High quality depth map Separate Separate Secularities Secularities (Tao 14,15) (Tao 14,15) Improve Input Improve Input Depth from Depth from ? Correspondence and Correspondence and Defocus Defocus (Tao 13) (Tao 13) Improve Output Output constraints using Shading information (Tao 15,16) 2

Defocus + Correspondence Results First public 3D from light field algorithm for consumer Lytro Camera: Tao et al., ICCV 13 Unify Defocus, Correspondence, Shading with LF Cameras. Tao et al. CVPR 15, PAMI 16 Occlusion Occlusion model occluded § What’s the problem with occlusions? Pinhole Model plane occluder angular angular patch patch occluded object object object plane Camera plane Camera plane Camera plane No occlusion With occlusion occluder “Reversed” Pinhole Model Wang et al. ICCV 15, PAMI 16 Occlusion theory Algorithm overflow § Insight: § The angular and spatial edges have same orientation § Half the angular patch still follows photo-consistency Light field input Initial depth Final depth Spatial image Angular patch for red pixel Same orientation Same color Edge detection Initial occlusion Final occlusion 3

Results with Occlusions Outline § Motivation § Light Fields for Passive 3D Capture § Specular Objects and SVBRDF Invariants § Sparse Light Field Interpolation, Reconstruction § Insights and Future Work Specularity: Point vs Line Consistency Specularity: Point vs Line Consistency Lambertian Diffuse Surface RGB 3D Scatter Plot of Angular Lambertian Diffuse Surface RGB 3D Scatter Plot of Angular (Out-of-focus) (Refocusing to Photo consistency) B B G G R R Specularity: Point vs Line Consistency Specularity: Point vs Line Consistency Lambertian Diffuse+Specular RGB 3D Scatter Plot of Angular Lambertian Diffuse+Specular RGB 3D Scatter Plot of Angular Surface Surface (Out-of-focus) (Refocused to Line) B B G G R R 4

Specularity: Line Consistency SVBRDF-Invariant Equation § Instead of separating specularity, (SV)BRDF invariance § Build on differential motion theory [Chandraker 14] § Use light field cameras instead § More views à more robust § First framework proven to be SV BRDF-Invariant § Extend traditional optical flow to glossy objects Wang et al. CVPR 16, PAMI 17 Point and Line Consistency with Light Field Cameras: Tao et al. PAMI 15 Δ I = I 2 ( u ) − I 1 ( u ) Δ I = I 2 ( u ) − I 1 ( u ) Same intensity Image Image Image plane plane plane Slides from Wang et al. CVPR 16, PAMI 17 Δ I = I 2 ( u ) − I 1 ( u ) Spatial change (same) = ! ( ! ) ! ( ! ) + Image plane Viewpoint change depth is solvable ! ( ! ! ) by one motion! ! ! , ! ! In 3D: 3 unknowns ( z, ) solvable by 3 motions! 5

rank deficiency ¡ Diffuse + 1-lobe unknown function of half-angle ¡ Directly solve a line of solutions z unknown unknown function diffuse term h v ρ ( n , s , v ) = ρ s ( θ ) + σ n ! ! s θ Different BRDFs! ! ! Use assumption on BRDF model ¡ Represent BRDF ratio in two ways à combine ¡ SVBRDF-invariant equation function of a ! ! = ! ( ! ! , ! ! ) § z Form 1 § Directly solving requires initial conditions Form 2 h v n s � Assume shape is locally polynomial φ ! , ! ! , ! ! à functions of a 5x5 patch = f ( z ) ! ! = = ! ( ! ! , ! ! ) = ! = ! ! ! ! + ! ! ! ! + ! ! !" + ! ! ! + ! ! ! + ! ! Quadratic shape Invariant to SVBRDF! ¡ 100 materials à 9 categories ¡ Recall the solution lies on a line z ! ! ! ! ¡ z is known à and are known ! ! ! ! ρ ¡ Finally, the BRDF can be recovered 6

Input image Input image Our result PLC (PAMI15) Our result PLC (PAMI15) SDC (CVPR15) PSSM (CVPR15) Lytro Illum SDC (CVPR15) PSSM (CVPR15) Lytro Illum Can also be inserted in a robust geometric optimization framework. See Li et al. CVPR 2017 Resolution trade-off Outline § Motivation § Light Fields for Passive 3D Capture Limited resolution § Specular Objects and SVBRDF Invariants High angular § Sparse Light Field Interpolation, Reconstruction Low spatial § Insights and Future Work Kalantari et al. Solution: angular super-resolution Straightforward solution n Model the process with a single CNN CNN Sparse Input Views Synthesized Views Kalantari et al. 7

Single CNN’s result High-level idea n Follow the pipeline of existing techniques and break the process into two components Goesele et al. [2010]; Chaurasia et al. [2013] n Disparity estimator n Color predictor n Model the components using learning n Train both models simultaneously Disparity Color View Synthesis Estimator CNN Predictor CNN Our result 4D RGBD Light Fields from 2D Image Kalantari et al. Srinivasan et al. ICCV 17 Light field video Lytro video n Consumer light field cameras limited bandwidth n Capture low frame rate videos Lytro Illum (3 fps video) Wang et al. SIGGRAPH 17 8

Hybrid Light Field Video System Our result DSLR 30 fps Lytro 3 fps Outline Shape, Reflectance, Resolution § Motivation § Significant progress in recovering overall shape § Light Fields for Passive 3D Capture § Can we recover fine-scale shape, reflectance § Hair, microstructure, detailed BRDFs § Specular Objects and SVBRDF Invariants § Light field camera as a reflectance device § Sparse Light Field Interpolation, Reconstruction § Two-shot near-field acquisition: Xu et al. SIGGRAPH Asia 16 § Insights and Future Work § Theoretical limits of shape/reflectance ambiguity § Resolution limits (Liang and Ramamoorthi TOG 15) § Easy, sparse light field capture for VR § Super-resolution limits with learning Deep learning for analysis Deep learning for synthesis § Generally received much less attention § Strong physical foundation § Designed for reducing an image to a label § Insufficient data in some applications Classification Image Captioning § This talk: Learning system architecture inspired Krizhevsky et al. 2012 Vinyals et al. 2014 by physically-based solutions § Leverage physics, use learning bypass hard problems (occlusion). Best of both worlds Object Detection Video Recognition Girshick et al. 2014 Karpathy et al. 2014 9

New Applications in Computer Vision Acknowledgements § Light Fields for Scene Flow (Tao et al. ICCV 15) § Students and Postdocs ( Michael Tao, Ting-Chun Wang, Pratul Srinivasan , Nima Kalantari, Zak Murez, Zhengqin Li, Zexiang Xu, § Light Field Material Recognition (Wang et al. ECCV 16) Jong-Chyi Su, Jun-Yan Zhu, Jiamin Bai, Dikpal Reddy, Eno Toeppe) § Light Field Motion Deblurring (Srinivasan et al. CVPR 16) § Light Field Descattering (Tian et al. ICCV 17) § Computer vision with multiple views/images § Collaborators (Jitendra Malik, Manmohan Chandraker, Alexei Efros, Szymon Rusinkiewicz,, Ren Ng, Chia-Kai Liang, Ebi Hiroaki, Jiandong Tian, Sunil Hadap) § Funding: NSF, ONR (x2), UC San Diego Center for Visual Computing (Google, Sony, Adobe, Nokia, Samsung, Draper) 10

1 Prior Work: Depth from Defocus Prior Work: Modifying Cameras - PDF document

Light fields Light Fields: From Shape Recovery to Sparse Reconstruction Ravi Ramamoorthi University of California, San Diego Refocusing Viewpoint Change L4CV Keynote, Jul 26, 2017 Consumer light field cameras Outline Motivation Light

Representation in Low-Level Visual Learning Erik Sudderth Brown University Department of

A Theory of Spherical Diagrams Giovanni Viglietta (work in progress...) JAIST July 16, 2020

Distance fields imre paadik Overview Signed distance fields Distance fields in computer

A A 3D 3D-Ad Adver ert t Cr Crea eati tion System em for Product ct Place cements

VU Augmented Reality on Mobile Devices VU Augmented Reality on Mobile Devices Introduction

Steerable Interfaces for Steerable Interfaces for Pervasive Computing Spaces Pervasive Computing

Dynamic Spatial Partitioning for Real-Time Visibility Determination Joshua Shagam Computer

Presenting 2D Web Content in XR TPAC 2018 Overview 2D Web Content in 3D XR - Background

Co-authors Jonathan P. Piccini, MD, MHS; Tongrong Wang, MS; S. Chris Malaisrie, MD; David R.

BUPT-MCPRL@TRECVID 2014: Surveillance Event Detection(SED) Qi Chen (chen_qi1990@163.com)

1 4 1. Visibility culling What is it for? Avoid processing polygons which do not

Non-Negative Graph Embedding N N ti G h E b ddi Jianchao Yang Shuicheng Yan Yun Fu Jianchao

on Petascale Supercomputing Platforms John E. Stone, Kirby L. Vandivort, Klaus Schulten

DEEP LEARNING (F OR R OBOTIC V ISION ) Juxi Leitner @Juxi http://Juxi.net R ESEARCHER R OBOTICS

Usin ing UAV Technology Thomas Bamford, Kamran Esmaeili, Angela P. Schoellig CAMI 2016 In

Human-Oriented Robotics Temporal Reasoning Part 3/3 Kai Arras Social Robotics Lab, University

Tsinghua University Monocular Depth-Pose Prediction [R, t] Depth and Pose RGB PoseNet

EE-559 Deep learning 9.3. Visualizing the processing in the input Fran cois Fleuret

Learning Visual Distance Function L i Vi l Di t F ti for Identification from one Example for

Welcome! Todays Agenda: The Postprocessing Pipeline Vignetting, Chromatic Aberration

Learning to Synthesize Motion Blur CVPR 2019 Tim Brooks and Jon Barron Research Motion During

Analyzing Performance of QtQuick Applications Thomas McGuire KDAB thomas@kdab.com Performance:

Photographer anonymity Joint work with Peter Schaffer (Uni. of Luxembourg) Djamila Aouada (Uni.

Chapter 6: Space & Depth Perception Lec 12 Jonathan Pillow, Sensation & Perception (PSY

1 Prior Work: Depth from Defocus Prior Work: Modifying Cameras - PDF document

Light fields Light Fields: From Shape Recovery to Sparse Reconstruction Ravi Ramamoorthi University of California, San Diego Refocusing Viewpoint Change L4CV Keynote, Jul 26, 2017 Consumer light field cameras Outline Motivation Light

Representation in Low-Level Visual Learning Erik Sudderth Brown University Department of

A Theory of Spherical Diagrams Giovanni Viglietta (work in progress...) JAIST July 16, 2020

Distance fields imre paadik Overview Signed distance fields Distance fields in computer

A A 3D 3D-Ad Adver ert t Cr Crea eati tion System em for Product ct Place cements

VU Augmented Reality on Mobile Devices VU Augmented Reality on Mobile Devices Introduction

Steerable Interfaces for Steerable Interfaces for Pervasive Computing Spaces Pervasive Computing

Dynamic Spatial Partitioning for Real-Time Visibility Determination Joshua Shagam Computer

Presenting 2D Web Content in XR TPAC 2018 Overview 2D Web Content in 3D XR - Background

Co-authors Jonathan P. Piccini, MD, MHS; Tongrong Wang, MS; S. Chris Malaisrie, MD; David R.

BUPT-MCPRL@TRECVID 2014: Surveillance Event Detection(SED) Qi Chen (chen_qi1990@163.com)

1 4 1. Visibility culling What is it for? Avoid processing polygons which do not

Non-Negative Graph Embedding N N ti G h E b ddi Jianchao Yang Shuicheng Yan Yun Fu Jianchao

on Petascale Supercomputing Platforms John E. Stone, Kirby L. Vandivort, Klaus Schulten

DEEP LEARNING (F OR R OBOTIC V ISION ) Juxi Leitner @Juxi http://Juxi.net R ESEARCHER R OBOTICS

Usin ing UAV Technology Thomas Bamford, Kamran Esmaeili, Angela P. Schoellig CAMI 2016 In

Human-Oriented Robotics Temporal Reasoning Part 3/3 Kai Arras Social Robotics Lab, University

Tsinghua University Monocular Depth-Pose Prediction [R, t] Depth and Pose RGB PoseNet

EE-559 Deep learning 9.3. Visualizing the processing in the input Fran cois Fleuret

Learning Visual Distance Function L i Vi l Di t F ti for Identification from one Example for

Welcome! Todays Agenda: The Postprocessing Pipeline Vignetting, Chromatic Aberration

Learning to Synthesize Motion Blur CVPR 2019 Tim Brooks and Jon Barron Research Motion During

Analyzing Performance of QtQuick Applications Thomas McGuire KDAB thomas@kdab.com Performance:

Photographer anonymity Joint work with Peter Schaffer (Uni. of Luxembourg) Djamila Aouada (Uni.

Chapter 6: Space &amp; Depth Perception Lec 12 Jonathan Pillow, Sensation &amp; Perception (PSY

Chapter 6: Space & Depth Perception Lec 12 Jonathan Pillow, Sensation & Perception (PSY