New Perspectives for Processing and Synthesizing Images and Videos
Qifeng Chen, Assistant Professor, HKUST (PowerPoint presentation)


SLIDE 1

New Perspectives for Processing and Synthesizing Images and Videos

Qifeng Chen Assistant Professor, HKUST

SLIDE 2

Q&A

◼ Which company is the most valuable worldwide?
◼ Apple
◼ What is the most important product of Apple?
◼ iPhone
◼ What is the most differentiating functionality of a smartphone today?
◼ Photography (arguably)

SLIDE 3

Low-light Imaging

SLIDE 4

Powerful Zoom

SLIDE 5

Overview

◼Image and Video Processing

▪ Learning to See in the Dark
▪ Zoom to Learn, Learn to Zoom
▪ Fast Image and Video Processing
▪ Reflection Removal

◼Image and Video Synthesis

▪ Photographic Image Synthesis
▪ Semi-parametric Image Synthesis
▪ RGBD Future Video Prediction
▪ Fully Automatic Video Colorization

SLIDE 6

Image and Video Processing

SLIDE 7

Learning to See in the Dark

SLIDE 8

Low-light Imaging

A deep-learning-based image signal processor (ISP)

Chen Chen, Qifeng Chen, Jia Xu, and Vladlen Koltun. Learning to See in the Dark, CVPR 2018
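The approach replaces the camera's hand-engineered ISP with a single network trained end-to-end on raw short-exposure data. A minimal sketch of the raw preprocessing used in this line of work, packing the Bayer mosaic into 4 channels and applying brightness amplification; the black/white levels and the ratio below are illustrative, camera-specific constants, not the paper's prescribed values:

```python
import numpy as np

def pack_raw(bayer, black_level=512, white_level=16383, ratio=100.0):
    """Pack a Bayer mosaic (H, W) into a 4-channel, half-resolution image
    and scale it by the amplification ratio before it enters the network.
    The constants here are illustrative and camera-specific."""
    x = (bayer.astype(np.float32) - black_level) / (white_level - black_level)
    x = np.clip(x, 0.0, 1.0)
    h, w = x.shape
    # Each 2x2 Bayer tile (e.g. RGGB) becomes 4 channels at half resolution.
    packed = np.stack([x[0:h:2, 0:w:2],   # R
                       x[0:h:2, 1:w:2],   # G1
                       x[1:h:2, 0:w:2],   # G2
                       x[1:h:2, 1:w:2]],  # B
                      axis=-1)
    return packed * ratio  # brightness amplification for the dark exposure
```

The packed tensor is what a fully-convolutional network would consume to produce the final RGB image.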

Learning to See in the Dark

SLIDE 9

Dataset

SLIDE 10

Amplification Ratio

SLIDE 11

Results

SLIDE 12

Demo

SLIDE 13

Results

SLIDE 14

Zoom to Learn, Learn to Zoom

SLIDE 15

Data Collection

SLIDE 16

Data Collection

SLIDE 17

Why not just super-resolution with GANs?

◼ Existing super-resolution methods are trained on downsampled RGB images that contain little noise
◼ But in 8X digital zoom, noise is prominent
◼ RGB images are the output of the ISP
  ▪ High frequency is removed by denoising
◼ We train our model to recover underlying high-frequency details from noisy input

SLIDE 18

Contextual Bilateral Loss

A novel loss (CoBi), building on the contextual loss, for measuring the similarity of slightly misaligned image pairs
SLIDE 19

Contextual Bilateral Loss
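In rough form, the idea behind CoBi can be sketched as a nearest-neighbor feature match penalized by spatial distance, which is what makes it tolerant of slight misalignment between training pairs. The squared-L2 distances and the weight `ws` below are illustrative simplifications (the paper uses normalized cosine feature distances), so treat this as a sketch of the structure, not the exact loss:

```python
import numpy as np

def cobi_loss(feat_p, feat_q, pos_p, pos_q, ws=0.1):
    """Contextual Bilateral (CoBi) loss sketch: each source feature is
    matched to its best target feature under a combined feature + spatial
    distance, then the best-match distances are averaged.
    feat_*: (N, C) feature vectors; pos_*: (N, 2) normalized coordinates."""
    # Pairwise feature distances, shape (N, M).
    d_feat = ((feat_p[:, None, :] - feat_q[None, :, :]) ** 2).sum(-1)
    # Pairwise spatial distances, shape (N, M).
    d_pos = ((pos_p[:, None, :] - pos_q[None, :, :]) ** 2).sum(-1)
    combined = d_feat + ws * d_pos
    # Best match per source feature, averaged over all source features.
    return combined.min(axis=1).mean()
```

Perfectly aligned identical inputs give zero loss; shifting the target positions raises it, but only by the (down-weighted) spatial term.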

SLIDE 20

Results

SLIDE 21

Results

SLIDE 22

Results

SLIDE 23

Results

https://youtu.be/xmCzET2GNk0

SLIDE 24

Going well

SLIDE 25

A hazy day

SLIDE 26

Dehazed image

Nonlocal Dehazing [Berman et al. 2016]

SLIDE 27

But not practical

Nonlocal Dehazing takes a few seconds

SLIDE 28

Alternative solutions?

◼Use another method

▪No state-of-the-art accuracy

◼Accelerate implementation

▪Time consuming

◼Nonlinear Function Approximator

▪Simple, general, accurate and fast

SLIDE 29

Real-time performance

Our approximator runs at 30 fps

SLIDE 30

Fast Image Processing

Qifeng Chen, Jia Xu, and Vladlen Koltun. Fast Image Processing with Fully-Convolutional Networks, ICCV 2017
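One way to see why the approximator is fast: it is a shallow fully-convolutional network whose dilation rates grow exponentially, so it aggregates context over hundreds of pixels at full resolution without any downsampling or upsampling. A small sketch of the receptive-field arithmetic; the specific depth used below is illustrative, not the paper's exact configuration:

```python
def receptive_field(dilations, kernel=3):
    """Receptive field (in pixels) of a stack of dilated kernel x kernel
    convolutions: each layer adds (kernel - 1) * dilation to the field."""
    rf = 1
    for d in dilations:
        rf += (kernel - 1) * d
    return rf

# Doubling dilations, as in a context aggregation network.
print(receptive_field([1, 2, 4, 8, 16, 32, 64]))  # -> 255
```

Seven 3x3 layers thus cover a 255-pixel window, which is why a handful of full-resolution layers can approximate operators that depend on global image content.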

SLIDE 31

Results

SLIDE 32

Demo

SLIDE 33

Single Image Reflection Removal

SLIDE 34

Data Collection

SLIDE 35

Method

SLIDE 36

Results

SLIDE 37

Deep Image and Video Synthesis

SLIDE 38

Art by Human Creation

SLIDE 39

Art by Human Creation & AI

SLIDE 40

Photographic image synthesis

Input semantic layouts Synthesized images

Qifeng Chen and Vladlen Koltun. Photographic Image Synthesis with Cascaded Refinement Networks. ICCV 2017

SLIDE 41

Motivation

◼ Computer graphics

▪ Alternative route to photorealism
▪ Capture photographic appearance
▪ Fast image synthesis

SLIDE 42

Motivation

◼Artificial Intelligence

▪ Visual Imagination

SLIDE 43

Our approach

◼ Cascaded refinement networks
◼ Perceptual Loss
◼ Diversity

SLIDE 44

Cascaded refinement networks

[Diagram: refinement modules progress from low to high resolution]
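The cascaded structure can be sketched as follows. This is an illustrative NumPy sketch, not the paper's implementation: the learned convolutions inside each refinement module are omitted, and only the structural idea is shown, namely that each module doubles the working resolution and re-injects the semantic layout (downsampled to that resolution) alongside the upsampled features from the previous module:

```python
import numpy as np

def upsample2x(x):
    """Nearest-neighbor 2x upsampling of an (H, W, C) feature map."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def refine(layout, feats):
    """One refinement step (learned convolutions omitted): upsample the
    coarser features and concatenate the layout at the current resolution."""
    feats = upsample2x(feats)
    return np.concatenate([layout, feats], axis=-1)

def crn_forward(layout_pyramid, base_feats):
    """Run refinement modules from coarse to fine; each doubles resolution."""
    feats = base_feats
    for layout in layout_pyramid:
        feats = refine(layout, feats)
    return feats
```

Starting from 4x4 base features and a two-level layout pyramid, the output lands at 16x16 with the layout channels stacked in at every scale.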

SLIDE 45

Perceptual Loss

SLIDE 46

Diversity

SLIDE 47

Comparisons on Cityscapes

SLIDE 48

Results on NYU dataset

Tseung Kwan O, Kowloon

SLIDE 49

User Study

SLIDE 50

User study

SLIDE 51

GTA5 and Demo Video

SLIDE 52

Semi-parametric Image Synthesis

Semantic layouts Our result

Xiaojuan Qi, Qifeng Chen, Jiaya Jia, and Vladlen Koltun. Semi-parametric Image Synthesis. CVPR 2018

SLIDE 53

Image Synthesis

NYU dataset [Silberman et al. ECCV 2012]
ADE20K dataset [Zhou et al. 2017]

Semantic layouts Our result

SLIDE 54

Prior Work: Parametric Models

CRN [Chen and Koltun 2017] Pix2pix [Isola et al. 2017]

SLIDE 55

Prior Work: Non-parametric Models

Scene Completion using Millions of Photographs [Hays and Efros 2007]

SLIDE 56

Our Approach

[Diagram: an external memory of photographic segments organized by class (sky, forest, grass, mountain)]

SLIDE 57

Our Approach

[Diagram: a semantic layout with sky, forest, grass, and mountain regions queries the external memory]

SLIDE 58

Our Approach

[Diagram: segments matching each region of the semantic layout are retrieved from the external memory]

SLIDE 59

Our Approach

Stage 1: Canvas Generation
[Diagram: retrieved segments are composited into a canvas]

SLIDE 60

Our Approach

Stage 2: Image Synthesis
[Diagram: the semantic layout and the canvas are synthesized into the final result]

SLIDE 61

SIMS: Canvas Generation

[Diagram: the semantic layout queries an external memory of segments organized by class (building, car, ...)]

SLIDE 62

SIMS: Canvas Generation

[Diagram: segments (building, car, ...) are retrieved from the external memory for the semantic layout]

SLIDE 63

SIMS: Canvas Generation

[Diagram: a transformation network aligns the retrieved segments to the semantic layout]

SLIDE 64

SIMS: Canvas Generation

[Diagram: an ordering network composites the transformed segments into the canvas]

SLIDE 65

SIMS: Image Synthesis

[Diagram: the semantic layout and the canvas, the inputs to image synthesis]

SLIDE 66

SIMS: Image Synthesis

[Diagram: a synthesis network f (convolution, pooling, and upsampling layers) takes the semantic layout and the canvas as input]

SLIDE 67

SIMS: Image Synthesis

[Diagram: the synthesis network f produces the output image from the semantic layout and the canvas]
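Putting the two SIMS stages together, the control flow can be sketched as below. This is a deliberately toy sketch: retrieval is a dictionary lookup standing in for nearest-neighbor segment search, and the transformation, ordering, and synthesis networks are collapsed into placeholders, so it conveys only the semi-parametric structure, not the learned components:

```python
def build_canvas(layout_regions, memory):
    """SIMS stage 1 sketch: for each class region in the semantic layout,
    retrieve a matching photographic segment from the external memory and
    place it on the canvas. (The real system retrieves by segment
    similarity, aligns segments with a transformation network, and resolves
    overlaps with an ordering network.)"""
    canvas = {}
    for region, cls in layout_regions.items():
        canvas[region] = memory.get(cls, "<missing>")
    return canvas

def synthesize(layout_regions, memory, refine=lambda canvas: canvas):
    """SIMS stage 2 sketch: a synthesis network f turns the canvas into the
    final image; `refine` is an identity placeholder for that network."""
    return refine(build_canvas(layout_regions, memory))
```

The split is the point: the non-parametric memory contributes photographic texture, while the parametric stage-2 network only has to fill gaps and harmonize the composite.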

SLIDE 68

Results

SLIDE 69

SLIDE 70

SLIDE 71

SLIDE 72

SLIDE 73

Semantic layout

SLIDE 74

Pix2pix [Isola et al. 2017]

SLIDE 75

CRN [Chen and Koltun 2017]

SLIDE 76

Our result

SLIDE 77

Diversified Synthesis

SLIDE 78

Image Statistics: Mean Power Spectrum

[Figure: mean power spectrum of Pix2pix [Isola et al. 2017] vs. real images]

SLIDE 79

Image Statistics

[Figure: mean power spectrum of CRN [Chen and Koltun 2017] vs. real images]

SLIDE 80

Image Statistics

[Figure: mean power spectrum of our approach vs. real images]

SLIDE 81

Perceptual Experiments

                  Cityscapes   Cityscapes   Cityscapes   NYU      ADE20K     Mean
                  (coarse)     (fine)       (GTA5)       (fine)   (coarse)
SIMS > Pix2pix    94.2%        98.1%        95.7%        94.9%    87.6%      94.1%
SIMS > CRN        93.9%        74.1%        84.5%        89.1%    88.9%      86.1%

SLIDE 82

Thank You


SLIDE 86

Future Prediction

SLIDE 87

Video Prediction

SLIDE 88

3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

SLIDE 89

3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

SLIDE 90

Results

SLIDE 91

Results

SLIDE 92

Results

SLIDE 93

Video Colorization

SLIDE 94

Fully Automatic Video Colorization with Self-Regularization and Diversity

SLIDE 95

Diversity

SLIDE 96

Results

SLIDE 97

Thank You

https://cqf.io