CreativeAI: Deep Learning for Graphics
Image Domains
Niloy Mitra (UCL), Iasonas Kokkinos (UCL), Paul Guerrero (UCL), Nils Thuerey (TUM), Tobias Ritschel (UCL)
Timetable:
- 2:15 pm: Introduction (Niloy, Paul, Nils)
- ~2:25 pm: Machine Learning Basics
- ~2:55 pm: Neural Network Basics
- ~3:25 pm: Feature Visualization
- ~3:35 pm: Alternatives to Direct Supervision
- 15 min. break
- 4:15 pm: Image Domains
- ~4:45 pm: 3D Domains
- ~5:15 pm: Motion and Physics
- ~5:45 pm: Discussion (Niloy, Paul, Nils)
SIGGRAPH Asia Course CreativeAI: Deep Learning for Graphics
Theory and Basics: State of the Art
Examples of deep learning techniques that are commonly used in the image domain:
- Dilated Convolutions, Grouped Convolutions
- Autoencoders, Transfer Learning, One-shot Learning, Style Transfer
- GANs, CGANs
- Attention, "Gray Box" Learning
Images from: Canziani et al., An Analysis of Deep Neural Network Models for Practical Applications, arXiv 2017 Blog: https://towardsdatascience.com/neural-network-architectures-156e5bad51ba
ImageNet classification performance
(for up-to-date top-performers see leaderboards of datasets like ImageNet or COCO)
[Plots: top-1 accuracy vs. number of operations; top-1 accuracy per million parameters]
Some notable architecture elements shared by many successful architectures:
- Grouped Convolutions
- Dilated Convolutions
- Residual Blocks and Dense Blocks
- Skip Connections (UNet)
- Attention (spatial and over channels)
Problem: increasing the receptive field costs a lot of parameters. Idea: spread out the samples used in each convolution.
Images from: Dumoulin and Visin, A guide to convolution arithmetic for deep learning, arXiv 2016 Yu and Koltun, Multi-scale Context Aggregation by Dilated Convolutions, ICLR 2016
Dilated convolution:
- 1st layer (not dilated): 3×3 receptive field
- 2nd layer (1-dilated): 7×7 receptive field
- 3rd layer (2-dilated): 15×15 receptive field
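The receptive-field growth can be checked with a short computation: each k×k layer with dilation d extends the receptive field by (k-1)·d pixels. A minimal sketch, assuming the exponential dilation schedule 1, 2, 4 from Yu and Koltun:

```python
def receptive_field(kernel_size, dilations):
    """Receptive field of stacked convolutions with the given dilation factors."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d  # each layer extends the field by (k-1)*d
    return rf

# Doubling the dilation per layer gives exponential receptive-field growth
# while the parameter count grows only linearly with depth:
print(receptive_field(3, [1]))        # 3  (1st layer: 3x3)
print(receptive_field(3, [1, 2]))     # 7  (2nd layer: 7x7)
print(receptive_field(3, [1, 2, 4]))  # 15 (3rd layer: 15x15)
```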
Problem: convolution parameters grow quadratically in the number of channels. Idea: split channels into groups and remove connections between different groups.
Image from: Xie et al., Aggregated Residual Transformations for Deep Neural Networks, CVPR 2017
[Diagram: n channels split into 3 groups of n/3 channels each; convolutions connect channels only within their group]
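A quick parameter count makes the saving concrete (a sketch; bias terms omitted for clarity):

```python
def conv2d_params(kernel, c_in, c_out, groups=1):
    """Weight count of a 2D convolution layer, optionally grouped."""
    assert c_in % groups == 0 and c_out % groups == 0
    # each group connects only c_in/groups inputs to c_out/groups outputs
    return groups * kernel * kernel * (c_in // groups) * (c_out // groups)

dense = conv2d_params(3, 96, 96)              # all channels connected
grouped = conv2d_params(3, 96, 96, groups=3)  # connections only within groups
print(dense, grouped)  # grouping into 3 cuts the parameter count by 3x
```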
Learning to Simplify: Fully Convolutional Networks for Rough Sketch Cleanup, Simo-Serra et al., Siggraph 2016
Pencil: input; red: ground truth
[Diagram: input data → encoder → useful features (latent vectors) → decoder → reconstruction, trained with an L2 loss function]
Manash Kumar Mandal, Implementing PCA, Feedforward and Convolutional Autoencoders and using it for Image Reconstruction, Retrieval & Compression, https://blog.manash.me/
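The encode/decode/reconstruct loop can be sketched with a toy linear autoencoder, trained by plain gradient descent (an illustrative sketch only; real autoencoders use deep nonlinear networks and an optimizer):

```python
# Toy linear autoencoder: 2-D input -> 1-D latent -> 2-D reconstruction.
data = [(1.0, 2.0), (2.0, 4.0), (-1.0, -2.0)]  # points on a 1-D subspace
w1, w2 = 0.5, 0.5   # encoder weights
v1, v2 = 0.5, 0.5   # decoder weights
lr = 0.001

def loss():
    total = 0.0
    for x, y in data:
        z = w1 * x + w2 * y          # encode to the latent value
        rx, ry = v1 * z, v2 * z      # decode back to 2-D
        total += (rx - x) ** 2 + (ry - y) ** 2   # L2 reconstruction loss
    return total

initial = loss()
for _ in range(500):
    gw1 = gw2 = gv1 = gv2 = 0.0
    for x, y in data:
        z = w1 * x + w2 * y
        ex, ey = v1 * z - x, v2 * z - y          # reconstruction errors
        gv1 += 2 * ex * z
        gv2 += 2 * ey * z
        dz = 2 * ex * v1 + 2 * ey * v2           # backprop through decoder
        gw1 += dz * x
        gw2 += dz * y
    w1 -= lr * gw1; w2 -= lr * gw2
    v1 -= lr * gv1; v2 -= lr * gv2

print(initial, loss())  # the reconstruction loss decreases during training
```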
Wang et al., Learning a Shared Shape Space for Multimodal Garment Design, Siggraph Asia 2018
[Diagram: three different representations are mapped into a shared latent space of useful features]
[Diagram: an encoder/decoder trained on one task (normals) is reused; the encoder's latent features transfer to a new task (edges, 3D edges)]
Images from: Zamir et al., Taskonomy: Disentangling Task Transfer Learning, CVPR 2018
Features extracted by well-trained CNNs often generalize beyond the task they were trained on.
Images from: Zamir et al., Taskonomy: Disentangling Task Transfer Learning, CVPR 2018
http://taskonomy.stanford.edu/api/
In a good feature space, some tasks become easier: nearest neighbors might already be good enough, or a metric can be optimized in feature space.
https://hackernoon.com/one-shot-learning-with-siamese-networks-in-pytorch-8ddaab10340e
Feature training: lots of examples from classes in subset A.
One-shot: train a regressor (e.g. nearest neighbor on the computed features) with one example per class in class subset B.
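The one-shot step can be as simple as a nearest-neighbor lookup in feature space. A minimal sketch; the feature vectors and the class names `cat`/`dog` are hypothetical stand-ins for features produced by a network trained on class subset A:

```python
def l2(a, b):
    """Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def one_shot_classify(query_feature, support):
    """Nearest-neighbor regressor: one labeled feature per unseen class."""
    return min(support, key=lambda label: l2(query_feature, support[label]))

# One example per unseen class from subset B (hypothetical features):
support = {"cat": [0.1, 0.9], "dog": [0.8, 0.2]}
print(one_shot_classify([0.2, 0.8], support))  # cat
```

If the feature space generalizes well, the single support example per class is already enough to separate the unseen classes.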
Images from: Gatys et al., Image Style Transfer using Convolutional Neural Networks, CVPR 2016
Remember that features in a CNN often generalize well. Define style and content using the layers of a CNN (VGG19 for example):
- shallow layers describe style
- deeper layers describe content
[Diagram: style image A and content image B pass through the same pre-trained network with fixed weights; the output image is optimized to have the same style features as A and the same content features as B]
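Style features in Gatys et al. are Gram matrices of channel activations; matching them (plus content features) drives the optimization. A minimal sketch with plain lists standing in for CNN feature maps:

```python
def gram_matrix(features):
    """Channel-by-channel correlations of a feature map.
    features: C channels, each flattened to a list of H*W activations."""
    return [[sum(a * b for a, b in zip(fi, fj)) for fj in features]
            for fi in features]

def style_loss(features_a, features_b):
    """Sum of squared differences between the two Gram matrices."""
    ga, gb = gram_matrix(features_a), gram_matrix(features_b)
    return sum((x - y) ** 2
               for row_a, row_b in zip(ga, gb)
               for x, y in zip(row_a, row_b))

a = [[1.0, 0.0], [0.0, 1.0]]  # two channels of a tiny feature map
print(gram_matrix(a))          # [[1.0, 0.0], [0.0, 1.0]]
print(style_loss(a, a))        # 0.0 (identical styles)
```

Because the Gram matrix sums over spatial positions, it discards layout and keeps only which features co-occur, which is what makes it a style rather than a content descriptor.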
Extensions: more control over the result; feed-forward networks for real-time style transfer.
Images from: Gatys et al., Controlling Perceptual Factors in Neural Style Transfer, CVPR 2017; Johnson et al., Perceptual Losses for Real-Time Style Transfer and Super-Resolution, ECCV 2016
Ruder et al., Artistic Style Transfer for Videos, German Conference on Pattern Recognition 2016
Player 1 (generator): scores if the discriminator can't distinguish its output from a real image.
Player 2 (discriminator): scores if it can distinguish between real (from the dataset) and fake.
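The two players' objectives can be written down directly. A sketch of the standard discriminator loss and the non-saturating generator loss, where `d_real` and `d_fake` are the discriminator's probabilities that a sample is real:

```python
import math

def discriminator_loss(d_real, d_fake):
    """Player 2 scores when d_real -> 1 and d_fake -> 0."""
    return -(math.log(d_real) + math.log(1.0 - d_fake))

def generator_loss(d_fake):
    """Player 1 scores when the discriminator is fooled (non-saturating form)."""
    return -math.log(d_fake)

# A confident, correct discriminator pays little loss:
print(discriminator_loss(0.9, 0.1))  # ~0.21
# An undecided one pays more:
print(discriminator_loss(0.5, 0.5))  # ~1.39
# The generator's loss falls as it fools the discriminator:
print(generator_loss(0.9), generator_loss(0.1))
```

At the game's equilibrium the discriminator outputs 0.5 everywhere, which is exactly the point where it pays its maximal per-sample loss of 2·log 2.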
From GAN to CGAN: the output is increasingly determined by the condition.
Karras et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
Kelly and Guerrero et al., FrankenGAN: Guided Detail Synthesis for Building Mass Models using Style-Synchronized GANs, Siggraph Asia 2018
Isola et al., Image-to-Image Translation with Conditional Adversarial Nets, CVPR 2017
Image Credit: Zhu et al., Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, ICCV 2017
Image Credit: Image-to-Image Translation with Conditional Adversarial Nets, Isola et al.
Problem: a good loss function is often hard to find. Idea: train a network to discriminate between network output and ground truth.
Images from: Simo-Serra, Iizuka and Ishikawa, Mastering Sketching, Siggraph 2018
Image Credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, Zhu et al.
[Diagram: generator1 + discriminator1 translate X → Y; generator2 + discriminator2 translate Y → X; with only adversarial losses, the two translations are not constrained to match yet]
[Diagram: cycle consistency: generator2(generator1(x)) should reproduce x and generator1(generator2(y)) should reproduce y, each enforced with an L1 loss function]
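The cycle-consistency term is just an L1 penalty on both round trips. A sketch, where the toy "translators" `g1` and `g2` are hypothetical stand-ins for the two generators:

```python
def l1(a, b):
    """L1 distance between two flattened images."""
    return sum(abs(x - y) for x, y in zip(a, b))

def cycle_consistency_loss(x, y, g1, g2):
    """g1: X -> Y, g2: Y -> X. Both round trips should return to the start."""
    return l1(g2(g1(x)), x) + l1(g1(g2(y)), y)

# Toy translators that happen to be exact inverses, so the loss is zero:
g1 = lambda v: [t + 1.0 for t in v]   # X -> Y
g2 = lambda v: [t - 1.0 for t in v]   # Y -> X
print(cycle_consistency_loss([0.0, 2.0], [1.0, 3.0], g1, g2))  # 0.0
```

When the generators are not mutual inverses the loss is positive, which is exactly the constraint the adversarial losses alone fail to provide.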
Image from: Zhu et al., Toward Multimodal Image-to-Image Translation, NIPS 2017
[Figure: Pix2Pix results shown for comparison]
Zhu et al., Toward Multimodal Image-to-Image Translation, NIPS 2017
[Diagram: a generator paired with an encoder, trained with a KL-divergence loss on the latent code and an L2 reconstruction loss]
[Diagram: two training cycles: cycle 1 combines generator and encoder with a KL-divergence loss and an L2 loss; cycle 2 adds a discriminator with an adversarial loss and an encoder with an L2 loss on the recovered latent code]
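The KL-divergence term keeps the encoded latent codes close to a standard normal, so that sampling codes at test time produces diverse outputs. A sketch of that loss in the usual mean/log-variance parameterization:

```python
import math

def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, 1)), summed over latent dimensions.
    log_var holds log(sigma^2), the common parameterization."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))

print(kl_to_standard_normal([0.0, 0.0], [0.0, 0.0]))  # 0.0: already standard normal
print(kl_to_standard_normal([1.0], [0.0]))            # 0.5: penalizes drifting codes
```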
[Pipeline: input: façade shape → 1st step: window/door layout → 2nd step: texture → 3rd step: …; each step is a BicycleGAN with separate training sets]
Karras et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
The condition does not have to be an image.
[Diagram: text condition → low-res generator and discriminator → high-res generator and discriminator. Example conditions: "This flower has white petals with a yellow tip and a yellow pistil"; "A large bird has large thighs and large wings that have white wingbars"]
Zhang et al., StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks, ICCV 2017
Entangled: different properties may be mixed up over all dimensions. Disentangled: different properties are in different dimensions.
specified property: number
specified property: character
Mathieu et al., Disentangling factors of variation in deep representations using adversarial training, NIPS 2016
Example task for a UNet: input image, target: horizontal mirroring.
Why is this hard for the network?
1) Locality of convolutions
2) Driven only by data from shallower layers (no semantics)
Problem: the architecture constrains information flow. For example, in a typical CNN, at a given image location (red), information about other image locations (grey) is available at a resolution that depends on the spatial distance.
[Figure: input image and layer 1-3 features; the receptive field for high-resolution information is small, while low-resolution information comes from a large receptive field]
Idea: use higher-level semantics to select relevant information
Jaderberg et al., Spatial Transformer Networks, NIPS 2015
Wang et al., Residual Attention Network for Image Classification, CVPR 2017
Idea: gather information from distant details based on their features.
Wang et al., Non-local Neural Networks, CVPR 2018
Zhang et al., Self-Attention Generative Adversarial Networks, arXiv 2018
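The core of non-local blocks and self-attention is a weighted gather over all positions, where the weights come from feature similarity rather than spatial distance. A minimal dot-product attention sketch over toy feature vectors:

```python
import math

def softmax(xs):
    m = max(xs)                      # subtract max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(queries, keys, values):
    """Each query gathers a weighted mix of all values; the weights depend
    only on how well the query matches each key, not on spatial distance."""
    out = []
    for q in queries:
        scores = softmax([sum(qi * ki for qi, ki in zip(q, k)) for k in keys])
        out.append([sum(w * v[d] for w, v in zip(scores, values))
                    for d in range(len(values[0]))])
    return out

keys = [[10.0, 0.0], [0.0, 10.0]]   # features at two (possibly distant) positions
values = [[1.0, 1.0], [5.0, 5.0]]
result = attention([[1.0, 0.0]], keys, values)
print(result)  # close to [1.0, 1.0]: the query matched the first key
```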
Idea: weigh (emphasize and suppress) channels based on global information
Hu et al., Squeeze-and-Excitation Networks, CVPR 2018
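The squeeze-excite-rescale pattern can be sketched in a few lines. Note the simplification: the excitation here is a single weight per channel, whereas the paper uses a small two-layer MLP over the pooled descriptor:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def se_block(channels, gate_weights):
    """channels: C feature maps, each flattened to a list of activations."""
    # squeeze: global average pooling gives one descriptor per channel
    squeezed = [sum(c) / len(c) for c in channels]
    # excitation: a gate in (0, 1) per channel from global information
    gates = [sigmoid(w * s) for w, s in zip(gate_weights, squeezed)]
    # rescale: emphasize or suppress each channel
    return [[g * v for v in c] for g, c in zip(gates, channels)]

channels = [[1.0, 1.0], [1.0, 1.0]]
out = se_block(channels, [100.0, -100.0])  # emphasize ch. 0, suppress ch. 1
print(out)  # approximately [[1.0, 1.0], [0.0, 0.0]]
```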
Problem: Most networks are black boxes. Idea: Regress parameters for a small set of well-known operations.
Hu et al., Exposure: A White-Box Photo Post-Processing Framework, Siggraph 2018
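The white-box idea: the network regresses parameters for a chain of fixed, interpretable operations instead of predicting pixels directly. An illustrative sketch with two toy operations, not the paper's exact operator set:

```python
def exposure(img, ev):
    """Well-known op: scale brightness by 2**ev, clipped to [0, 1]."""
    return [min(1.0, p * 2.0 ** ev) for p in img]

def gamma(img, g):
    """Well-known op: per-pixel power curve."""
    return [p ** g for p in img]

def white_box_retouch(img, params):
    """A network would regress `params` = (ev, g); the image processing
    itself stays interpretable and editable by a human."""
    ev, g = params
    return gamma(exposure(img, ev), g)

print(white_box_retouch([0.25, 0.5], (1.0, 1.0)))  # [0.5, 1.0]
```

Because every step is a named operation with a handful of parameters, the result can be inspected and adjusted, unlike a pixel-predicting black box.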
- Dilated Convolutions, Grouped Convolutions
- Autoencoders, Transfer Learning, One-shot Learning, Style Transfer
- GANs, CGANs
- Attention, "Gray Box" Learning
http://geometry.cs.ucl.ac.uk/creativeai/
[Diagram: a latent code sample feeds the generator; in addition to the discriminator, InfoGAN maximizes mutual information between the code and the output, so varying the code changes interpretable factors]
Image Credit: InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets, Chen et al.