Deep Learning for Graphics: Unsupervised Learning
Niloy Mitra, Iasonas Kokkinos, Paul Guerrero, Vladimir Kim, Kostas Rematas, Tobias Ritschel
UCL, UCL/Facebook, UCL, Adobe Research, U Washington, UCL


SLIDE 1

Deep Learning for Graphics

Unsupervised Learning

Niloy Mitra, Iasonas Kokkinos, Paul Guerrero, Vladimir Kim, Kostas Rematas, Tobias Ritschel
UCL, UCL/Facebook, UCL, Adobe Research, U Washington, UCL

SLIDE 2

EG Course “Deep Learning for Graphics”

Timetable

Presenters: Niloy, Iasonas, Paul, Vova, Kostas, Tobias
Topics: Introduction, Theory, NN Basics, Supervised Applications, Data, Unsupervised Applications, Beyond 2D, Outlook (the Outlook is shared by all presenters)

SLIDE 3

Unsupervised Learning

  • There is no direct ground truth for the quantity of interest
  • Autoencoders
  • Variational Autoencoders (VAEs)
  • Generative Adversarial Networks (GANs)
SLIDE 4

Autoencoders

Diagram: Input data → Encoder → Features

Goal: meaningful features that capture the main factors of variation in the dataset

  • These features are good for classification, clustering, exploration, generation, …
  • We have no ground truth for them

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

SLIDE 5

Autoencoders

Diagram: Input data → Encoder → Features (latent variables) → Decoder → Reconstruction

Goal: meaningful features that capture the main factors of variation, and that can be used to reconstruct the input

L2 loss function: L(x) = ‖x − Dec(Enc(x))‖²

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n
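To make the loss concrete, here is a minimal numpy sketch (not from the course notebooks; all names are illustrative) that trains a linear autoencoder on synthetic data by gradient descent on the L2 reconstruction loss:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: 200 points in R^10 that lie near a 2D subspace.
basis = rng.normal(size=(2, 10))
X = rng.normal(size=(200, 2)) @ basis + 0.01 * rng.normal(size=(200, 10))

# Linear autoencoder: encoder W_e (10 -> 2), decoder W_d (2 -> 10).
W_e = 0.1 * rng.normal(size=(10, 2))
W_d = 0.1 * rng.normal(size=(2, 10))

def l2_loss(X, W_e, W_d):
    X_hat = X @ W_e @ W_d          # reconstruction Dec(Enc(x))
    return np.mean((X - X_hat) ** 2)

lr = 0.01
loss_start = l2_loss(X, W_e, W_d)
for _ in range(500):
    Z = X @ W_e                    # features (latent variables)
    X_hat = Z @ W_d
    G = 2 * (X_hat - X) / X.size   # d loss / d X_hat
    W_d -= lr * Z.T @ G            # backprop through the decoder
    W_e -= lr * X.T @ (G @ W_d.T)  # backprop through the encoder
loss_end = l2_loss(X, W_e, W_d)
```

The reconstruction error drops as the 2D bottleneck learns the subspace the data lives in.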

SLIDE 6

Autoencoders

Figure: reconstructions, comparing the original images with autoencoder and PCA reconstructions.

  • A linear encoder and decoder give results close to PCA
  • Deeper networks give better reconstructions, since the learned basis can be non-linear

Image Credit: Reducing the Dimensionality of Data with Neural Networks, Hinton and Salakhutdinov
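The PCA side of this comparison takes only a few lines; a hedged numpy sketch (illustrative, not the course code) where a rank-k SVD plays the role of the linear encoder/decoder pair:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 6))
Xc = X - X.mean(axis=0)             # PCA assumes centered data

# PCA via SVD: the best rank-k linear reconstruction of the data.
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 2
components = Vt[:k]                 # principal directions, shape (k, 6)
codes = Xc @ components.T           # "encoder": project to k dimensions
X_pca = codes @ components          # "decoder": reconstruct

pca_err = np.mean((Xc - X_pca) ** 2)
total_var = np.mean(Xc ** 2)
```

The reconstruction error equals the energy of the discarded singular values, which is exactly the optimum a linear autoencoder can reach.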

SLIDE 7

Example: Document Word Prob. → 2D Code

Figure: 2D codes of documents, comparing LSA (based on PCA) with an autoencoder.

Image Credit: Reducing the Dimensionality of Data with Neural Networks, Hinton and Salakhutdinov

SLIDE 8

Example: Semi-Supervised Classification

  • Many images, but few ground truth labels

Diagram: Input data → Encoder → Features (latent variables) → Decoder; L2 loss function: ‖x − Dec(Enc(x))‖²

Start unsupervised: train the autoencoder on many unlabeled images. Then fine-tune supervised: train a classification network on the few labeled images.

Slide Credit: Fei-Fei Li, Justin Johnson, Serena Yeung, CS 231n

Diagram: Input → Encoder → Features → Classifier → Predicted label; loss function (softmax, etc.) against the GT label

SLIDE 9

Code example


Autoencoder (autoencoder.ipynb)

SLIDE 10

Generative Models

  • Assumption: the dataset consists of samples from an unknown distribution p(x)
  • Goal: create a new sample from p(x) that is not in the dataset

Figure: dataset samples vs. a generated sample.

Image credit: Progressive Growing of GANs for Improved Quality, Stability, and Variation, Karras et al.


SLIDE 12

Generative Models

Generator G with parameters θ: x = G(z), with z drawn from a distribution p(z) that is known and easy to sample from

SLIDE 13

Generative Models

Generator G with parameters θ: x = G(z), with z drawn from a known, easy-to-sample distribution p(z)

How to measure the similarity of p(x) and p_θ(x)?
1) Likelihood of the data in p_θ(x) → Variational Autoencoders (VAEs)
2) Adversarial game: a discriminator distinguishes p(x) from p_θ(x), and the generator makes them hard to distinguish → Generative Adversarial Networks (GANs)

SLIDE 14

Autoencoders as Generative Models?

  • A trained decoder transforms some features z into approximate samples from p(x)
  • What happens if we pick a random z?
  • We do not know the distribution of the features that decode to likely samples

Decoder = Generator?

Image Credit: Reducing the Dimensionality of Data with Neural Networks, Hinton and Salakhutdinov

Figure: a random point in the feature / latent space may decode to an unrealistic sample.

SLIDE 15

Variational Autoencoders (VAEs)

  • Pick a parametric distribution p(z) for the features
  • The generator maps z to an image distribution p_θ(x | z) (where θ are the parameters)
  • Train the generator to maximize the likelihood of the data in p_θ(x) = ∫ p_θ(x | z) p(z) dz

Diagram: sample z ~ p(z) → generator with parameters θ → p_θ(x | z)

SLIDE 16

Outputting a Distribution

Diagram: z → generator (parameters θ) → Normal distribution: the generator outputs a mean and a variance per output dimension. z → generator (parameters θ) → Bernoulli distribution: the generator outputs a probability per output dimension.
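A small numpy sketch (illustrative, not the course code) of the two output distributions as log-likelihoods, which is the quantity the generator is trained to maximize:

```python
import numpy as np

def gaussian_log_likelihood(x, mu, log_var):
    # log N(x; mu, sigma^2), elementwise, summed over output dimensions
    return np.sum(-0.5 * (np.log(2 * np.pi) + log_var
                          + (x - mu) ** 2 / np.exp(log_var)))

def bernoulli_log_likelihood(x, p, eps=1e-7):
    # log Bernoulli(x; p) for x in [0, 1], summed over output dimensions
    p = np.clip(p, eps, 1 - eps)
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

x = np.array([0.0, 1.0, 1.0])   # a tiny binary "image"
```

Both peak when the predicted distribution puts its mass on the observed x; the Bernoulli form is the standard choice for binarized MNIST.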

SLIDE 17

Variational Autoencoders (VAEs): Naïve Sampling (Monte-Carlo)

  • SGD approximates the expected values over the samples
  • In each training iteration, sample z from p(z) …
  • … and x randomly from the dataset, and maximize log p_θ(x | z)
SLIDE 18

Variational Autoencoders (VAEs): Naïve Sampling (Monte-Carlo)

  • In each training iteration, sample z from p(z) …
  • … and x randomly from the dataset
  • SGD approximates the expected values over the samples

Diagram: sample z ~ p(z) → generator with parameters θ; loss function: −log p_θ(x | z) for a random x from the dataset

SLIDE 19

Variational Autoencoders (VAEs): Naïve Sampling (Monte-Carlo)

  • In each training iteration, sample z from p(z) …
  • … and x randomly from the dataset
  • SGD approximates the expected values over the samples
  • Few (x, z) pairs have non-zero gradients

Diagram: sample z ~ p(z) → generator with parameters θ; loss function: −log p_θ(x | z); only a few random x from the dataset have a non-zero loss gradient for a given z

SLIDE 20

Variational Autoencoders (VAEs): The Encoder

  • During training, another network can guess a good z for a given x
  • The guessed distribution q_φ(z | x) should be much narrower than p(z)
  • This also gives us a latent code z for each data point x

Diagram: x → encoder with parameters φ → q_φ(z | x); sample z → generator with parameters θ; loss function: −log p_θ(x | z)

SLIDE 21

Variational Autoencoders (VAEs): The Encoder

  • Can we still easily sample a new z?
  • Need to make sure q_φ(z | x) approximates p(z)
  • Regularize with the KL-divergence KL(q_φ(z | x) ‖ p(z))
  • The negative loss can be shown to be a lower bound for the likelihood (the ELBO), and is equal to it if q_φ(z | x) = p_θ(z | x)

Diagram: x → encoder with parameters φ → q_φ(z | x); sample z → generator with parameters θ; loss function: −E_{q_φ(z|x)}[log p_θ(x | z)] + KL(q_φ(z | x) ‖ p(z))
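For the usual choice of a diagonal Gaussian q_φ(z | x) = N(μ, diag(σ²)) and prior p(z) = N(0, I), the KL term has a closed form; a minimal numpy sketch (illustrative names):

```python
import numpy as np

def kl_to_standard_normal(mu, log_var):
    # KL( N(mu, diag(exp(log_var))) || N(0, I) ), the VAE regularizer,
    # summed over the latent dimensions
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var)
```

It is zero exactly when the encoder outputs the prior (μ = 0, σ = 1) and positive otherwise, pulling q_φ(z | x) toward p(z).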

SLIDE 22

Reparameterization Trick

Sampling z directly from q_φ(z | x) blocks backpropagation into the encoder. Instead, sample ε ~ N(0, I), which does not depend on the parameters, and set z = μ_φ(x) + σ_φ(x) · ε. Example when q_φ(z | x) = N(μ_φ(x), diag(σ_φ(x)²)).

Diagram: x → encoder with parameters φ → (μ, σ); z = μ + σ · ε with ε ~ N(0, I); z → generator with parameters θ. Backprop now flows through μ and σ.
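A numpy sketch of the trick (illustrative, not the course code): the randomness lives entirely in ε, so μ and log σ² stay differentiable.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_z(mu, log_var):
    # z = mu + sigma * eps, with eps ~ N(0, I) independent of the
    # parameters, so gradients can flow through mu and log_var.
    eps = rng.normal(size=mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

mu, log_var = np.array([2.0, -1.0]), np.array([0.0, 0.0])
zs = np.stack([sample_z(mu, log_var) for _ in range(20000)])
```

Averaged over many draws, the samples have the mean and standard deviation the encoder asked for, while each draw is still differentiable in μ and log σ².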

SLIDE 23

Generating Data

Diagram: sample z ~ p(z) → generator with parameters θ → sample x. Figure: generated samples on MNIST and Frey Faces.

Image Credit: Auto-Encoding Variational Bayes, Kingma and Welling

SLIDE 24

Demos

VAE on MNIST: http://dpkingma.com/sgvb_mnist_demo/demo.html
VAE on Faces: http://vdumoulin.github.io/morphing_faces/online_demo.html

SLIDE 25

Code example


Variational Autoencoder (variational_autoencoder.ipynb)

SLIDE 26

Generative Adversarial Networks

Player 1, the generator: scores if the discriminator can't distinguish its output from a real image. Player 2, the discriminator: scores if it can distinguish between real and fake. Diagram: z → generator → fake image; real image from the dataset; discriminator → real/fake.

SLIDE 27

Naïve Sampling Revisited

  • Few (x, z) pairs have non-zero gradients
  • This is a problem of the maximum-likelihood objective
  • Use a different loss: train a discriminator network to measure similarity

Diagram: sample z ~ p(z) → generator with parameters θ; only a few random x from the dataset have a non-zero loss gradient for a given z

SLIDE 28

Why Adversarial?

  • If the discriminator approximates the data density p(x):
  • generated samples at the maximum of p(x) have the lowest loss
  • the optimal generator then has a single mode at that maximum, with small variance

Diagram: sample z ~ p(z) → generator

Image Credit: How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary?, Ferenc Huszár

Legend: G is the generator with parameters θ; D is the discriminator with parameters φ

SLIDE 29

Why Adversarial?

  • For GANs, the discriminator instead approximates D*(x) = p(x) / (p(x) + p_θ(x)), which depends on the generator

Diagram: sample z ~ p(z) → generator → samples scored by D*

Image Credit: How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary?, Ferenc Huszár

Legend: G is the generator with parameters θ; D is the discriminator with parameters φ

SLIDE 30

Why Adversarial?

VAEs: maximize the likelihood of data samples in p_θ(x). GANs: an adversarial game that approximately maximizes the likelihood of generator samples in p(x).

Image Credit: How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary?, Ferenc Huszár


SLIDE 32

GAN Objective

Legend: sample z ~ p(z); G is the generator; D is the discriminator; D(x) is the probability that x is not fake.

Fake/real classification loss (BCE): L(x, y) = −y · log D(x) − (1 − y) · log(1 − D(x))
Discriminator objective: max_φ E_{x~p(x)}[log D(x)] + E_{z~p(z)}[log(1 − D(G(z)))]
Generator objective: min_θ E_{z~p(z)}[log(1 − D(G(z)))]
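These objectives are binary cross-entropy with different targets; a hedged numpy sketch with illustrative names:

```python
import numpy as np

def bce(p, target, eps=1e-7):
    # binary cross-entropy between predicted probability p and target in {0, 1}
    p = np.clip(p, eps, 1 - eps)
    return -np.mean(target * np.log(p) + (1 - target) * np.log(1 - p))

def discriminator_loss(d_real, d_fake):
    # real images should be classified 1, generated images 0
    return bce(d_real, np.ones_like(d_real)) + bce(d_fake, np.zeros_like(d_fake))

def generator_loss_saturating(d_fake):
    # generator minimizes log(1 - D(G(z))): the negated fake term of the BCE
    return -bce(d_fake, np.zeros_like(d_fake))

d_real = np.array([0.9, 0.8])   # a confident discriminator on real images
d_fake = np.array([0.1, 0.2])   # ... and on generated images
```

A confident discriminator has a low loss; a discriminator stuck at 0.5 (unable to tell real from fake, the generator's goal) has a high one.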

SLIDE 33

Non-saturating Heuristic

The generator loss is the negative binary cross-entropy, log(1 − D(G(z))): it saturates where the discriminator confidently rejects generated samples, giving poor convergence.

Figure: the negative BCE curve is flat (vanishing gradient) where D(G(z)) is near 0.

Image Credit: NIPS 2016 Tutorial: Generative Adversarial Networks, Ian Goodfellow

SLIDE 34

Non-saturating Heuristic

Figure: negative BCE vs. BCE with flipped target.

Flip the target class instead of flipping the sign of the generator loss, i.e. minimize −log D(G(z)): this behaves like an ordinary BCE loss and gives good convergence.

Image Credit: NIPS 2016 Tutorial: Generative Adversarial Networks, Ian Goodfellow
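The two generator losses differ only in their gradient with respect to d = D(G(z)); a tiny check (illustrative) of why the flipped-target loss helps early in training, when d is near 0:

```python
# Gradient of each generator loss with respect to d = D(G(z)).

def grad_saturating(d):
    # loss = log(1 - d): d(loss)/dd = -1 / (1 - d); tiny when d is near 0
    return -1.0 / (1.0 - d)

def grad_non_saturating(d):
    # loss = -log(d): d(loss)/dd = -1 / d; large when d is near 0
    return -1.0 / d

d_early = 0.01  # early in training the discriminator easily rejects fakes
```

When the discriminator wins (d ≈ 0), the saturating loss gives almost no gradient while the non-saturating one gives a strong learning signal; at d = 0.5 the two coincide in magnitude.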

SLIDE 35

GAN Training

Discriminator training: x from the dataset and G(z); loss: −log D(x) − log(1 − D(G(z)))

Generator training: sample z ~ p(z); loss: −log D(G(z)), with the discriminator held fixed

Legend: G is the generator; D is the discriminator. Interleave both updates in each training step.
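The interleaved updates can be sketched with a toy 1D GAN (illustrative names, not the course code): the generator shifts unit Gaussian noise by a single parameter, the discriminator is a logistic regressor, and both are updated with manual gradients in each step.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda t: 1.0 / (1.0 + np.exp(-t))

# Toy 1D GAN: real data ~ N(3, 1); the generator shifts noise by theta.
theta = 0.0              # generator parameter
w, b = 0.0, 0.0          # discriminator: D(x) = sigmoid(w * x + b)
lr_g, lr_d, batch = 0.05, 0.1, 32

for _ in range(2000):
    real = 3.0 + rng.normal(size=batch)
    fake = theta + rng.normal(size=batch)

    # --- discriminator step: minimize -log D(real) - log(1 - D(fake)) ---
    d_real, d_fake = sigmoid(w * real + b), sigmoid(w * fake + b)
    g_logit = np.concatenate([d_real - 1.0, d_fake - 0.0])  # dBCE/dlogit = D - target
    xs = np.concatenate([real, fake])
    w -= lr_d * np.mean(g_logit * xs)
    b -= lr_d * np.mean(g_logit)

    # --- generator step (non-saturating): minimize -log D(fake) ---
    d_fake = sigmoid(w * fake + b)
    # d(-log D)/d fake = -(1 - D) * w, and d fake / d theta = 1
    theta -= lr_g * np.mean(-(1.0 - d_fake) * w)
```

With these settings the generator's offset drifts toward the data mean as the two players push against each other.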

SLIDE 36

DCGAN

  • First paper to successfully use CNNs with GANs
  • Enabled by components that were novel at the time, like batch normalization and ReLUs

Image Credit: Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, Radford et al.

SLIDE 37

InfoGAN

Diagram: sample (z, c) → generator G → fake image; discriminator D, with an auxiliary head that maximizes the mutual information between the latent code c and the generated sample.

Figure: varying the latent code c.

Image Credit: InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets, Chen et al.

SLIDE 38

Code example


Generative Adversarial Network (gan.ipynb)

SLIDE 39

Conditional GANs (CGANs)

  • ≈ learning a mapping between images from example pairs
  • Approximates sampling from a conditional distribution p(y | x)

Image Credit: Image-to-Image Translation with Conditional Adversarial Nets, Isola et al.

SLIDE 40

Conditional GANs

Discriminator training: pair (x, y) from the dataset vs. generated pair (x, G(x, z)); loss: −log D(x, y) − log(1 − D(x, G(x, z)))

Generator training: sample z ~ p(z); loss: −log D(x, G(x, z)), with the discriminator held fixed

Image Credit: Image-to-Image Translation with Conditional Adversarial Nets, Isola et al.

SLIDE 41

Conditional GANs: Low Variation per Condition

  • The noise z is often omitted in favor of dropout in the generator

Discriminator training: pair (x, y) from the dataset vs. (x, G(x)); loss: −log D(x, y) − log(1 − D(x, G(x)))

Generator training: loss: −log D(x, G(x)), with the discriminator held fixed

Image Credit: Image-to-Image Translation with Conditional Adversarial Nets, Isola et al.

SLIDE 42

Demos

CGAN: https://affinelayer.com/pixsrv/index.html

SLIDE 43

CycleGANs

  • Less supervision than CGANs: mapping between unpaired datasets
  • Two GANs + cycle consistency

Image Credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, Zhu et al.

SLIDE 44

CycleGAN: Two GANs …

  • Not conditional, so this alone does not constrain generator input and output to match

Diagram: G1: X → Y with discriminator D1; G2: Y → X with discriminator D2; inputs and outputs are not constrained to match yet

Image Credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, Zhu et al.

SLIDE 45

CycleGAN: … and Cycle Consistency

Diagram: x → G1 → G2(G1(x)), with L1 loss function ‖x − G2(G1(x))‖₁; y → G2 → G1(G2(y)), with L1 loss function ‖y − G1(G2(y))‖₁

Image Credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, Zhu et al.
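The cycle-consistency term itself is easy to state in code; a hedged numpy sketch with hypothetical toy generators:

```python
import numpy as np

def l1(a, b):
    return np.mean(np.abs(a - b))

def cycle_consistency_loss(x, y, g1, g2):
    # g1: X -> Y, g2: Y -> X; both round trips should return the input
    return l1(x, g2(g1(x))) + l1(y, g1(g2(y)))

# Hypothetical toy "domains": Y is X scaled by 2, so the perfect
# generators are g1(x) = 2x and g2(y) = y / 2.
x = np.array([1.0, -2.0, 3.0])
y = np.array([4.0, 0.0, -6.0])
perfect = cycle_consistency_loss(x, y, lambda a: 2 * a, lambda a: a / 2)
broken = cycle_consistency_loss(x, y, lambda a: 2 * a + 1, lambda a: a / 2)
```

The loss is zero only when the two generators invert each other, which is the constraint that ties the unpaired domains together.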

SLIDE 46

Unstable Training

GAN training can be unstable. Three current research problems (which may be related):

  • Reaching a Nash equilibrium (where the gradient for both G and D is 0)
  • p(x) and p_θ(x) initially don't overlap
  • Mode collapse
SLIDE 47

GAN Training

  • Vector-valued loss: v(θ, φ) = (L_G(θ, φ), L_D(θ, φ))
  • In each iteration, gradient descent approximately follows this vector field over the joint parameter space (θ, φ)
SLIDE 48

Reaching Nash Equilibrium

Figure: an example gradient field; simultaneous gradient descent can orbit the Nash equilibrium instead of converging to it.

Image Credit: GANs are Broken in More than One Way: The Numerics of GANs, Ferenc Huszár

SLIDE 49

Reaching Nash Equilibrium

Solution attempt: relaxation, mixing the game's vector field with an extra term that points toward zero gradient magnitude:

  • full relaxation introduces bad Nash equilibria
  • no relaxation has cycles
  • a mixture of the two works sometimes

Image Credit: GANs are Broken in More than One Way: The Numerics of GANs, Ferenc Huszár

SLIDE 50

Generator and Data Distribution Don’t Overlap

Image Credit: Amortised MAP Inference for Image Super-resolution, Sønderby et al.

  • Instance noise: add noise to both generated and real images
  • Roth et al. suggest an analytic convolution with a Gaussian (Stabilizing Training of Generative Adversarial Networks through Regularization, Roth et al. 2017)
  • Wasserstein GANs: use the earth mover's distance (EMD) between p(x) and p_θ(x)
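Instance noise is almost a one-liner; a numpy sketch (illustrative; in practice the noise level is usually annealed toward 0 over training):

```python
import numpy as np

rng = np.random.default_rng(0)

def add_instance_noise(batch, sigma):
    # Add the same kind of Gaussian noise to real and generated images so
    # the two distributions overlap and the discriminator's gradients stay
    # informative even when the clean distributions are disjoint.
    return batch + sigma * rng.normal(size=batch.shape)

real = np.zeros((4, 8, 8))   # stand-ins for image batches
fake = np.ones((4, 8, 8))
sigma = 0.1
noisy_real = add_instance_noise(real, sigma)
noisy_fake = add_instance_noise(fake, sigma)
```

Both batches are perturbed with the same σ, which is what smears p(x) and p_θ(x) into overlapping distributions.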

SLIDE 51

Mode Collapse

Figure: generator samples after n training steps (5000 … 50000).

  • The generator only covers one or a few modes of p(x)
  • Optimal generator for a fixed discriminator: map every z to the single x that the discriminator currently rates most realistic

Image Credit: Wasserstein GAN, Arjovsky et al. Unrolled Generative Adversarial Networks, Metz et al.

SLIDE 52

Mode Collapse

Solution attempts:

  • Minibatch comparisons: the discriminator can compare instances within a minibatch (Improved Techniques for Training GANs, Salimans et al.)
  • Unrolled GANs: take k discriminator steps in each iteration, and backpropagate through all of them to update the generator

Figure: standard GAN vs. unrolled GAN with k = 5, after n training steps (5000 … 50000).

Image Credit: Wasserstein GAN, Arjovsky et al. Unrolled Generative Adversarial Networks, Metz et al.

SLIDE 53

Progressive GANs

  • Resolution is increased progressively during training
  • Also other tricks like using minibatch statistics and normalizing feature vectors

Image Credit: Progressive Growing of GANs for Improved Quality, Stability, and Variation, Karras et al.

SLIDE 54

Disentanglement

  • Entangled: different properties may be mixed up over all dimensions
  • Disentangled: different properties live in different dimensions

Figure: one axis varies the specified property (digit number, or character identity), the other axis varies the remaining unspecified properties.

Image Credit: Disentangling factors of variation in deep representations using adversarial training, Mathieu et al.
SLIDE 55

Summary

  • Autoencoders
      • Can infer a useful latent representation for a dataset
      • Bad generators
  • VAEs
      • Can infer a useful latent representation for a dataset
      • Better generators due to latent-space regularization
      • Lower-quality reconstructions and generated samples (usually blurry)
  • GANs
      • Cannot find a latent representation for a given sample (no encoder)
      • Usually better generators than VAEs
      • Currently unstable training (active research)
SLIDE 56

Thank you!

http://geometry.cs.ucl.ac.uk/dl4g/
