SLIDE 1

Adversarial Approaches to Bayesian Learning and Bayesian Approaches to Adversarial Robustness

Ian Goodfellow, OpenAI Research Scientist
NIPS 2016 Workshop on Bayesian Deep Learning
Barcelona, 2016-12-10

SLIDE 2

Speculation on Three Topics

  • Can we build a generative adversarial model of the posterior over parameters?

  • Adversarial variants of variational Bayes
  • Can Bayesian modeling solve adversarial examples?
SLIDE 3

Generative Modeling

  • Density estimation
  • Sample generation

[Figure: two panels — training examples and model samples]

SLIDE 4

Adversarial Nets Framework

[Diagram: x sampled from data feeds a differentiable function D, whose output D(x) tries to be near 1. Input noise z feeds a differentiable function G, producing x sampled from the model, which also feeds D. D tries to make D(G(z)) near 0; G tries to make D(G(z)) near 1.]
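The diagram is just two differentiable functions trained against each other, so it fits in a few lines of code. Below is a minimal sketch on toy 1-D data; PyTorch and the toy distribution are my choices (the slide does not prescribe them), and the generator uses the "make D(G(z)) near 1" heuristic named above:

```python
# Minimal adversarial nets sketch on toy 1-D data.
import torch
import torch.nn as nn

torch.manual_seed(0)
G = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1))
D = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1), nn.Sigmoid())
opt_G = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_D = torch.optim.Adam(D.parameters(), lr=1e-3)

for step in range(2000):
    x = torch.randn(64, 1) * 0.5 + 2.0   # "data": samples from N(2, 0.5^2)
    z = torch.randn(64, 1)               # input noise

    # Discriminator step: push D(x) toward 1 and D(G(z)) toward 0
    d_loss = -(torch.log(D(x)) + torch.log(1 - D(G(z).detach()))).mean() / 2
    opt_D.zero_grad()
    d_loss.backward()
    opt_D.step()

    # Generator step: push D(G(z)) toward 1 (non-saturating heuristic)
    g_loss = -torch.log(D(G(z))).mean()
    opt_G.zero_grad()
    g_loss.backward()
    opt_G.step()

print(G(torch.randn(1000, 1)).mean().item())  # drifts toward the data mean, 2.0
```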

SLIDE 5

Minimax Game

  • Equilibrium is a saddle point of the discriminator loss
  • Resembles Jensen-Shannon divergence
  • Generator minimizes the log-probability of the discriminator being correct

J^{(D)} = -\frac{1}{2}\,\mathbb{E}_{x \sim p_{\text{data}}} \log D(x) - \frac{1}{2}\,\mathbb{E}_{z} \log\left(1 - D(G(z))\right), \qquad J^{(G)} = -J^{(D)}
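To make "resembles Jensen-Shannon divergence" precise: substituting the optimal discriminator (next slide) into the value function gives, as shown in the original GAN paper,

```latex
\begin{aligned}
C(G) &= \mathbb{E}_{x \sim p_{\text{data}}}\!\left[\log \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_{\text{model}}(x)}\right]
      + \mathbb{E}_{x \sim p_{\text{model}}}\!\left[\log \frac{p_{\text{model}}(x)}{p_{\text{data}}(x) + p_{\text{model}}(x)}\right] \\
     &= -\log 4 + 2\,\mathrm{JSD}\!\left(p_{\text{data}} \,\middle\|\, p_{\text{model}}\right),
\end{aligned}
```

so the game value is minimized exactly when p_model = p_data and the JSD term vanishes.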

SLIDE 6

Discriminator Strategy

The optimal D(x) for any p_data(x) and p_model(x) is always

D^*(x) = \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_{\text{model}}(x)}

[Figure: data distribution, model distribution, and the discriminator's output plotted along x]

Estimating this ratio using supervised learning is the key approximation mechanism used by GANs
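A quick way to see this ratio estimation in action: train any probabilistic classifier to separate "data" samples from "model" samples, and read the ratio off its output as D/(1 − D). A minimal sketch with two Gaussians standing in for p_data and p_model (scikit-learn and the toy distributions are my choices, not the slides'):

```python
# Sketch: recovering a density ratio from a binary classifier.
# A classifier trained to output P(real | x) approaches
# p_data(x) / (p_data(x) + p_model(x)), so the ratio is D / (1 - D).
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
x_data = rng.normal(0.0, 1.0, size=(5000, 1))   # stand-in for p_data
x_model = rng.normal(1.0, 1.0, size=(5000, 1))  # stand-in for p_model

X = np.vstack([x_data, x_model])
y = np.concatenate([np.ones(5000), np.zeros(5000)])  # 1 = "real"
clf = LogisticRegression().fit(X, y)

x = np.array([[0.5]])
d = clf.predict_proba(x)[0, 1]          # estimate of D*(x)
print("estimated ratio:", d / (1 - d))  # ~ p_data(x) / p_model(x)
# True ratio for these Gaussians is exp(0.5 - x), i.e. 1.0 at x = 0.5.
```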

SLIDE 7

High quality samples from complicated distributions

SLIDE 8

Speculative idea: generator nets for sampling from the posterior

  • Practical obstacle:
  • Parameters lie in a much higher dimensional space than observed inputs
  • Possible solution:
  • Maybe the posterior does not need to be extremely complicated
  • HyperNetworks (Ha et al 2016) seem to be able to model a distribution on parameters
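A hedged sketch of what a hypernetwork-style parameter generator could look like: a small net maps noise z to a flat vector that is reshaped into the weights and biases of a separate target network. The names and sizes here are hypothetical toy choices, not taken from Ha et al:

```python
# Sketch: a generator net whose *output* is the parameter vector of a
# small target network (hypernetwork-style). Toy sizes throughout.
import torch
import torch.nn as nn
import torch.nn.functional as F

IN, HID, OUT = 4, 8, 1
n_params = IN * HID + HID + HID * OUT + OUT  # target net's weights + biases

gen = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, n_params))

def target_forward(x, theta):
    """Run the target MLP using a flat parameter sample theta."""
    i = 0
    W1 = theta[i:i + IN * HID].view(HID, IN); i += IN * HID
    b1 = theta[i:i + HID]; i += HID
    W2 = theta[i:i + HID * OUT].view(OUT, HID); i += HID * OUT
    b2 = theta[i:i + OUT]
    return F.linear(torch.relu(F.linear(x, W1, b1)), W2, b2)

z = torch.randn(32)       # one noise draw = one "posterior sample"
theta = gen(z)            # generated parameter vector for the target net
x = torch.randn(5, IN)
print(target_forward(x, theta).shape)  # torch.Size([5, 1])
```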

SLIDE 9

Theoretical problems

  • A naive application of GANs to generating parameters would require samples of the parameters from the true posterior
  • We only have samples of the data that were generated using the true posterior

SLIDE 10

HMC approach?

\frac{p(X \mid \theta)}{p(X \mid \theta^*)} = \prod_i \frac{p(x^{(i)} \mid \theta)}{p(x^{(i)} \mid \theta^*)}

  • Allows estimation of unnormalized likelihoods via discriminator
  • Drawbacks:
  • Discriminator needs to be re-optimized after visiting each new parameter value
  • For the likelihood estimate to be a function of the parameters, we must include the discriminator learning process in the graph for the estimate, as in unrolled GANs (Metz et al 2016)
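To illustrate where such a ratio estimate would plug in, here is a sketch of a plain Metropolis-Hastings sampler (swapping MH in for HMC to keep the sketch short) over a 1-D toy model. `ratio_from_discriminator` is a hypothetical placeholder for the discriminator-based estimator described above; for the demo it returns the exact ratio, so the chain targets the true posterior under a flat prior:

```python
# Sketch: Metropolis-Hastings over theta where the likelihood ratio
# p(X|theta_new)/p(X|theta_old) would come from a discriminator.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(1.5, 1.0, size=100)  # toy data: N(theta_true = 1.5, 1)

def ratio_from_discriminator(X, theta_new, theta_old):
    # Hypothetical placeholder for the discriminator-based estimate.
    # Here: the exact ratio for the Gaussian toy model (constants cancel).
    ll = lambda t: -0.5 * np.sum((X - t) ** 2)
    return np.exp(ll(theta_new) - ll(theta_old))

theta, samples = 0.0, []
for _ in range(5000):
    prop = theta + rng.normal(0, 0.3)   # random-walk proposal
    if rng.random() < min(1.0, ratio_from_discriminator(X, prop, theta)):
        theta = prop
    samples.append(theta)

print(np.mean(samples[1000:]))  # ~ posterior mean, near 1.5
```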

SLIDE 11

Variational Bayes

[Graphical model: z → x]

  • \log p(x) \ge \log p(x) - D_{\mathrm{KL}}\left(q(z) \,\|\, p(z \mid x)\right) = \mathbb{E}_{z \sim q} \log p(x, z) + H(q)

  • Same graphical model structure as GANs
  • Often limited by expressivity of q
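The bound above is easy to check numerically on a conjugate toy model where log p(x) is available in closed form (z ∼ N(0,1), x | z ∼ N(z,1) is my choice of example, not the slides'); when q equals the exact posterior, the Monte Carlo ELBO matches log p(x):

```python
# Sketch: Monte Carlo estimate of
#   log p(x) >= E_{z~q}[log p(x, z)] + H(q)
# for z ~ N(0,1), x|z ~ N(z,1), where log p(x) = log N(x; 0, 2) is known.
import numpy as np

rng = np.random.default_rng(0)
x = 1.0

def log_joint(x, z):  # log p(x, z) = log p(z) + log p(x|z)
    return (-0.5 * z**2 - 0.5 * np.log(2 * np.pi)
            - 0.5 * (x - z)**2 - 0.5 * np.log(2 * np.pi))

mu, sigma = 0.5, np.sqrt(0.5)            # exact posterior: N(x/2, 1/2)
z = rng.normal(mu, sigma, size=100_000)  # z ~ q
entropy = 0.5 * np.log(2 * np.pi * np.e * sigma**2)  # H(q) for a Gaussian
elbo = log_joint(x, z).mean() + entropy

exact = -0.25 * x**2 - 0.5 * np.log(2 * np.pi * 2)   # log N(x; 0, 2)
print(elbo, exact)  # equal up to MC error, since q matches the posterior
```

With a less expressive q, the gap between the two printed numbers is exactly the KL term in the bound, which motivates the arbitrary-capacity q on the next slide.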
SLIDE 12

Arbitrary capacity posterior via backwards GAN

[Diagram: generation process (z → x) next to the posterior sampling process, which maps x together with auxiliary noise u back to z]

SLIDE 13

Related variants

  • Adversarial autoencoder (Makhzani et al 2015)
  • Adversarial training of encoder
  • Restricted encoder
  • Makes aggregate approximate posterior indistinguishable from prior, rather than approximate posterior indistinguishable from true posterior
  • Uses variational lower bound for training decoder
SLIDE 14

ALI / BiGAN

  • Adversarially Learned Inference (Dumoulin et al 2016)

  • Gaussian encoder
  • BiGAN (Donahue et al 2016)
  • Deterministic encoder
SLIDE 15

Adversarial Examples

[Figure: "panda" at 58% confidence, plus an imperceptible perturbation, is classified as "gibbon" at 99% confidence]
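This panda/gibbon figure was produced with the fast gradient sign method (Goodfellow et al 2014): perturb the input by ε · sign(∇_x loss). A minimal sketch with a stand-in classifier (a real attack would use a pretrained image model; the toy sizes here are my choices):

```python
# Sketch: fast gradient sign method (FGSM),
#   x_adv = clip(x + eps * sign(grad_x loss)).
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(784, 10))   # stand-in classifier
loss_fn = nn.CrossEntropyLoss()

x = torch.rand(1, 784, requires_grad=True)  # stand-in "image" in [0, 1]
y = torch.tensor([3])                       # true label
eps = 0.25                                  # perturbation size

loss = loss_fn(model(x), y)
loss.backward()                             # populates x.grad
x_adv = (x + eps * x.grad.sign()).detach().clamp(0, 1)

print(model(x).argmax().item(), model(x_adv).argmax().item())
```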

SLIDE 16

Overly linear, increasingly confident extrapolation

[Figure: the argument to the softmax extrapolating linearly, with growing confidence, far beyond the training data]

SLIDE 17

Designing priors on latent factors

  • Both of these two-class mixture models implement roughly the same marginal over x, with very different posteriors over the classes. The likelihood criterion cannot strongly prefer one to the other, and in many cases will prefer the bad one.
SLIDE 18

RBFs are better than linear models

[Figure: two panels — attacking a linear model vs. attacking an RBF model]

SLIDE 19

Possible Bayesian solutions

  • Bayesian neural network
  • Better confidence estimates might solve the problem
  • So far, has not worked, but may just need more effort
  • Variational approach
  • MC dropout
  • Regularize neural network to emulate Bayesian model with RBF kernel (amortized inference of Bayesian model)
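The "MC dropout" item refers to Gal and Ghahramani's trick of leaving dropout active at test time and treating the spread of stochastic forward passes as an uncertainty estimate. A minimal sketch (the toy architecture is my choice):

```python
# Sketch: MC dropout — average many stochastic forward passes at test
# time and use their spread as a confidence estimate.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(10, 64), nn.ReLU(),
                      nn.Dropout(p=0.5), nn.Linear(64, 3))
model.train()  # keep dropout active even at "test" time

x = torch.randn(1, 10)
with torch.no_grad():
    probs = torch.stack([torch.softmax(model(x), dim=-1)
                         for _ in range(100)])

mean, std = probs.mean(0), probs.std(0)
print(mean, std)  # high std -> the prediction should not be trusted
```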

SLIDE 20

Universal engineering machine (model-based optimization)

[Figure: model fit to training data, and its extrapolation beyond them]

Make new inventions by finding input that maximizes model’s predicted performance
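Concretely, this means running gradient ascent on the *input* of a trained model. The sketch below (an untrained toy model standing in for a fitted one) shows why this collides with adversarial examples: nothing stops the optimizer from pushing x into regions where the model extrapolates with unearned confidence:

```python
# Sketch: model-based optimization by gradient ascent on the input.
# The same input gradients that make adversarial examples can drive x
# far from the training data, where predictions are overconfident.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(5, 32), nn.ReLU(), nn.Linear(32, 1))
# (assume model was already fit to (design, measured-performance) pairs)

x = torch.zeros(1, 5, requires_grad=True)  # candidate "design"
opt = torch.optim.Adam([x], lr=0.1)
for _ in range(200):
    opt.zero_grad()
    loss = -model(x).sum()   # ascend the model's predicted performance
    loss.backward()
    opt.step()

print(model(x).item(), x.norm().item())  # prediction climbs as ||x|| grows
```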

SLIDE 21

Conclusion

  • Generative adversarial nets may be able to
  • Sample from the Bayesian posterior over parameters
  • Implement an arbitrary capacity q for variational Bayes
  • Bayesian learning may be able to solve the adversarial example problem and unlock the potential of model-based optimization