Computational Systems Biology Deep Learning in the Life Sciences - - PowerPoint PPT Presentation

computational systems biology deep learning in the life
SMART_READER_LITE
LIVE PREVIEW

Computational Systems Biology Deep Learning in the Life Sciences - - PowerPoint PPT Presentation

Computational Systems Biology Deep Learning in the Life Sciences 6.802 6.874 20.390 20.490 HST.506 David Gifford Lecture 9 March 5, 2020 Generative Models http://mit6874.github.io 1 Why generative models? We can sample new examples from a


slide-1
SLIDE 1

Computational Systems Biology Deep Learning in the Life Sciences

6.802 6.874 20.390 20.490 HST.506

David Gifford Lecture 9 March 5, 2020

Generative Models

http://mit6874.github.io

1

slide-2
SLIDE 2

Why generative models?

slide-3
SLIDE 3

We can sample new examples from a generative models

  • Generate new examples from model fit to

training data

  • Sampling from input distribution
  • Optionally optimized with respect to a metric
  • Reveals what models understand
  • What is the best example of a written digit?
  • What is the best example of a celebrity?
  • Transform examples with respect to one or more

metrics

  • Improve sentiment of text
  • Perform multi-objective optimization of antibodies
slide-4
SLIDE 4

https://www.cs.toronto.edu/~rgrosse/courses/csc321_2018/slides/lec19.pdf

slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8

Three example generative models

  • Variational autoencoders
  • Generative Adversarial Networks
  • CycleGANs
  • For each you should understand the loss function
slide-9
SLIDE 9

Variational Autoencoders can provide improved examples

slide-10
SLIDE 10
slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14

Why is this important? Why does it make the task difficult?

slide-15
SLIDE 15

Why is this important? Find plausible revisions Why does it make the task difficult? p(z | x) intractable

slide-16
SLIDE 16
slide-17
SLIDE 17
slide-18
SLIDE 18
slide-19
SLIDE 19
slide-20
SLIDE 20
slide-21
SLIDE 21

Overall VAE loss function

slide-22
SLIDE 22
slide-23
SLIDE 23
slide-24
SLIDE 24
slide-25
SLIDE 25
slide-26
SLIDE 26
slide-27
SLIDE 27
slide-28
SLIDE 28
slide-29
SLIDE 29
slide-30
SLIDE 30

Generative Adversarial Networks

slide-31
SLIDE 31

We wish to learn a generative model that matches the true data distribution

slide-32
SLIDE 32

https://www.cs.toronto.edu/~rgrosse/courses/csc321_2018/slides/lec19.pdf

The Generative Adversarial Network (GAN) Game

slide-33
SLIDE 33

D(x) is probability x is from the real-world

slide-34
SLIDE 34
slide-35
SLIDE 35

Update discriminator to maximize D(real) and minimize D(G(z))

slide-36
SLIDE 36

Update generator to maximize D(G(z))

slide-37
SLIDE 37

Summary of GAN objective functions

slide-38
SLIDE 38

GANs have become a bit of a fad

slide-39
SLIDE 39
slide-40
SLIDE 40
slide-41
SLIDE 41
slide-42
SLIDE 42

GANs can fail

slide-43
SLIDE 43

GAN Problems

  • Non-convergence – model parameters oscillate and never

converge

  • Mode collapse – limited variety of samples from generator
  • Diminished gradient – Discriminator is too successful and

generator learns nothing

  • Overfitting – imbalance between generator and discriminator
  • Hyperparameter sensitivity – highly sensitive

https://medium.com/@jonathan_hui/gan-why-it-is-so-hard-to-train-generative-advisory-networks-819a86b3750b

slide-44
SLIDE 44

Mode collapse shown in second row (all 6) and in images

https://arxiv.org/pdf/1611.02163.pdf https://arxiv.org/pdf/1703.10717.pdf

slide-45
SLIDE 45

CycleGANs for style mapping

slide-46
SLIDE 46

CycleGANs map between styles

slide-47
SLIDE 47

CycleGANs permit style transfer without matched training data

slide-48
SLIDE 48

CycleGANs permit style transfer without matched training data

https://arxiv.org/pdf/1703.10593.pdf

slide-49
SLIDE 49

CycleGANs permit style transfer without matched training data

slide-50
SLIDE 50
slide-51
SLIDE 51

CycleGANs permit style transfer without matched training data

slide-52
SLIDE 52

CycleGANs permit style transfer without matched training data