Learning J.J. (Jia-Jie) Zhu Boston College GAAL 2 Active - - PowerPoint PPT Presentation

learning
SMART_READER_LITE
LIVE PREVIEW

Learning J.J. (Jia-Jie) Zhu Boston College GAAL 2 Active - - PowerPoint PPT Presentation

Generative Adversarial Active Learning J.J. (Jia-Jie) Zhu Boston College GAAL 2 Active learning classify 400 instances 30 randomly selected 30 selected using AL using logistic regression 70% accuracy 90% accuracy pool-based active


slide-1
SLIDE 1

Generative Adversarial Active Learning

J.J. (Jia-Jie) Zhu Boston College

slide-2
SLIDE 2

GAAL

2

slide-3
SLIDE 3

Active learning

classify 400 instances using logistic regression 30 randomly selected 70% accuracy 30 selected using AL 90% accuracy pool-based active learning cycle

3

source: Settles ’10

slide-4
SLIDE 4

GAAL

4

slide-5
SLIDE 5

GAN

5

slide-6
SLIDE 6

Intuition of GAN

The goal is to train a generator that generates “fake” data that looks as if it is “real”. (Think counterfeit bills)

  • We let player 1 (discriminator D) and player 2 (generator G) play an adversarial game.
  • G tries to generate “fake” data to fool D while D tries to tell “real” from “fake”.
  • Both players keep getting better by playing the game. In the end, we obtain a “good”

generator. This amounts to solving the optimization problem

6

slide-7
SLIDE 7

How GAN works

D G

Real Fake Data Training

Main idea: match the distributions

image: Radford et al.

7 Real! Fake!

slide-8
SLIDE 8

GAAL

8

slide-9
SLIDE 9

Intuition of GAAL

Traditional AL (pool-based) Can we synthesize an informative data sample on demand? We need to generate samples that follow the same distribution as the given data

9

slide-10
SLIDE 10

Algorithm sketch

Pool-based AL GAAL

10

slide-11
SLIDE 11

Experiments

Generated images

11

What’s not working? * Cats vs dogs * Some images are garbage

slide-12
SLIDE 12

Summary of GAAL

  • Generalize GAAL to other domains
  • GAN is relatively unreliable as a query generator
  • We do not understand the bounds for label complexity yet
  • The first work to report satisfactory results in active learning synthesis for image

classification

  • The first GAN application to active learning
  • The framework can be thought of as generate data that is adaptive to the current learner
  • An interesting idea. Apply similar ideas to RL/control?

12