Minimally Naturalistic AI Steven Hansen Outline 1. The Allegory - - PowerPoint PPT Presentation

minimally naturalistic ai
SMART_READER_LITE
LIVE PREVIEW

Minimally Naturalistic AI Steven Hansen Outline 1. The Allegory - - PowerPoint PPT Presentation

Minimally Naturalistic AI Steven Hansen Outline 1. The Allegory of the Play-doh 2. No Free Lunch 3. Meta-Learning 4. Imitation Learning 5. Moving Forward 6. Suggestions for COMM-AI Allegory Time! Imagine you have a ball of play-doh


slide-1
SLIDE 1

Minimally Naturalistic AI

Steven Hansen

slide-2
SLIDE 2

Outline

1. The Allegory of the Play-doh 2. No Free Lunch 3. Meta-Learning 4. Imitation Learning 5. Moving Forward 6. Suggestions for COMM-AI

slide-3
SLIDE 3

Allegory Time!

Imagine you have a ball of play-doh

slide-4
SLIDE 4

Allegory Time!

Make a door-stop

slide-5
SLIDE 5

Allegory Time!

Make a paper-weight

slide-6
SLIDE 6

Allegory Time!

Make a spear

slide-7
SLIDE 7

Allegory Time!

Make an hamburger

slide-8
SLIDE 8

Allegory Time!

Make a computer

slide-9
SLIDE 9

Works fine:

  • Door-stop
  • Paper-weight

Needs to be harder:

  • spear

Needs to be more edible:

  • hamburger

Needs to be more of a superconductor:

  • computer

Allegory Time!

slide-10
SLIDE 10

Moral

  • Inductive biases must sharpen as task complexity rises
  • The closer we get to human-level AI, the more naturalistic the tasks we

must train on

slide-11
SLIDE 11

No Free Lunch

? ? ?

slide-12
SLIDE 12

No Free Lunch

? ? ?

slide-13
SLIDE 13

But deep learning just works...

  • Explicit priors aren’t the only way we shape the inductive bias
  • Convolutions and 2D equivariance
  • RNNs and repeated computation
  • Clockwork-RNNs and periodicity
  • NTMs and... turing machines
slide-14
SLIDE 14

Meta-learning / Learning-to-learn

  • From tasks to task distributions
  • Learn an algorithm that can generalize

from few samples

  • “One-shot learning with

Memory-Augmented Neural Networks”

○ Santoro et al 2016

slide-15
SLIDE 15

An Even Less Free Lunch

slide-16
SLIDE 16

An Even Less Free Lunch

slide-17
SLIDE 17

Imitation Learning

  • Inverse reinforcement learning, apprenticeship learning, goal inference

○ Supervised learning++

  • Learn to copy your mentor by inferring their values/goal

○ Generalize better than copying behavior

  • Who is the mentor? A human or a program written by one.
  • Are we worth copying in artificial environments?
  • Would our goals in such environments have the same structure as in

natural environments?

  • These questions bound the naturalism required
slide-18
SLIDE 18

Moving Forward

1. Identify the next milestone where humans outperform AI 2. Look for the regularities in that environment and in human performance 3. Create artificial environments that still contain those regularities 4. Look again if the AI fails to scale the real thing 5. Remember that the regularities needed might include any previous encountered environments!

slide-19
SLIDE 19

Some Regularities for COMM-AI

  • Communication tasks tend to be encountered in a structured way

○ The participants take into account each other's intelligence ○ Tasks tend to be somewhat periodic

  • Communication is grounded in sensory modalities

○ Visual structure ○ Auditory emotion cues

  • Communication allows for rich feedback

○ Observations of coherent episodes ○ Occasional corrections

slide-20
SLIDE 20

Credit: Drew Purves

slide-21
SLIDE 21

Questions?

Thanks for listening!