General Atari 2600 Game Playing Michael Bowling Work with: Joel - - PowerPoint PPT Presentation

general atari 2600 game playing
SMART_READER_LITE
LIVE PREVIEW

General Atari 2600 Game Playing Michael Bowling Work with: Joel - - PowerPoint PPT Presentation

General Atari 2600 Game Playing Michael Bowling Work with: Joel Veness, Marc Bellemare, Anna Koop, Mostafa Vafadoost http://www.arcadelearningenvironment.org Friday, September 14, 2012 Friday, September 14, 2012


slide-1
SLIDE 1

General Atari 2600 Game Playing

Michael Bowling Work with: Joel Veness, Marc Bellemare, Anna Koop, Mostafa Vafadoost

http://www.arcadelearningenvironment.org

Friday, September 14, 2012

slide-2
SLIDE 2

Friday, September 14, 2012

slide-3
SLIDE 3

http://www.arcadelearningenvironment.org

Friday, September 14, 2012

slide-4
SLIDE 4

Varied Many Independent Interesting

Friday, September 14, 2012

slide-5
SLIDE 5

Friday, September 14, 2012

slide-6
SLIDE 6

Friday, September 14, 2012

slide-7
SLIDE 7

A.I.

0100010101...00001110101

Reinforcement Learning

Friday, September 14, 2012

slide-8
SLIDE 8

A.I.

0100010101...00001110101

Planning

Friday, September 14, 2012

slide-9
SLIDE 9

A.I.

Model

Model Learning

Friday, September 14, 2012

slide-10
SLIDE 10

A.I.

Expert Imitation/Apprenticeship Learning

Friday, September 14, 2012

slide-11
SLIDE 11

A.I.

. . .

Transfer Learning

Pitfall! Pitfall II

Friday, September 14, 2012

slide-12
SLIDE 12

A.I.

Intrinsic Motivation

Friday, September 14, 2012

slide-13
SLIDE 13

Friday, September 14, 2012

slide-14
SLIDE 14

Training Games Testing Games

Friday, September 14, 2012

slide-15
SLIDE 15

Friday, September 14, 2012

slide-16
SLIDE 16

Contingency Awareness: knowing what you control

(Bellemare et al., AAAI 2012)

Friday, September 14, 2012

slide-17
SLIDE 17

(Bellemare et al., AAAI 2012)

Contingency Awareness: knowing what you control Unaware Contingency Aware

Friday, September 14, 2012

slide-18
SLIDE 18

(Bellemare et al., AAAI 2012)

Contingency Awareness: knowing what you control

1.0 0.5 0.0 0.2 0.4 0.6 0.8 1.0

Inter-Algorithm Score Fraction of Games

Inter-Algorithm Score Distribution

MaxCol Extended MaxCol Basic Extended

Friday, September 14, 2012

slide-19
SLIDE 19

Sketch-Based Hashing: tug-of-war vs. standard hashing

(Bellemare et al., NIPS 2012)

1.0 0.5 0.0 0.2 0.4 0.6 0.8 1.0

Fraction of games Inter-algorithm score Tug-of-War Standard

Hash Table Size: 1000

1.0 0.5 0.0 0.2 0.4 0.6 0.8 1.0

Fraction of games Inter-algorithm score Tug-of-War Standard

Hash Table Size: 5000

1.0 0.5 0.0 0.2 0.4 0.6 0.8 1.0

Fraction of games Inter-algorithm score Tug-of-War Standard

Hash Table Size: 20,000

55 Testing Games

Friday, September 14, 2012

slide-20
SLIDE 20

Model Learning: pixels, probabilities, and priors

(Bellemare et al., In Prep)

Friday, September 14, 2012

slide-21
SLIDE 21

Model Learning: pixels, probabilities, and priors

(Bellemare et al., In Prep)

Friday, September 14, 2012

slide-22
SLIDE 22

http://www.arcadelearningenvironment.org

Questions?

Source code for ALE and all agents available!

Friday, September 14, 2012

slide-23
SLIDE 23

Friday, September 14, 2012

slide-24
SLIDE 24

Will there be a competition? No vs.

Friday, September 14, 2012