Interactive Reinforcement Learning Human Generated Reward - - PowerPoint PPT Presentation

interactive reinforcement learning human generated reward
SMART_READER_LITE
LIVE PREVIEW

Interactive Reinforcement Learning Human Generated Reward - - PowerPoint PPT Presentation

Interactive Reinforcement Learning Human Generated Reward Presentation for Summer Camp 2015 May 25 2015 Reinforcement Learning Trial and error learning Explore and exploit Sutton and Barto 1988 Represent, predict and control


slide-1
SLIDE 1

Interactive 
 Reinforcement Learning Human Generated Reward

Presentation for Summer Camp 2015 May 25 2015

slide-2
SLIDE 2
slide-3
SLIDE 3

Reinforcement Learning

  • Trial and error learning
  • Explore and exploit
  • Represent, predict and control
  • Connect actions with rewards
  • Maximize future reward

Sutton and Barto 1988

slide-4
SLIDE 4

Interactive Machine Learning

Fails and Olsen Jr. 2003

slide-5
SLIDE 5

Human Generated Reward

  • Humans know more!
  • Shaping systems to adapt
  • Effectively reward learning
  • Transfer learning through collaboration
  • How can RL harness human reward?

Knox and Stone 2012

slide-6
SLIDE 6

Kuhlmann et al. 2004

Learning from Advice Learning from Shaping

Blumberg et al. 2002 Thomaz et al. 2006

Learning from Demonstration

Left: Argall et al. 2010
 Right: Koenemann et al. 2014

slide-7
SLIDE 7

Learning from Trial and Error

Levine et al. 2015

Learning from Refinement

Cakmak et al. 2012

slide-8
SLIDE 8

Application

  • Shared control
  • Augmented representation
  • Integrate human and 


non-human interaction

  • Autonomous prosthetics