SLIDE 1
Motivation: Human-AI Collaboration
2
Commits to policy !" (Best) responds to !"
Behavioral differences Agents have different models of the world
Task
[Dimitrakakis et al., NIPS 2017]
Helper-AI Human Agent A1 Agent A2
Learning to Collaborate in Markov Decision Processes Goran Radanovic - - PowerPoint PPT Presentation
Learning to Collaborate in Markov Decision Processes Goran Radanovic , Rati Devidze, David C. Parkes, Adish Singla Motivation: Human-AI Collaboration Example setting Helper-AI Human Agent A1 Agent A2 Task (Best) responds Commits to to !
2
Commits to policy !" (Best) responds to !"
Task
[Dimitrakakis et al., NIPS 2017]
Helper-AI Human Agent A1 Agent A2
3
Commits to policy !" Agent A2 !# changes
Task
Helper-AI Human Agent A1
4
Agent A1
5
[Even-Dar et al., NIPS 2005]
)./
6
, , . / ), provided that the magnitude change
7