A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
Unnat Jain1*, Luca Weihs2*, Eric Kolve2, Ali Farhadi3, Svetlana Lazebnik1, Aniruddha Kembhavi2,3, Alexander Schwing1
* Equal contribution by UJ and LW
ECCV 2020 (Spotlight)
Jain* and Weihs* et al. “Two Body Problem: Collaborative Visual Task Completion” in CVPR 2019
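Effective joint policy of two marginal agents: $\Pi = \pi^1 \otimes \pi^2$ (rank 1)
Agent 1: $\pi^1 = (0.32, 0.68)$
Agent 2: $\pi^2 = (0.72, 0.06, 0.08, 0.14)$
$\Pi = \pi^1 \otimes \pi^2 = \begin{bmatrix} 0.23 & 0.02 & 0.03 & 0.04 \\ 0.49 & 0.04 & 0.05 & 0.10 \end{bmatrix}$ (rank 1)
$\Pi^* = \begin{bmatrix} 0.29 & 0 & 0.03 & 0 \\ 0.43 & 0.06 & 0.05 & 0.14 \end{bmatrix}$ (rank 2)
L1 error between $\Pi$ and $\Pi^*$: 0.26
Rows are indexed by Agent 1's actions, columns by Agent 2's actions.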
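The rank-1 claim and the L1 error of 0.26 can be checked numerically. The sketch below is an illustration, not the authors' code: the helper names (`outer`, `l1_error`, `is_rank_one`) are hypothetical, the rows-by-Agent-1 layout is an assumption, and the two zero entries of the target are inferred from the requirement that its marginals equal the two agent policies.

```python
from itertools import product

def outer(p1, p2):
    """Joint policy of two independently sampling agents: P[a][b] = p1[a] * p2[b]."""
    return [[a * b for b in p2] for a in p1]

def l1_error(P, Q):
    """Sum of absolute entrywise differences between two joint policies."""
    return sum(abs(p - q) for rp, rq in zip(P, Q) for p, q in zip(rp, rq))

def is_rank_one(P, tol=1e-9):
    """A nonzero matrix has rank 1 exactly when every 2x2 minor vanishes."""
    rows, cols = range(len(P)), range(len(P[0]))
    return all(
        abs(P[i][k] * P[j][l] - P[i][l] * P[j][k]) <= tol
        for i, j in product(rows, rows)
        for k, l in product(cols, cols)
    )

pi1 = [0.32, 0.68]              # Agent 1's marginal policy (2 actions)
pi2 = [0.72, 0.06, 0.08, 0.14]  # Agent 2's marginal policy (4 actions)

# Target joint policy; zero placement is an assumption (see the lead-in).
target = [[0.29, 0.00, 0.03, 0.00],
          [0.43, 0.06, 0.05, 0.14]]

joint = outer(pi1, pi2)
print(is_rank_one(joint), is_rank_one(target))  # True False
print(round(l1_error(joint, target), 2))        # 0.26
```

No choice of two marginal policies can remove this error entirely, because any outer product is rank 1 while the target is rank 2.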
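Mixture weights: $\alpha_1 = 0.2$, $\alpha_2 = 0.8$
Agent 1 policies: $\pi^1_1 = (0, 1)$, $\pi^1_2 = (0.4, 0.6)$
Agent 2 policies: $\pi^2_1 = (0, 0.3, 0, 0.7)$, $\pi^2_2 = (0.9, 0, 0.1, 0)$
$\alpha_1 \cdot (\pi^1_1 \otimes \pi^2_1) + \alpha_2 \cdot (\pi^1_2 \otimes \pi^2_2) = \begin{bmatrix} 0.29 & 0 & 0.03 & 0 \\ 0.43 & 0.06 & 0.05 & 0.14 \end{bmatrix} = \Pi^*$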
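As a check on the numbers in this slide, the following sketch sums two weighted outer products and recovers the rank-2 target to two decimals. The function name `mixture_of_marginals` is hypothetical, and both the component ordering and the positions of the zero entries are assumptions chosen so that the arithmetic matches the slide's values.

```python
def mixture_of_marginals(alphas, agent1_policies, agent2_policies):
    """Return sum_j alphas[j] * outer(agent1_policies[j], agent2_policies[j])."""
    n1, n2 = len(agent1_policies[0]), len(agent2_policies[0])
    joint = [[0.0] * n2 for _ in range(n1)]
    for alpha, p1, p2 in zip(alphas, agent1_policies, agent2_policies):
        for a in range(n1):
            for b in range(n2):
                joint[a][b] += alpha * p1[a] * p2[b]
    return joint

alphas = [0.2, 0.8]                       # mixture weights
agent1 = [[0.0, 1.0], [0.4, 0.6]]         # Agent 1's policy per component
agent2 = [[0.0, 0.3, 0.0, 0.7],           # Agent 2's policy per component
          [0.9, 0.0, 0.1, 0.0]]

joint = mixture_of_marginals(alphas, agent1, agent2)
rounded = [[round(x, 2) for x in row] for row in joint]
for row in rounded:
    print(row)
# [0.29, 0.0, 0.03, 0.0]
# [0.43, 0.06, 0.05, 0.14]
```

With two components the mixture can represent a rank-2 joint policy, which a single pair of marginal policies cannot.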
Agents must
Single-Agent Navigation: MoveAhead, RotateLeft, RotateRight, Pass
MoveWithObject (MWO): MWOAhead, MWORight, MWOLeft, MWOBack
MoveObject (MO): MOAhead, MORight, MOLeft, MOBack
RotateObject: Right
(Details in the paper)
[Figure: Agent 1's view, Agent 2's view, and a top-down view (not available to agents). Labels: TV, Goal.]
Cordial SYNC agents train as well as the Central agents.
Marginal agents train poorly, and worsen without communication.
Generalize well (with scope for improvement).
Marginal Agents
Effective joint policy: $\Pi = \pi^1 \otimes \pi^2 = \begin{bmatrix} 0.23 & 0.02 & 0.03 & 0.04 \\ 0.49 & 0.04 & 0.05 & 0.10 \end{bmatrix}$ (rank 1), L1 error 0.26
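Mixture-of-Marginals
$\sum_{j=1}^{2} \alpha_j \cdot (\pi^1_j \otimes \pi^2_j) = \alpha_1 \cdot (\pi^1_1 \otimes \pi^2_1) + \alpha_2 \cdot (\pi^1_2 \otimes \pi^2_2) = \begin{bmatrix} 0.29 & 0 & 0.03 & 0 \\ 0.43 & 0.06 & 0.05 & 0.14 \end{bmatrix}$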
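One standard way to draw a joint action from a mixture of marginals is to pick a component index first and then let each agent sample from its own policy for that component. The sketch below illustrates only that generic procedure. How the agents agree on the index is not described in these slides, so the single shared random generator is an assumption, and the function name, component values, and zero placements are illustrative.

```python
import random
from collections import Counter

def sample_joint_action(alphas, agent1_policies, agent2_policies, rng):
    """Draw a component j from alphas, then one action per agent from component j."""
    j = rng.choices(range(len(alphas)), weights=alphas)[0]
    a1 = rng.choices(range(len(agent1_policies[j])), weights=agent1_policies[j])[0]
    a2 = rng.choices(range(len(agent2_policies[j])), weights=agent2_policies[j])[0]
    return a1, a2

alphas = [0.2, 0.8]
agent1 = [[0.0, 1.0], [0.4, 0.6]]
agent2 = [[0.0, 0.3, 0.0, 0.7], [0.9, 0.0, 0.1, 0.0]]

rng = random.Random(0)  # shared source of randomness (assumption)
n = 20000
counts = Counter(sample_joint_action(alphas, agent1, agent2, rng) for _ in range(n))

# Empirical joint distribution, rows = Agent 1's action, columns = Agent 2's action.
empirical = [[counts[(a, b)] / n for b in range(4)] for a in range(2)]
for row in empirical:
    print([round(x, 2) for x in row])
```

With enough samples the empirical frequencies approach the mixture's joint distribution, and action pairs that have zero probability under every component never occur.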
[Figure: reply weights over the steps in an episode, for Cordial SYNC and Marginal (prior) agents. Markers: Agent 1 or Agent 2 attempted a MoveWithObject action; Agent 1 or Agent 2 took a Pass action.]