Information Sharing for Distributed Planning
Prasanna Velagapudi
AAMAS 2010 - Doctoral Symposium 1
Information Sharing for Distributed Planning Prasanna Velagapudi - - PowerPoint PPT Presentation
Information Sharing for Distributed Planning Prasanna Velagapudi AAMAS 2010 - Doctoral Symposium 1 Large Heterogeneous Teams 100s to 1000s of robots, agents, people Complex, collaborative tasks Dynamic, uncertain environment
AAMAS 2010 - Doctoral Symposium 1
AAMAS 2010 - Doctoral Symposium 2
3 AAMAS 2010 - Doctoral Symposium
4 AAMAS 2010 - Doctoral Symposium
5 AAMAS 2010 - Doctoral Symposium
?
6 AAMAS 2010 - Doctoral Symposium
?
7 AAMAS 2010 - Doctoral Symposium
AAMAS 2010 - Doctoral Symposium 8
5 10 15 2 4 6 8 10 12 14 16 18
AAMAS 2010 - Doctoral Symposium 9
5 10 15 2 4 6 8 10 12 14 16 18
10 AAMAS 2010 - Doctoral Symposium
Start Goal
5 10 15 20 25 30 35 40 5 10 15 20 25 30 35 40
11 AAMAS 2010 - Doctoral Symposium
AAMAS 2010 - Doctoral Symposium 12
[van den Berg, et al 2005]
AAMAS 2010 - Doctoral Symposium 13
[van den Berg, et al 2005]
14 AAMAS 2010 - Doctoral Symposium
15 AAMAS 2010 - Doctoral Symposium
16 AAMAS 2010 - Doctoral Symposium
50 100 150 200 5 10 15 Number of robots Number of sequential planning iterations
17 AAMAS 2010 - Doctoral Symposium
50 100 150 200 5 10 15 Number of robots Number of sequential planning iterations 50 100 150 200 1 2 3 4 5 Number of robots Proportion of centralized planning time
18 AAMAS 2010 - Doctoral Symposium
A B C D A B C D Longest planning agents might replan multiple times Individual agent planning times varied by >2 orders of magnitude
19 AAMAS 2010 - Doctoral Symposium
Solution 2: Incremental Planning Solution 1: Prioritize by plan time?
AAMAS 2010 - Doctoral Symposium 20
AAMAS 2010 - Doctoral Symposium 21
22 AAMAS 2010 - Doctoral Symposium
Rescue Agent Cleaner Agent Narrow Corridor Victim Unsafe Cell Clearable Debris
23 AAMAS 2010 - Doctoral Symposium
AAMAS 2010 - Doctoral Symposium 24
Only fits one agent Passable if cleaned
[Varakantham, et al 2009]
AAMAS 2010 - Doctoral Symposium 25
[Varakantham, et al 2009]
(states) (actions) (obs.) (transition)(reward)(obs. fn)
AAMAS 2010 - Doctoral Symposium 26
[Varakantham, et al 2009]
AAMAS 2010 - Doctoral Symposium 27
[Varakantham, et al 2009]
28 AAMAS 2010 - Doctoral Symposium Role Allocation Policy Solution Interaction Detection Coordination
TREMOR
Branch & Bound MDP Independent EVA[3] solvers Joint policy evaluation Reward shaping
models
[Varakantham, et al 2009]
29 AAMAS 2010 - Doctoral Symposium Role Allocation Policy Solution Interaction Detection Coordination
TREMOR
Branch & Bound MDP Independent EVA[3] solvers Joint policy evaluation Reward shaping
models
L-TREMOR
Decentralized Auction Sampling & message passing
AAMAS 2010 - Doctoral Symposium 30
5 10 15 20 50 40 30 20 10 10 20 30 Iteration Empirical Joint Reward LTREMOR Independent 5 10 15 20 50 100 150 200 250 Iteration Empirical Joint Reward LTREMOR Independent 2 4 6 8 10 12 750 800 850 900 950 1000 1050 Iteration Empirical Joint Reward LTREMOR Independent
N = 6 N = 10 N = 100 (structurally similar to N=10)
AAMAS 2010 - Doctoral Symposium 31
2 4 6 8 10 12 14 16 18 20 5 10 15 20 25 30 35 40 45 50 Iteration Planning Time per Agent (s) n = 5, complex n = 10, tall n = 100, tall
AAMAS 2010 - Doctoral Symposium 32
100 100 200 300 400 150 100 50 50 100 Error between actual and expected value Improvement over independent policy n = 5, complex n = 10, tall n = 100, tall
33 AAMAS 2010 - Doctoral Symposium
?
AAMAS 2010 - Doctoral Symposium 34
35 AAMAS 2010 - Doctoral Symposium
36 AAMAS 2010 - Doctoral Symposium