Information Sharing for Distributed Planning Prasanna Velagapudi - - PowerPoint PPT Presentation

information sharing for distributed planning
SMART_READER_LITE
LIVE PREVIEW

Information Sharing for Distributed Planning Prasanna Velagapudi - - PowerPoint PPT Presentation

Information Sharing for Distributed Planning Prasanna Velagapudi AAMAS 2010 - Doctoral Symposium 1 Large Heterogeneous Teams 100s to 1000s of robots, agents, people Complex, collaborative tasks Dynamic, uncertain environment


slide-1
SLIDE 1

Information Sharing for Distributed Planning

Prasanna Velagapudi

AAMAS 2010 - Doctoral Symposium 1

slide-2
SLIDE 2

Large Heterogeneous Teams

  • 100s to 1000s of

robots, agents, people

  • Complex, collaborative

tasks

  • Dynamic, uncertain

environment

  • Joint planning

intractable

AAMAS 2010 - Doctoral Symposium 2

slide-3
SLIDE 3

Scaling Team Planning

  • Independent planners: can’t account for

teammates

  • Existing work: needs specific structure or doesn’t

scale to these sizes

– DPC, Prioritized Planning – JESP, Factored MDP, ND-POMDP

3 AAMAS 2010 - Doctoral Symposium

slide-4
SLIDE 4

Iterated Distributed Planning

  • 1. Factor the problem, enumerate interactions
  • 2. Compute independent plans & potential interactions
  • 3. Exchange messages about interactions
  • 4. Use exchanged information, improve local model

4 AAMAS 2010 - Doctoral Symposium

slide-5
SLIDE 5

Iterated Distributed Planning

  • 1. Factor the problem, enumerate interactions
  • 2. Compute independent plans & potential interactions
  • 3. Exchange messages about interactions
  • 4. Use exchanged information, improve local model

5 AAMAS 2010 - Doctoral Symposium

?

slide-6
SLIDE 6

Iterated Distributed Planning

  • 1. Factor the problem, enumerate interactions
  • 2. Compute independent plans & potential interactions
  • 3. Exchange messages about interactions
  • 4. Use exchanged information, improve local model

6 AAMAS 2010 - Doctoral Symposium

?

slide-7
SLIDE 7

Iterated Distributed Planning

  • 1. Factor the problem, enumerate interactions
  • 2. Compute independent plans & potential interactions
  • 3. Exchange messages about interactions
  • 4. Use exchanged information, improve local model

7 AAMAS 2010 - Doctoral Symposium

slide-8
SLIDE 8

A Tale of Two Distributed Planners

Distributed Prioritized Planning (DPP) L-TREMOR

AAMAS 2010 - Doctoral Symposium 8

5 10 15 2 4 6 8 10 12 14 16 18

slide-9
SLIDE 9

Distributed Prioritized Planning

AAMAS 2010 - Doctoral Symposium 9

slide-10
SLIDE 10

Multiagent Path Planning

5 10 15 2 4 6 8 10 12 14 16 18

10 AAMAS 2010 - Doctoral Symposium

Start Goal

slide-11
SLIDE 11

Multiagent Path Planning

5 10 15 20 25 30 35 40 5 10 15 20 25 30 35 40

11 AAMAS 2010 - Doctoral Symposium

slide-12
SLIDE 12

Prioritized Planning

  • Assign priorities to agents based on path length

AAMAS 2010 - Doctoral Symposium 12

[van den Berg, et al 2005]

slide-13
SLIDE 13

Prioritized Planning

  • Plan from highest priority to lowest priority
  • Use previous agents as dynamic obstacles

AAMAS 2010 - Doctoral Symposium 13

[van den Berg, et al 2005]

slide-14
SLIDE 14

Distributed Prioritized Planning

14 AAMAS 2010 - Doctoral Symposium

Parallelizable & Equivalent

slide-15
SLIDE 15

Large-Scale Path Solutions

15 AAMAS 2010 - Doctoral Symposium

slide-16
SLIDE 16

Large-Scale Path Solutions

16 AAMAS 2010 - Doctoral Symposium

slide-17
SLIDE 17

50 100 150 200 5 10 15 Number of robots Number of sequential planning iterations

DPP Results

Fewer Sequential Plans

17 AAMAS 2010 - Doctoral Symposium

slide-18
SLIDE 18

50 100 150 200 5 10 15 Number of robots Number of sequential planning iterations 50 100 150 200 1 2 3 4 5 Number of robots Proportion of centralized planning time

DPP Results

18 AAMAS 2010 - Doctoral Symposium

Longer Planning Time Fewer Sequential Plans

slide-19
SLIDE 19
  • Prioritized Planning
  • DPP

Why does this happen?

A B C D A B C D Longest planning agents might replan multiple times Individual agent planning times varied by >2 orders of magnitude

19 AAMAS 2010 - Doctoral Symposium

Solution 2: Incremental Planning Solution 1: Prioritize by plan time?

slide-20
SLIDE 20

Summary of DPP

  • Observable, certain world
  • Only one type of interaction: collision
  • Far fewer sequential planning iterations
  • Incremental planning may reduce execution time

AAMAS 2010 - Doctoral Symposium 20

slide-21
SLIDE 21

L-TREMOR

AAMAS 2010 - Doctoral Symposium 21

slide-22
SLIDE 22

A Simple Rescue Domain

22 AAMAS 2010 - Doctoral Symposium

Rescue Agent Cleaner Agent Narrow Corridor Victim Unsafe Cell Clearable Debris

slide-23
SLIDE 23

A Simple (Large) Rescue Domain

23 AAMAS 2010 - Doctoral Symposium

slide-24
SLIDE 24

Distributed POMDP with Coordination Locales (DPCL)

  • Often, interactions between agents are sparse

AAMAS 2010 - Doctoral Symposium 24

Only fits one agent Passable if cleaned

[Varakantham, et al 2009]

slide-25
SLIDE 25

Distributed POMDP with Coordination Locales (DPCL)

  • Define coordination locales (CLs) where POMDP

model functions are not independent:

AAMAS 2010 - Doctoral Symposium 25

[Varakantham, et al 2009]

<S, A, Ω, P, R, O>

(states) (actions) (obs.) (transition)(reward)(obs. fn)

slide-26
SLIDE 26

Distributed POMDP with Coordination Locales (DPCL)

  • Define coordination locales (CLs) where POMDP

model functions are not independent:

AAMAS 2010 - Doctoral Symposium 26

[Varakantham, et al 2009]

S1, A1 S2, A2 Sglobal R1, P1, O1 R2, P2, O2

Outside CL: (typical)

slide-27
SLIDE 27

Distributed POMDP with Coordination Locales (DPCL)

  • Define coordination locales (CLs) where POMDP

model functions are not independent:

AAMAS 2010 - Doctoral Symposium 27

[Varakantham, et al 2009]

S1, A1 S2, A2 Sglobal R12, P12, O12

Inside CL: (interaction)

slide-28
SLIDE 28

TREMOR

28 AAMAS 2010 - Doctoral Symposium Role Allocation Policy Solution Interaction Detection Coordination

TREMOR

Branch & Bound MDP Independent EVA[3] solvers Joint policy evaluation Reward shaping

  • f independent

models

[Varakantham, et al 2009]

slide-29
SLIDE 29

L-TREMOR

29 AAMAS 2010 - Doctoral Symposium Role Allocation Policy Solution Interaction Detection Coordination

TREMOR

Branch & Bound MDP Independent EVA[3] solvers Joint policy evaluation Reward shaping

  • f independent

models

L-TREMOR

Decentralized Auction Sampling & message passing

Distributed & Parallelizable

slide-30
SLIDE 30

Preliminary Results – Joint Utility

AAMAS 2010 - Doctoral Symposium 30

5 10 15 20 50 40 30 20 10 10 20 30 Iteration Empirical Joint Reward LTREMOR Independent 5 10 15 20 50 100 150 200 250 Iteration Empirical Joint Reward LTREMOR Independent 2 4 6 8 10 12 750 800 850 900 950 1000 1050 Iteration Empirical Joint Reward LTREMOR Independent

N = 6 N = 10 N = 100 (structurally similar to N=10)

slide-31
SLIDE 31

Preliminary Results – Timing

AAMAS 2010 - Doctoral Symposium 31

2 4 6 8 10 12 14 16 18 20 5 10 15 20 25 30 35 40 45 50 Iteration Planning Time per Agent (s) n = 5, complex n = 10, tall n = 100, tall

slide-32
SLIDE 32

Preliminary Results – Model Accuracy

AAMAS 2010 - Doctoral Symposium 32

100 100 200 300 400 150 100 50 50 100 Error between actual and expected value Improvement over independent policy n = 5, complex n = 10, tall n = 100, tall

R = 0.804

slide-33
SLIDE 33

Current Issues

  • Oscillations in solutions
  • Discovery of relevant locales

33 AAMAS 2010 - Doctoral Symposium

?

slide-34
SLIDE 34

Summary of L-TREMOR

  • Partially-observable, uncertain world
  • Multiple types of interactions
  • Role-allocation of tasks
  • Improvement over independent planning
  • Handles large problems
  • Next steps: improving convergence

AAMAS 2010 - Doctoral Symposium 34

slide-35
SLIDE 35

Conclusions

  • Two approaches to distributed planning

– DPP: approaching centralized performance – L-TREMOR: exceeding joint tractability

  • Analogous strategies for distributing planning

– Both iterate independent planners – Both exchange messages about states, actions

35 AAMAS 2010 - Doctoral Symposium

slide-36
SLIDE 36

Future Work

  • Generalized framework for distributed planning

through iterative message exchange

  • Reduce necessary communication
  • Better search over task allocations
  • Scaling to larger team sizes

36 AAMAS 2010 - Doctoral Symposium