TDP: Agent Simulation AIT-Rescue 1 1 1 Taishun Kusaka , Yuki - - PowerPoint PPT Presentation

tdp agent simulation ait rescue
SMART_READER_LITE
LIVE PREVIEW

TDP: Agent Simulation AIT-Rescue 1 1 1 Taishun Kusaka , Yuki - - PowerPoint PPT Presentation

RoboCup 2019 RoboCup Rescue Simulation League TDP: Agent Simulation AIT-Rescue 1 1 1 Taishun Kusaka , Yuki Miyamoto , Akira Hasegawa 2 1 Kazunori Iwata , Nobuhiro Ito 1 Department of Information Science, Aichi Institute of Technology,


slide-1
SLIDE 1

RoboCup Rescue Simulation League

TDP:Agent Simulation AIT-Rescue

Taishun Kusaka , Yuki Miyamoto , Akira Hasegawa 
 Kazunori Iwata , Nobuhiro Ito

RoboCup 2019

1 1 1 2 Department of Information Science, Aichi Institute of Technology, Japan Department of Business Administration, Aichi University, Japan 1 2 1

slide-2
SLIDE 2

Introduction

slide-3
SLIDE 3

Task assignment problem in RRS

Introduction

  • Typical problem in RRS that assigns all agents to all tasks (disasters).

Agents Tasks

Various approaches

The DCOP algorithm, 
 which is one of the most effective approaches
 to this problem, attracted considerable attention in AAAI-18.

Tutorial on Multi-agent Distributed Constrained Optimization @AAAI-18


http://www-personal.umich.edu/~fioretto/cfp/AAAI18/

1

1

slide-4
SLIDE 4

DCOP and DCOP algorithm in RRS

Introduction

  • DCOP : Decentralized Constraints Optimization Problem
  • A typical DCOP algorithm 


needs to perform message propagating numerous times. Sending & receiving messages takes 2 simulation steps.

However,

1 step 2 step

Agent A Agent B

slide-5
SLIDE 5

Utilizing the DCOP algorithm on RRS

Introduction

  • Task assignment by the DCOP algorithm might work more effectively

inside an agent that gathers all agents and all tasks …

→ The CenterAgent can perform this task assignment.

Center agent

Working

Can perform 
 the message propagation internally

Information Assignments Platoon agents

slide-6
SLIDE 6
  • Task assignment by the DCOP algorithm might work more effectively

inside an agent that gathers all agents and all tasks

Center agent

Working

Can perform 
 the message propagation internally

Information Assignments Platoon agents

Utilizing the DCOP algorithm on RRS

Introduction

→ The CenterAgent can perform this task assignment.

2 Weiss, Y., Freeman, W.T.: On the optimality of solutions of the max-product belief- propagation algorithm in arbitrary graphs. IEEE Trans. Information Theory 47,736-744 (2001)

In our attempt to implement center agents the task assignment,
 we used the Max-Sum algorithm, 
 which is one of the primary DCOP algorithms.

2

slide-7
SLIDE 7

Improvements of AIT-Rescue 2019

Introduction

Complex modules Algorithm modules Centralized

ExtAction PathPlanning Clustering Command Picker Command Executor Tactics

Center Self

TargetAllocator TargetDetector Search

Agent Development Framework

slide-8
SLIDE 8

Improvements of AIT-Rescue 2019

Introduction

Complex modules Algorithm modules Centralized

ExtAction PathPlanning Clustering Command Picker Command Executor Tactics

Center Self

TargetDetector Search

Agent Development Framework

TargetAllocator

We attempted to create an implementation that 
 allows center agents to perform

  • Centralized task assignment


(by the CenterAgent)

slide-9
SLIDE 9

Improvements of AIT-Rescue 2019

Introduction

Complex modules Algorithm modules Centralized

ExtAction PathPlanning Clustering Command Picker Command Executor Tactics

Center Self

TargetDetector Search

Agent Development Framework

TargetAllocator

We implemented two types of Clustering modules for

  • Properly dividing working area
  • Grouping burning buildings for

detecting a target → Modules section

slide-10
SLIDE 10

Improvements of AIT-Rescue 2019

Introduction

Complex modules Algorithm modules Centralized

PathPlanning Clustering Command Picker Command Executor Tactics

Center Self

TargetDetector Search

Agent Development Framework

TargetAllocator

We implemented strategies of
 FireBrigade & PoliceForce for

  • Effectively detecting a target

→ Strategies section

ExtAction

slide-11
SLIDE 11

Improvements of AIT-Rescue 2019

Introduction

Complex modules Algorithm modules Centralized

ExtAction PathPlanning Clustering Command Picker Command Executor Tactics

Center Self

Search

Agent Development Framework

TargetAllocator

We implemented an action of PoliceForce for

  • Effectively clearing blockades

→ Strategies section

TargetDetector

slide-12
SLIDE 12

Centralized
 Task Assignment

( Not In The Template )

slide-13
SLIDE 13
  • Our center agents performs centralized task assignments for all

platoon agents using the Max-Sum algorithm, which is a major DCOP algorithm

  • Only works if all platoon agents have two-way communication with

the center agents

CenterAgent : TargetAllocator

Centralized Task Assignment

Fire station

e.g.

slide-14
SLIDE 14
  • Our center agent performs centralized task assignments for all

platoon agents using the Max-Sum algorithm, which is a major DCOP algorithm

  • Only works if all platoon agents have two-way communication with

the center agents

In other situations,

the platoon agents must detect their own targets individually.

Working Centralized Task Assignment

CenterAgent : TargetAllocator

slide-15
SLIDE 15

All platoon agents get tasks based on each agent’s perceptions and send task information messages to the center agent.

e.g. civilians, buildings, blockades Simulation Step i: Center agent Platoon agents Send Task information Send Assignments Working Get tasks Radio communication

How the center agent works : 3 steps

Centralized Task Assignment

CenterAgent : TargetAllocator

slide-16
SLIDE 16

The center agent updates own WorldModel and finds assignments by repeatedly using the Max-Sum algorithm and then sends assignment messages to each platoon agent via radio.

Simulation Step i+1: Center agent Platoon agents Radio communication Send Task information Send Assignments Working

How the center agent works : 3 steps

Centralized Task Assignment

CenterAgent : TargetAllocator

slide-17
SLIDE 17

All platoon agents get their own assignments and work to complete those assignments.

How the center agent works : 3 steps

Simulation Step i+2: Center agent Platoon agents Send Assignments Working Send Task information Radio communication Work to the tasks Got it Centralized Task Assignment

CenterAgent : TargetAllocator

slide-18
SLIDE 18
  • Center agents evaluate tasks to determine assignments as follows:

What the center agent evaluates tasks with

* we explained them in TDP more detail

Distance

Cost : Distance 


between tasks and assigned agents

x3

2 (-1)

Penalty : Shortfall 


in the number of necessary agents

→ Minimizing values based on all costs & all penalties

Centralized Task Assignment

CenterAgent : TargetAllocator

slide-19
SLIDE 19

Modules

slide-20
SLIDE 20

Clustering for the working areas

Modules

  • The module divides a map into working areas to which each agent

assigned.

slide-21
SLIDE 21

Improvements (from the sample)

Modules

  • Fixed the problem of some clusters overlapping each other
  • Implemented the k-means++ to address the initial-value dependence

Clustering for the working areas

Sample K-means Current K-means++ A characteristic of clustering

slide-22
SLIDE 22

Clustering for the buildings on fire

Modules

  • The module groups some buildings on fire.
  • We use this module to create convex hulls in ordering when

extinguish buildings on the cluster edges.

slide-23
SLIDE 23
  • Implemented a hierarchical clustering that merges any two clusters

if they are within the set distance from the criterion

Implementation

Modules e.g. The average distance between all buildings

Clustering for the buildings on fire

Too far Close

slide-24
SLIDE 24

Each building is set as a cluster.

Flow

Modules

Clustering for the buildings on fire

Step 1:

Too far Close

slide-25
SLIDE 25

Any two of the clusters if they are within the set distance from the criterion

Flow

Modules

Clustering for the buildings on fire

Step 2:

Too far Close

slide-26
SLIDE 26

Any two of the clusters if they are within the set distance from the criterion

Flow

Modules

Clustering for the buildings on fire

Step 2:

Too far Close

slide-27
SLIDE 27

Strategies

slide-28
SLIDE 28
  • A fire brigade typically selects a target as follows:
  • When dealing with buildings on the edge of the convex hull closest to

the agent

  • Within the set of the buildings that have a lowest possible fieriness

✓The target is the building that is closer to the agent than any

  • ther agents

Strategies

  • FireBrigade : BuildingDetctor
slide-29
SLIDE 29
  • If there are no candidate buildings, the fire brigade selects a target as

follows:

  • When dealing with buildings in the agent’s perception at the step
  • In the set of the buildings having the fieriness as low as possible

✓The target is the building that is nearer to the agent than other

agents

Strategies

  • FireBrigade : BuildingDetctor
slide-30
SLIDE 30
  • A police force basically selects a target as follows:
  • Regarding other agents from which the police force has received

CommandPolice messages

✓The road where the nearest agent to the police force is in

  • PoliceForce : RoadDetctor

Strategies S e n d s C

  • m

m a n d P

  • l

i c e S e l e c t s t h i s a g e n t

slide-31
SLIDE 31

Strategies

  • If there is no candidate, the police force selects a target as follows:
  • Regarding buildings in the cluster to which the agent is assigned

✓The target is the building that is the nearest to the agent ✓The road that the agent will take is decided randomly

If there is no candidate …

PoliceForce : RoadDetctor

Has been 
 selected

slide-32
SLIDE 32

PoliceForce : ExtActionClear

Strategies

  • A police force basically clears its way to a target as follows:
  • In the set of blockades on the way from the agent to the target

✓The centerline blockade of Roads the agent has to travel on

to reach the target.

slide-33
SLIDE 33

Evaluations

slide-34
SLIDE 34
  • Our team results in 2019 are better than the others.

Results

Evaluations

Agent Scenario

Eindhoven2 Paris1 Sakae1

AIT-Rescue 2019

66.83 15.07 9.99

AIT-Rescue 2018

64.46 9.98 9.75

Sample

64.43 11.40 9.74

  • Run 20 simulations for each result.
  • Extended Communication bandwidths of all scenarios maximize.
  • Disabled noises of all scenarios.
slide-35
SLIDE 35
  • AIT-Rescue implemented the following in 2019:
  • The task assignment by the center agent
  • K-means++
  • Hierarchical clustering
  • Task detection by fire brigades and police forces
  • Police force clearing action
  • AIT-Rescue 2019 is better than last year’s version.
  • But, we need to implement a decentralized approach for the Max-

Sum algorithm.

Conclusions

Evaluations

Thank you for your attentions!