Swarm Robotics an overview Vito Trianni, PhD Institute of - - PowerPoint PPT Presentation

swarm robotics an overview
SMART_READER_LITE
LIVE PREVIEW

Swarm Robotics an overview Vito Trianni, PhD Institute of - - PowerPoint PPT Presentation

Swarm Robotics an overview Vito Trianni, PhD Institute of Cognitive Sciences and Technologies National Research Council vito.trianni@istc.cnr.it swarm robotics swarm robotics studies robotic systems composed of a multitude


slide-1
SLIDE 1

Swarm Robotics
 – an overview –

Vito Trianni, PhD

Institute of Cognitive Sciences and Technologies National Research Council

vito.trianni@istc.cnr.it

slide-2
SLIDE 2
slide-3
SLIDE 3
  • swarm robotics studies robotic systems composed of


a multitude of interacting units

  • homogeneous systems or few heterogeneous groups
  • each unit is relatively simple and inexpensive
  • individual limitations, absence of global information
  • limitations can be physical or functional
  • access to local and incomplete information only
  • decentralised control
  • no single point of failure
  • redundancy is built-in in the system
  • expected properties:

swarm robotics

  • parallelism
  • scalability
  • robustness
  • efficiency
  • adaptivity
slide-4
SLIDE 4

swarm robotics

  • simple individuals and simple behaviours
  • complexity results from cooperation
  • research mainly focuses on:
  • development of specific hardware to support

communication and physical interactions

  • development and test of swarm control systems
  • problem:

how to define individual rules?

slide-5
SLIDE 5

macroscopic behaviour individual agent rules

?

design of decentralised systems

DESIGN PROBLEM

  • distributed
  • large number of

interconnected agents

  • self-organised

WIRELESS SENSOR NETWORKS SWARM ROBOTICS

Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.

slide-6
SLIDE 6

design patterns

  • reusable solutions for a specific class of problems
  • leverage on the principled understanding of 


theoretical models of collective systems

slide-7
SLIDE 7

what design rationale
 for robot swarms?

slide-8
SLIDE 8

super-organisms

slide-9
SLIDE 9
slide-10
SLIDE 10

Swarm-Bots (2004)

slide-11
SLIDE 11
slide-12
SLIDE 12

Swarmanoid (2011)

slide-13
SLIDE 13
slide-14
SLIDE 14

Kilobots (2014)

slide-15
SLIDE 15
slide-16
SLIDE 16

Verity Studios (2017)

slide-17
SLIDE 17
slide-18
SLIDE 18

perspectives

  • potential application domains
  • agriculture and precision farming
  • security, search&rescue
  • logistics
  • space exploration
  • swarm robotics still confined into the lab
  • more research needed for higher cognitive skills
  • collective decision-making
  • task allocation
  • categorisation
  • learning
slide-19
SLIDE 19

collective decisions

slide-20
SLIDE 20

collective decisions

  • definition:


the process that leads a group to identify
 the best option out of several alternatives

  • precondition:


partial/noisy information about the available alternatives

  • postcondition:


the group (or a large majority) shares the same choice

  • constraints:


individuals cannot know/compare all alternatives

slide-21
SLIDE 21

decentralised decision making

  • best-of-n decision problem
  • set of n options
  • each option i has a quality vi
  • GOAL: select the best (or equal-best) option

which rules should
 each agent follow?

  • discover the options
  • estimate their qualities
  • select the best one

macroscopic behaviour individual agent rules

?

slide-22
SLIDE 22

design rationale

nest-site selection in honeybees

+ attains near-optimal


speed-accuracy tradeoff

+ no need of direct comparison

between option qualities

+ adaptive mechanisms to tune

decision speed and break symmetry deadlocks

slide-23
SLIDE 23

collective decisions in bees

a swarm needs to select the new nesting site

slide-24
SLIDE 24

collective decisions in bees

scout bees identify the available alternatives and share information through the ‘waggle dance’

slide-25
SLIDE 25

modelling collective decisions

U A B …

uncommitted agents committed agents

slide-26
SLIDE 26

modelling collective decisions

U A B

discovery of alternatives

U U A B

γA γB

slide-27
SLIDE 27

modelling collective decisions

U A B

abandonment of commitment

αA αB

A B U U

slide-28
SLIDE 28

modelling collective decisions

U A B

recruitment to discovered alternatives

U+A U+B A+A B+B

ρA ρB

A B

slide-29
SLIDE 29

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU ΨU = 1 − ΨA − ΨB

nest-site selection model

discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

slide-30
SLIDE 30

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU ΨU = 1 − ΨA − ΨB

nest-site selection model

discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

slide-31
SLIDE 31

modelling collective decisions

A B

switch of alternatives

B+A A+B A+A B+B

σB σA

A B

slide-32
SLIDE 32

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σB − σA)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σA − σB)ΨAΨB ΨU = 1 − ΨA − ΨB

slide-33
SLIDE 33

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σB − σA)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σA − σB)ΨAΨB ΨU = 1 − ΨA − ΨB

slide-34
SLIDE 34

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σB − σA)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σA − σB)ΨAΨB ΨU = 1 − ΨA − ΨB

slide-35
SLIDE 35
  • T. D. Seeley, P. K. Visscher, T. Schlegel, P. M. Hogan, N. R. Franks, and J. A. R. Marshall, “Stop Signals Provide Cross

Inhibition in Collective Decision-Making by Honeybee Swarms”. Science, vol. 335, no. 6064, pp. 108–111, 2012.

slide-36
SLIDE 36

modelling collective decisions

A B

cross-inhibition

B+A A+B U+A U+B

σA σB

A B U

slide-37
SLIDE 37

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σA − σB)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σB − σA)ΨAΨB ΨU = 1 − ΨA − ΨB ,

slide-38
SLIDE 38

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B cross-inhibition A+B B+A A+U B+U

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − σBΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − σAΨAΨB ΨU = 1 − ΨA − ΨB ,

slide-39
SLIDE 39

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B cross-inhibition A+B B+A A+U B+U

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − σBΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − σAΨAΨB ΨU = 1 − ΨA − ΨB ,

slide-40
SLIDE 40

A B U

γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA

nest-site selection model

discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B cross-inhibition A+B B+A A+U B+U

   ˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − σBΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − σAΨAΨB ΨU = 1 − ΨA − ΨB ,

slide-41
SLIDE 41

design pattern solution

multi-level description of the decision process

Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.

slide-42
SLIDE 42

design pattern solution

multi-level description of the decision process

Macroscopic description

infinite-size
 deterministic
 time continuous

⇢ ˙ Ψi = γiΨU αiΨi +ρiΨiΨU ∑ j6=i σ jΨiΨ j ΨU = 1∑i Ψi

System of ODEs

Macroscopic description

finite-size
 stochastic
 time continuous Master equation

δ δt P(N,t) =

4n

k=1

[βk P(N,t)Qk], 8N

Microscopic description

agent-based
 stochastic
 time discrete PFSM

C1 . . . Cn CU A

Pα1

q

j”=1 PΨjPσj

Pαn

q

j”=n PΨjPσj

Pγ1 PΨ1Pρ1 Pγn PΨnPρn

Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.

slide-43
SLIDE 43

design pattern solution

multi-level description of the decision process C1 . . . Cn CU A

Pα1

q

j”=1 PΨjPσj

Pαn

q

j”=n PΨjPσj

Pγ1 PΨ1Pρ1 Pγn PΨnPρn

Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.

slide-44
SLIDE 44

C1 . . . Cn CU A

Pα1

q

j”=1 PΨjPσj

Pαn

q

j”=n PΨjPσj

Pγ1 PΨ1Pρ1 Pγn PΨnPρn

8 < : ˙ Ψi = γiΨU − αiΨi+ ρiΨiΨU − P

j6=i σjΨiΨj

ΨU = 1 − P

i Ψi

micro-macro link

transform parameters of the macroscopic model into the probabilities of the individual PFSM

slide-45
SLIDE 45

micro-macro link

transform parameters of the macroscopic model into the probabilities of the individual PFSM λi = fλ(vi) → Pλ(vi) = fλ(vi)τ, λ ∈ {γ, α, ρ, σ} i ∈ {1, . . . , n}

slide-46
SLIDE 46

usage of the design pattern

1.Choice of the macroscopic parameterisation, including application specific constraints 2.Derivation of the microscopic parameterisation 3.Implementation and testing

slide-47
SLIDE 47

macroscopic parameterisation

  • The choice depends on the expected properties

with respect to the options value

  • Value-sensitive decision-making
  • Best-of-N decisions

γi = ρi = 1 αi = vi σi = ˆ σ

Pais et al. (2013). A Mechanism for Value-Sensitive Decision-Making. PLoS ONE, 8(9), e73216 Reina et al. (2017). Model of the best-of-N nest-site selection process in honeybees. Physical Review E, 95(5), 052411–15

γi = 1 αi = kvi ρi = σi = hvi r = h k

slide-48
SLIDE 48

value sensitivity

decision deadlock deadlock breaking

Two options with equal quality

γi = ρi = 1 αi = vi σi = ˆ σ

slide-49
SLIDE 49

best-of-N decisions

I

Deadlock

II

Possible deadlock

III

Deadlock breaking

r1 r2 2 4 6 8 10 0.0 0.2 0.4 0.6 0.8 1.0

r x1

I

Deadlock

II

Possible deadlock

III

Deadlock breaking

2 4 6 8 10 2 4 6 8 10

r v

Three options with equal quality

γi = 1 αi = kvi ρi = σi = hvi r = h k

slide-50
SLIDE 50

best-of-N decisions

III

Deadlock breaking

II

Possible deadlock

I Deadlock

v

1 2 3 5 10 2 3 4 5 6 7 8 10 20 30 40 50 60

N r

N options with equal quality

γi = 1 αi = kvi ρi = σi = hvi r = h k

slide-51
SLIDE 51

best-of-N decisions

One superior and two inferior options

slide-52
SLIDE 52

case studies

.3.
 Swarm robotics system for search & exploitation

Robots exemplify embodiment challenges

.1.
 Multiagent simulations

  • n fully-connected

networks

Basic case study to investigate several parameterisations

.2.
 Multiagent simulations for search & exploration

Mobile point-size particles capable to move in a 2D environment

.4.
 Coexistence in heterogeneous
 cognitive networks

fully-decentralised solution for channel selection in cognitive radio networks

slide-53
SLIDE 53

case study #1

.1.
 Multiagent simulations

  • n fully-connected

networks

Basic case study to investigate several parameterisations

2 4 6 8 0.0 0.2 0.4 0.6 0.8 1.0 vi, i ≠ A ΨA 1 3 5 7 9 ODEs Gillespie Multi−agent n = 2 n = 3 n = 4 n = 5

vA vB Convergence time Success rate

A

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

N= 10 N= 50 N= 100 N= 500 N= 1000

Gillespie (master eq.) Multi-agent homogeneous Multi-agent heterogeneous

Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.

slide-54
SLIDE 54

case study #2

.2.
 Multiagent simulations for search & exploration

Mobile point-size particles capable to move in a 2D environment

Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.

slide-55
SLIDE 55

case study #3

.3.
 Swarm robotics system for search & exploitation

Robots exemplify embodiment challenges

video by A. Reina Reina, A., Miletitch, R., Dorigo, M., & Trianni, V. (2015). A quantitative micro–macro link
 for collective decisions: the shortest path discovery/selection example. Swarm Intelligence, 9(2-3), 75–102.

slide-56
SLIDE 56

case study #3

.3.
 Swarm robotics system for search & exploitation

Robots exemplify embodiment challenges

video by A. Reina Reina et al (2016): Effects of Spatiality on Value-Sensitive Decisions Made by Robot Swarms.
 In: Proceedings of DARS 2016, pp. 1–8, Natural History Museum in London, UK

slide-57
SLIDE 57

task allocation

slide-58
SLIDE 58

task allocation

  • definition:


the process that leads a group to (equally) 
 divide labour among the group members

  • precondition:


a set of tasks with different labour demands (utility)

  • postcondition:


agents are deployed to execute one or more tasks

  • constraints:


individuals do not know task requirements
 and other’s preferences/choices

slide-59
SLIDE 59

task allocation: variants

  • single-task (ST) versus multi-task robots (MT)
  • single-robot (SR) versus multi-robot tasks (MR)
  • instantaneous (IA) versus time-extended assignment (TA)

Gerkey, B. P., & Matarić, M. J. (2004). A Formal Analysis and Taxonomy of Task Allocation in Multi-Robot Systems. The International Journal of Robotics Research, 23(9), 939–954.

slide-60
SLIDE 60

TA via response thresholds

Theraulaz, G., Bonabeau, E., & Denuebourg, J. N. (1998). Response threshold reinforcements and division of labour in insect societies. Proceedings of the Royal Society of London. Series B: Biological Sciences, 265(1393), 327–332.

slide-61
SLIDE 61
  • tasks are associated with a utility (stimulus)
  • agents have a response threshold for each task

TA via response thresholds

Sj, j ∈ {1, . . . , M} θij, i ∈ {1, . . . , N}

slide-62
SLIDE 62

TA via response thresholds

  • agents apply a simple decision rule
  • task utility varies over time

˙ Sj = δ − αnj N Pi(Sj) = S2

j

S2

j + θ2 ij

  • spontaneous


growth enrolled agents individual execution rate

slide-63
SLIDE 63

TA via response thresholds

  • How to distribute thresholds


for optimal task allocation?

  • How to assign threshold to have


specialised agents?
 What about generalists?

  • Adaptive response thresholds:

θij ← θij − ξ∆t if agent i performs task j θij ← θij + ξ∆t if agent i does not perform task j

slide-64
SLIDE 64

confronting TA with CD

task allocation

  • discover tasks and

evaluate utility

  • leave tasks when

completed

  • recruit workers to tasks

that need attention

  • …


collective decision

  • discover alternatives and

evaluate quality

  • abandon commitment for

low quality options

  • recruit agents to

favourable options

  • cross-inhibition between

competing options

slide-65
SLIDE 65

coupled dynamical models

  • the utility of executing a task is dependent on the

number of enrolled agents:

  • the optimal number of agents depends on the utility

dynamics:

  • coupled dynamics of task allocation and utility:

˙ ui = −uini(δni − ξn2

i ),

ui ∈ [0, 1].

n? = 2δ 3ξ

γi = kui αi = kH(ν ui) ρi = hui σij = hui 2δ 3ξnj 2δ , i 6= j σii = (3ξN 2δ)(3ξNγi + 2δρi) 4δ2

slide-66
SLIDE 66

coupled dynamical models

  • the utility of executing a task is dependent on the

number of enrolled agents:

  • the optimal number of agents depends on the utility

dynamics:

  • coupled dynamics of task allocation and utility:
  • dynamics controlled by the ratio between


interactive and spontaneous transitions

˙ ui = −uini(δni − ξn2

i ),

ui ∈ [0, 1].

n? = 2δ 3ξ

r = h k

slide-67
SLIDE 67

single task

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 ui xi

Time [s] xi,ui

50 100 150 200 250 300 0.0 0.2 0.4 0.6 0.8 1.0

  • xi

ui ODEs Multi−agent

slide-68
SLIDE 68

Time (s) Time (s)

three tasks

slide-69
SLIDE 69

TA in a nutshell

  • task allocation and collective decisions 


share many important aspects

  • recruitment and inhibition dynamics provide means to

implement different task allocation strategies

  • strategies varies from utility-proportional to 


winner-take-all strategies

  • giving more importance to interactions, 


task allocation becomes responsive to changes in utility

slide-70
SLIDE 70

Thanks for your attention