Swarm Robotics – an overview –
Vito Trianni, PhD
Institute of Cognitive Sciences and Technologies National Research Council
vito.trianni@istc.cnr.it
Swarm Robotics an overview Vito Trianni, PhD Institute of - - PowerPoint PPT Presentation
Swarm Robotics an overview Vito Trianni, PhD Institute of Cognitive Sciences and Technologies National Research Council vito.trianni@istc.cnr.it swarm robotics swarm robotics studies robotic systems composed of a multitude
Vito Trianni, PhD
Institute of Cognitive Sciences and Technologies National Research Council
vito.trianni@istc.cnr.it
a multitude of interacting units
communication and physical interactions
how to define individual rules?
macroscopic behaviour individual agent rules
DESIGN PROBLEM
interconnected agents
WIRELESS SENSOR NETWORKS SWARM ROBOTICS
Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.
theoretical models of collective systems
the process that leads a group to identify the best option out of several alternatives
partial/noisy information about the available alternatives
the group (or a large majority) shares the same choice
individuals cannot know/compare all alternatives
which rules should each agent follow?
macroscopic behaviour individual agent rules
nest-site selection in honeybees
+ attains near-optimal
speed-accuracy tradeoff
+ no need of direct comparison
between option qualities
+ adaptive mechanisms to tune
decision speed and break symmetry deadlocks
a swarm needs to select the new nesting site
scout bees identify the available alternatives and share information through the ‘waggle dance’
uncommitted agents committed agents
discovery of alternatives
U U A B
γA γB
abandonment of commitment
αA αB
A B U U
recruitment to discovered alternatives
U+A U+B A+A B+B
ρA ρB
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU ΨU = 1 − ΨA − ΨB
discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU ΨU = 1 − ΨA − ΨB
discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
switch of alternatives
B+A A+B A+A B+B
σB σA
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σB − σA)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σA − σB)ΨAΨB ΨU = 1 − ΨA − ΨB
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σB − σA)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σA − σB)ΨAΨB ΨU = 1 − ΨA − ΨB
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σB − σA)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σA − σB)ΨAΨB ΨU = 1 − ΨA − ΨB
Inhibition in Collective Decision-Making by Honeybee Swarms”. Science, vol. 335, no. 6064, pp. 108–111, 2012.
cross-inhibition
B+A A+B U+A U+B
σA σB
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
direct switch: discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B A+B B+A A+A B+B
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − (σA − σB)ΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − (σB − σA)ΨAΨB ΨU = 1 − ΨA − ΨB ,
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B cross-inhibition A+B B+A A+U B+U
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − σBΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − σAΨAΨB ΨU = 1 − ΨA − ΨB ,
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B cross-inhibition A+B B+A A+U B+U
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − σBΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − σAΨAΨB ΨU = 1 − ΨA − ΨB ,
A B U
γA γB ΨAρA ΨBρB αA ΨBσB αB ΨAσA
discovery: abandonment: recruitment: U U A B A B U U A+U B+U A+A B+B cross-inhibition A+B B+A A+U B+U
˙ ΨA = γAΨU − αAΨA + ρAΨAΨU − σBΨAΨB ˙ ΨB = γBΨU − αBΨB + ρBΨBΨU − σAΨAΨB ΨU = 1 − ΨA − ΨB ,
Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.
Macroscopic description
infinite-size deterministic time continuous
⇢ ˙ Ψi = γiΨU αiΨi +ρiΨiΨU ∑ j6=i σ jΨiΨ j ΨU = 1∑i Ψi
System of ODEs
Macroscopic description
finite-size stochastic time continuous Master equation
δ δt P(N,t) =
4n
∑
k=1
[βk P(N,t)Qk], 8N
Microscopic description
agent-based stochastic time discrete PFSM
C1 . . . Cn CU A
Pα1
q
j”=1 PΨjPσjPαn
q
j”=n PΨjPσjPγ1 PΨ1Pρ1 Pγn PΨnPρn
Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.
Pα1
q
j”=1 PΨjPσj
Pαn
q
j”=n PΨjPσj
Pγ1 PΨ1Pρ1 Pγn PΨnPρn
Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.
C1 . . . Cn CU A
Pα1
q
j”=1 PΨjPσj
Pαn
q
j”=n PΨjPσj
Pγ1 PΨ1Pρ1 Pγn PΨnPρn
8 < : ˙ Ψi = γiΨU − αiΨi+ ρiΨiΨU − P
j6=i σjΨiΨj
ΨU = 1 − P
i Ψi
transform parameters of the macroscopic model into the probabilities of the individual PFSM
transform parameters of the macroscopic model into the probabilities of the individual PFSM λi = fλ(vi) → Pλ(vi) = fλ(vi)τ, λ ∈ {γ, α, ρ, σ} i ∈ {1, . . . , n}
1.Choice of the macroscopic parameterisation, including application specific constraints 2.Derivation of the microscopic parameterisation 3.Implementation and testing
with respect to the options value
γi = ρi = 1 αi = vi σi = ˆ σ
Pais et al. (2013). A Mechanism for Value-Sensitive Decision-Making. PLoS ONE, 8(9), e73216 Reina et al. (2017). Model of the best-of-N nest-site selection process in honeybees. Physical Review E, 95(5), 052411–15
γi = 1 αi = kvi ρi = σi = hvi r = h k
decision deadlock deadlock breaking
Two options with equal quality
γi = ρi = 1 αi = vi σi = ˆ σ
I
Deadlock
II
Possible deadlock
III
Deadlock breaking
r1 r2 2 4 6 8 10 0.0 0.2 0.4 0.6 0.8 1.0
r x1
I
Deadlock
II
Possible deadlock
III
Deadlock breaking
2 4 6 8 10 2 4 6 8 10
r v
Three options with equal quality
γi = 1 αi = kvi ρi = σi = hvi r = h k
III
Deadlock breaking
II
Possible deadlock
I Deadlock
v
1 2 3 5 10 2 3 4 5 6 7 8 10 20 30 40 50 60
N r
N options with equal quality
γi = 1 αi = kvi ρi = σi = hvi r = h k
One superior and two inferior options
.3. Swarm robotics system for search & exploitation
Robots exemplify embodiment challenges
.1. Multiagent simulations
networks
Basic case study to investigate several parameterisations
.2. Multiagent simulations for search & exploration
Mobile point-size particles capable to move in a 2D environment
.4. Coexistence in heterogeneous cognitive networks
fully-decentralised solution for channel selection in cognitive radio networks
.1. Multiagent simulations
networks
Basic case study to investigate several parameterisations
2 4 6 8 0.0 0.2 0.4 0.6 0.8 1.0 vi, i ≠ A ΨA 1 3 5 7 9 ODEs Gillespie Multi−agent n = 2 n = 3 n = 4 n = 5
vA vB Convergence time Success rate
A
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
N= 10 N= 50 N= 100 N= 500 N= 1000
Gillespie (master eq.) Multi-agent homogeneous Multi-agent heterogeneous
Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.
.2. Multiagent simulations for search & exploration
Mobile point-size particles capable to move in a 2D environment
Reina, A., Valentini, G., Fernández-Oto, C., Dorigo, M., & Trianni, V. (2015). A Design Pattern for Decentralised Decision Making. PLoS ONE, 10(10), e0140950–18.
.3. Swarm robotics system for search & exploitation
Robots exemplify embodiment challenges
video by A. Reina Reina, A., Miletitch, R., Dorigo, M., & Trianni, V. (2015). A quantitative micro–macro link for collective decisions: the shortest path discovery/selection example. Swarm Intelligence, 9(2-3), 75–102.
.3. Swarm robotics system for search & exploitation
Robots exemplify embodiment challenges
video by A. Reina Reina et al (2016): Effects of Spatiality on Value-Sensitive Decisions Made by Robot Swarms. In: Proceedings of DARS 2016, pp. 1–8, Natural History Museum in London, UK
the process that leads a group to (equally) divide labour among the group members
a set of tasks with different labour demands (utility)
agents are deployed to execute one or more tasks
individuals do not know task requirements and other’s preferences/choices
Gerkey, B. P., & Matarić, M. J. (2004). A Formal Analysis and Taxonomy of Task Allocation in Multi-Robot Systems. The International Journal of Robotics Research, 23(9), 939–954.
Theraulaz, G., Bonabeau, E., & Denuebourg, J. N. (1998). Response threshold reinforcements and division of labour in insect societies. Proceedings of the Royal Society of London. Series B: Biological Sciences, 265(1393), 327–332.
Sj, j ∈ {1, . . . , M} θij, i ∈ {1, . . . , N}
˙ Sj = δ − αnj N Pi(Sj) = S2
j
S2
j + θ2 ij
growth enrolled agents individual execution rate
for optimal task allocation?
specialised agents? What about generalists?
θij ← θij − ξ∆t if agent i performs task j θij ← θij + ξ∆t if agent i does not perform task j
task allocation
evaluate utility
completed
that need attention
collective decision
evaluate quality
low quality options
favourable options
competing options
number of enrolled agents:
dynamics:
˙ ui = −uini(δni − ξn2
i ),
ui ∈ [0, 1].
n? = 2δ 3ξ
γi = kui αi = kH(ν ui) ρi = hui σij = hui 2δ 3ξnj 2δ , i 6= j σii = (3ξN 2δ)(3ξNγi + 2δρi) 4δ2
number of enrolled agents:
dynamics:
interactive and spontaneous transitions
˙ ui = −uini(δni − ξn2
i ),
ui ∈ [0, 1].
n? = 2δ 3ξ
r = h k
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 ui xi
Time [s] xi,ui
50 100 150 200 250 300 0.0 0.2 0.4 0.6 0.8 1.0
ui ODEs Multi−agent
Time (s) Time (s)
share many important aspects
implement different task allocation strategies
winner-take-all strategies
task allocation becomes responsive to changes in utility