L ECTURE 20: S WARM I NTELLIGENCE 1 / P ARTICLE S WARM O PTIMIZATION - - PowerPoint PPT Presentation

l ecture 20 s warm i ntelligence 1 p article s warm o
SMART_READER_LITE
LIVE PREVIEW

L ECTURE 20: S WARM I NTELLIGENCE 1 / P ARTICLE S WARM O PTIMIZATION - - PowerPoint PPT Presentation

15-382 C OLLECTIVE I NTELLIGENCE S19 L ECTURE 20: S WARM I NTELLIGENCE 1 / P ARTICLE S WARM O PTIMIZATION 1 T EACHER : G IANNI A. D I C ARO L IMITATIONS OF THE ( CLASSIC ) CA MODEL 2D Environment tized according to an ! # regular lattice


slide-1
SLIDE 1

LECTURE 20: SWARM INTELLIGENCE 1 / PARTICLE SWARM OPTIMIZATION 1

TEACHER: GIANNI A. DI CARO

15-382 COLLECTIVE INTELLIGENCE – S19

slide-2
SLIDE 2

2

LIMITATIONS OF THE (CLASSIC) CA MODEL

§ Sp Spac ace is di discreti tized according to an ! × # regular lattice § The CA is equivalent to $ × % coupled it iterated m maps § Coupling is determined by the definition of neigh ghbor

  • rhood
  • ods, and by

bou

  • undary con
  • ndition
  • ns and synchron
  • nization
  • n

§ Neighborhoods are statically defined based on the lattice’s topology, and capture some notion of meaningful spatial prox

  • ximity

§ The CA is useful as a simulation

  • n mod
  • del for a dynamical system, but has

some obvious limitations for different types of use 2D Environment

slide-3
SLIDE 3

3

FROM CA TO PSO / SI

Discrete à Con

  • ntinuou
  • us

!" !# !$ %(', ) ' ) Spatially-fixed cell state ~ FSM à Age gent-mod

  • del:

§ internal state § mobile Spatially-related neighborhood (static) Physical topology induced by the lattice à Relation

  • nal neigh

ghbor

  • rhood
  • od

§ Agents form a network § Logical topology § Can be dynamic

slide-4
SLIDE 4

4

REFERENCE TASK: GLOBAL FUNCTION OPTIMIZATION

Find the gl glob

  • bal maximum of the function ! "

(and the point " where it happens) Find the gl glob

  • bal minimum of the function
slide-5
SLIDE 5

5

OPTIMIZATION PROBLEMS

§ Optimization problems expressed in mathematical form: min

$ % $

subject to $ ∈ ℱ § %: ℝ* ⟼ ℝ, is the ob

  • bjective function
  • n (for now, m=1)

§ $ ∈ -* ⊆ ℝ* is the op

  • ptimization
  • n vector
  • r variable

§ ℱ ⊆ -* is the fe feasibl ble set (constraints = values the variables can feasibly take) § $∗ ∈ -* is an op

  • ptimal sol
  • lution
  • n (gl

glob

  • bal minimum) if

$∗ ∈ ℱ and % $∗ ≤ %($) for all $ ∈ ℱ § Mathematical programming prob

  • blem
slide-6
SLIDE 6

6

BASIC PROPERTIES

Given an optimization problem: min

$ % $

subject to $ ∈ ℱ, $ ∈ )* ⊆ ℝ* § min

$ % $ is equivalent to m-. $

−% $ § If ℱ = ∅ the problem has no solution (unfeasible) § If ℱ is an open set, only the inf (sup) is guaranteed but not min (max) § The problem is unbounded if f → −∞

slide-7
SLIDE 7

7

UNCONSTRAINED VS. CONSTRAINED OPT

§ Sublevel sets (isolines): {" ∈ ℝ%: ' " = )}

ü Any constrained optimization problem can be formulated as an unconstrained

  • ne by including constraint violations as penalty terms in the objective function
slide-8
SLIDE 8

8

GLOBAL AND LOCAL OPTIMALITY

§ A point ! ∈ #$ ⊆ ℝ$ is gl globa

  • bally opt
  • ptimal (global minimum) if ! ∈ ℱ

and for all ( ∈ ℱ, ) ! ≤ )(() § A point ! ∈ ℝ$ is loc

  • cally opt
  • ptimal (local minimum) if ! ∈ ℱ and

there exists ε > 0 small such that for all ( ∈ ℱ with ! − ( 1 ≤ ε, ) ! ≤ )(()

slide-9
SLIDE 9

9

GLOBAL AND LOCAL OPTIMALITY

§ What about discrete spaces? ! ∈ #$ ⊆ ℤ$

min ZILP = x2 s.t. 2x1 + x2 > 13 5x1 + 2x2 6 30 −x1 + x2 > 5 x1, x2 ∈ Z+

slide-10
SLIDE 10

10

BLACK-BOX OPTIMIZATION

§ The function to optimize is not given in algebraic form, or § The function is given, but it’s not amenable to analytical treatment in terms of using its derivatives for finding min/max § All we can do is to query the black box and observe (", $$ " ) pairs… § Bl Black-box box / / Derivative-free opt

  • ptimization
  • n vs. Whi

hite box box opt

  • ptimization
  • n
slide-11
SLIDE 11

11

HOW DO WE FIND MINIMA / MAXIMA?

§ Using rates of change: derivatives / gradients / Jacobians / Hessians / … § Sampling / Searching in the !" ⊆ ℝ" domain, the input state space: § Figure out where to search / sample next, from the values that are returned from the function, without generating a model of the function § Using the sampled data to generate a model of the function and in turn, using it to iteratively direct the search § Iteratively constructing a solution, by adding / trying out assignment to solution components %&, %(, … %" § … many, many variants and combinations of these two basic approaches...

state space % ∈ ! Objective function +(%) global maximum shoulder local maximum “flat” local maximum % “saddle”

slide-12
SLIDE 12

12

PARTICLE SWARM OPTIMIZATION (PSO)

  • J. Kennedy, R. Eberhart, Particle Swarm Optimization. Proc. 4th IEEE Int. Conf. on Neural Networks, 1995.

§ Mu Multi-age gent black-box

  • x op
  • ptimization
  • n

inspired by soc

  • cial and roos
  • osting

g behavior

  • r of
  • f floc
  • cking

g birds: § Each agent (a particle) encodes a solution point ! § Agents move in "# ⊆ ℝ#, searching for the spots regions/points where the

  • bjective function gets its max (min) values

§ Individual swarm members establish a soc

  • cial networ
  • rk and can profit from the

discoveries and previous experience of the other members of the swarm: § Each agent iteratively changes its position (i.e., decides how to move) using information from personal past experience and from its social neighborhood

&' &(

slide-13
SLIDE 13

13

COMPLEX SYSTEMS ↔ SWARM INTELLIGENCE/PSO

Com

  • mplex systems (definition adapted from Lecture 1):

ü Multi-agent / Multi-component ü Distributed: each agent/component is situated in the embedding environment and acts autonomously ü Decentralized: neither central controller, nor representation of global patterns/goals ü Possibly (not necessarily) with a large number of components ü Localized interactions (allowing propagation of information) ü Emerging and / or Self-Organizing properties ü Agents do not need to be “complex” ü Dynamic: Time and space evolution of the system

slide-14
SLIDE 14

14

PSO IS A FIRST EXAMPLE OF SWARM INTELLIGENCE

Swarm intellige gence: Study and design gn of complex systems that: § Are potentially made of a large number of components, a swa swarm § Each component has purpos

  • se(s) (as for animals, or artificially designed

agents), that implicitly contributes to the “performance” of the whole § Under certain conditions, the system displays forms of swarm intelligence in terms of generation at the system-level, of effective spatio-temporal patterns and/or optimized decision-making and action-making

Mod Modeling: g: study of natural complex systems with the above characteristics in order to identify the local rules that give raise to complex system-level behaviors and self-organization, make formal models Engi gineering: g: bottom-up design of artificial systems that display useful system-level behaviors, possibly, but not necessarily, taking inspiration from the natural systems

Mimicking nature: Bi Bio-mi mimet metic c (algor gorithms, rob

  • bot
  • ts)
slide-15
SLIDE 15

15

PSO IS A FIRST EXAMPLE OF SWARM INTELLIGENCE

Swarm intellige gence: Study and design gn of complex systems that: § Are potentially made of a large number of components, a swa swarm § Each component is “sentient” and has purpos

  • se(s)

(s) (as for animals, or artificially designed agents), that implicitly contributes to the “performance” of the whole § Under certain conditions, the system displays forms of swarm intelligence in terms of generation at the system-level of effective spatio-temporal patterns and/or optimized decision-making and action-making

Mod Modeling: g: study of natural complex systems with the above characteristics in order to identify the local rules that give raise to complex system-level behaviors and self-organization, make formal models Engi gineering: g: bottom-up design of artificial systems that display useful system-level behaviors, possibly, but not necessarily, taking inspiration from the natural systems

Mimicking nature: Bi Bio-mi mimet metic c (algor gorithms, rob

  • bot
  • ts)
slide-16
SLIDE 16

16

BOTTOM-UP VS. TOP-DOWN DESIGN

Ontogenetic and phylogenetic evolution has (necessarily) followed a bot

  • ttom
  • m-up

up approach (grassroots) to “design” systems: § Instantiation

  • n of
  • f the basic units (atoms, cells, organs, organisms, individuals)

composing the system and let them (se (self lf-)or

  • rga

ganize to generate more complex/organized system-level behaviors, structures, and functions § Pop

  • pulation
  • n + Interaction
  • n prot
  • toc
  • col
  • ls are more important than single modules

§ System-level structural patterns and behaviors are emerging properties From an engineering point of view we can also choose a top

  • p-dow
  • wn approach:

§ Acquisition of comprehensive knowledge about the problem/system, make analysis, decomposition, definition of a possibly optimal strategy § Amenable to formal analysis, “predictable” response

slide-17
SLIDE 17

17

MANY DIFFERENT PARADIGMS OF SI

Different ways of modeling com

  • mmunication
  • ns, con
  • nnection
  • n top
  • pol
  • logy
  • gy,

and spatial distribution

  • n have given raise to different SI frameworks

§ Poi

  • int-to

to-poi

  • int com
  • mmunication
  • n (one-to-one): two agents get in direct contact (e.g.,

antennation, trophallaxis, axons and dendrites in neurons) § Lim Limit ited-range ge infor

  • rmation
  • n broa
  • adcast (one-to-many): the signal propagates to some

limited extent throughout the environment and/or is available for a short time (e.g., fish’ use of lateral line to detect water waves, visual detection) § Indirect com

  • mmunication
  • n: two individuals interact indirectly when one of them

modifies the environment and the other responds to the modified environment, maybe at a later time (e.g., stigmergy, pheromone communication in ants) § Physical mob

  • bility: individuals move through the states of the environment, such as

the connection topology changes over time (based on communication capability), different environment areas are accessed in parallel § Static pos

  • sition
  • ning,

g, state evol

  • lution
  • n: connection topology and/or positioning in the

environment do not change over time. Local information propagates in multi-hop

  • modality. The internal state of an individual changes over time.
slide-18
SLIDE 18

18

SOME SI AND SI-RELATED FRAMEWORKS

slide-19
SLIDE 19

2

BACKGROUND: REYNOLDS’ BOIDS

Reynolds, C.W.: Flocks, herds and schools: a distributed behavioral model. Computer Graphics, 21(4), p.25-34, 1987

Reynolds created a model of coordinated animal motion in which the agents (boids) obeyed three simple local rules:

Separation: steer to avoid crowding local flockmates Alignment: steer towards the average heading of local flockmates Cohesion: steer to move toward the average position

  • f local flockmates

https://www.youtube.com/watch?v=QbUPfMXXQIY

Field of view Play with Boids, PSO, CA, and … with NetLogo https://ccl.northwestern.edu/netlogo/

slide-20
SLIDE 20

3

BOIDS + ROOSTING BEHAVIOR

Kennedy and Eberhart included a roost, or, more generally, an attraction point (e.g, a prey) in a simplified Boids-like simulation, such that each agent: Eventually, (almost) all agents land on the roost What if:

  • roost = (unknown) extremum of a function
  • distance to the roost = quality of current

agent position on the optimization landscape

  • is attracted to the location of the roost,
  • remembers where it was closer to the roost,
  • shares information with its neighbors about its closest

location to the roost

slide-21
SLIDE 21

4

PARTICLE SWARM OPTIMIZATION (PSO)

  • PSO consists of a swarm of bird-like particles ➔ Multi-agent system
  • At each discrete time step, each particle is found at a position in the search space:

it encodes a solution point 𝒚 ∈ 𝛹𝑜 ⊆ ℝ𝑜 for an optimization problem with objective function f(𝒚): 𝛹𝑜 ⟼ ℝm (black-box or white-box problem)

  • The fitness of each particle represents the quality of its position on the optimization

landscape (note: this can be virtually anything, think about robots looking for energy)

  • Particles move over the search space with a certain velocity
  • Each particle has: Internal state + (Neighborhood ⟷ Network of social connections)

{~ x,~ v, ~ xpbest, N(p)}

  • At each time step, the velocity (both direction and speed) of

each particle is influenced + random, by:

  • pbest: its own best position found so far
  • lbest: the best solution that was found so far by the

teammates in its social neighbor, and/or

  • gbest: the global best solution so far
  • “Eventually” the swarm will converge to optimal positions
slide-22
SLIDE 22

5

NEIGHBORHOODS

Geographical Social Global

slide-23
SLIDE 23

6

VECTOR COMBINATION OF MULTIPLE BIASES

~ xpbest ~ xlbest − ~ xt !~ vt ~ xpbest − ~ xt ~ xt+1 ~ xt r2 · (~ xlbest − ~ xt) ~ vt r1 · (~ xpbest − ~ xt) ~ xlbest

slide-24
SLIDE 24

7

PARTICLE SWARM OPTIMIZATION (PSO)

~ r1 = U(0, 1) ~ r2 = U(0, 2)

ɸ are acceleration coefficients determining scale of forces in the direction of individual and social biases

element-wise multiplication operator

slide-25
SLIDE 25

8

VECTOR COMBINATION OF MULTIPLE BIASES

  • Makes the particle move in the same

direction and with the same velocity

  • Improves the individual
  • Makes the particle return to a previous

position, better than the current

  • Conservative
  • Makes the particle follow the best

neighbors direction

  • 1. Inertia
  • 2. Personal

Influence

  • 3. Social

Influence

Exploits what good so far Search for new solutions

slide-26
SLIDE 26

9

PSO AT WORK (MAX OPTIMIZATION PROBLEM)

Example slides from Pinto et al.

slide-27
SLIDE 27

10

PSO AT WORK

slide-28
SLIDE 28

11

PSO AT WORK

slide-29
SLIDE 29

12

PSO AT WORK

slide-30
SLIDE 30

13

PSO AT WORK

slide-31
SLIDE 31

14

PSO AT WORK

slide-32
SLIDE 32

15

PSO AT WORK

slide-33
SLIDE 33

16

PSO AT WORK

slide-34
SLIDE 34

17

PSO AT WORK

slide-35
SLIDE 35

18

PSO AT WORK