[PPT] - Complex Networks in Evolutionary Computation and Heuristic Search PowerPoint Presentation

SLIDE 1

Complex Networks in Evolutionary Computation and Heuristic Search

Marco Tomassini

Faculty of Business and Economics Information Systems Department University of Lausanne, Switzerland

SLIDE 2

Complex Networks

What are complex networks?

SLIDE 3

Complex Networks

What are complex networks?

They are large; larger than the networks that were common in

social sciences already some decades ago

SLIDE 4

Complex Networks

What are complex networks?

They are large; larger than the networks that were common in

social sciences already some decades ago

They have short diameters: going from any node to any other

node takes a few steps (O(logN), N being the number of vertices)

SLIDE 5

Complex Networks

What are complex networks?

They are large; larger than the networks that were common in

social sciences already some decades ago

They have short diameters: going from any node to any other

node takes a few steps (O(logN), N being the number of vertices)

They are clustered: locally many triangles and polygons; at

the mesoscopic scale: many of them, especially social ones, have communities

SLIDE 6

Complex Networks

What are complex networks?

They are large; larger than the networks that were common in

social sciences already some decades ago

They have short diameters: going from any node to any other

node takes a few steps (O(logN), N being the number of vertices)

They are clustered: locally many triangles and polygons; at

the mesoscopic scale: many of them, especially social ones, have communities

Their degree distribution functions P(k) are often

right-skewed: stretched exponentials, power-laws P(k) ∝ k−γ

SLIDE 7

The GP collaboration graph (B.W. Langdon)

SLIDE 8

The GP collaboration graph

Cumulative Degree Distribution

0.1 1 10 100 1000 10000 1 10 100

number of collaborators number of authors

SLIDE 9

The World Airlines Graph (2000))

●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
(courtesy of C. Rozenblat et al., 2006)

SLIDE 10

World Airlines: Cumulative DDF

1 2 5 10 20 50 100 200 500 5e−04 5e−03 5e−02 5e−01 degree k complementary cumulative frequency

SLIDE 11

Communities: Les Mis´ erables

M yriel

Napoleon M lleBaptistine M meM agloire CountessDeLo Geborand Champtercier Cravatte Count OldM an Labarre Valjean M arguerite M meDeR Isabeau Gervais Tholomyes Listolier Fameuil Blacheville Favourite Dahlia Zephine Fantine M meThenardier Thenardier Cosette Javert Fauchelevent Bamatabois Perpetue Simplice Scaufflaire Woman1 Judge Champmathieu Brevet Chenildieu Cochepaille Pontmercy Boulatruelle Eponine Anzelma Woman2 M otherInnocent Gribier Jondrette M meBurgon Gavroche Gillenormand M agnon M lleGillenormand M mePontmercy M lleVaubois LtGillenormand M arius BaronessT M abeuf Enjolras Combeferre Prouvaire Feuilly Courfeyrac Bahorel Bossuet Joly Grantaire M otherPlutarch Gueulemer Babet Claquesous M ontparnasse Toussaint Child1 Child2 Brujon M meHucheloup

SLIDE 12

Random Graphs

Erd¨
s-R´

enyi: an arbitrary edge is present with probability p and absent with probability 1 − p

SLIDE 13

Random Graphs

Erd¨
s-R´

enyi: an arbitrary edge is present with probability p and absent with probability 1 − p

P(k) = e−¯

k¯

kk/k! is Poissonian, nodes with ¯ k ± 3σ are extremely improbable

paths are short (if ¯

k > 1)

the clustering coefficient ¯

C → 0 as N → ∞

degree correlations are absent

SLIDE 14

Random Graphs

Erd¨
s-R´

enyi: an arbitrary edge is present with probability p and absent with probability 1 − p

P(k) = e−¯

k¯

kk/k! is Poissonian, nodes with ¯ k ± 3σ are extremely improbable

paths are short (if ¯

k > 1)

the clustering coefficient ¯

C → 0 as N → ∞

degree correlations are absent
Random graphs with given degree sequences {k1, k2, . . . , kN}

SLIDE 15

Random Graphs

Erd¨
s-R´

enyi: an arbitrary edge is present with probability p and absent with probability 1 − p

P(k) = e−¯

k¯

kk/k! is Poissonian, nodes with ¯ k ± 3σ are extremely improbable

paths are short (if ¯

k > 1)

the clustering coefficient ¯

C → 0 as N → ∞

degree correlations are absent
Random graphs with given degree sequences {k1, k2, . . . , kN}
P(k) ok
no degree correlations, no clustering as N → ∞

SLIDE 16

Random Graphs

Erd¨
s-R´

enyi: an arbitrary edge is present with probability p and absent with probability 1 − p

P(k) = e−¯

k¯

kk/k! is Poissonian, nodes with ¯ k ± 3σ are extremely improbable

paths are short (if ¯

k > 1)

the clustering coefficient ¯

C → 0 as N → ∞

degree correlations are absent
Random graphs with given degree sequences {k1, k2, . . . , kN}
P(k) ok
no degree correlations, no clustering as N → ∞

These models are unrealistic but very useful as statistical null models

SLIDE 17

Regular Graphs

In regular graphs all the nodes have the same degree: Grids are often used but they are not good models for real networks: long paths, the same number of contacts for everybody

SLIDE 18

Model Complex Networks

Beyond random and regular graphs, including complete graphs, there exist today several new models that are much more faithful to actual network topologies. Here we shall describe the two oldest and classical:

SLIDE 19

Model Complex Networks

Beyond random and regular graphs, including complete graphs, there exist today several new models that are much more faithful to actual network topologies. Here we shall describe the two oldest and classical:

Watts–Strogatz networks
Barab´

asi–Albert networks

SLIDE 20

Watts–Strogatz Networks

Go iteratively through each node and rewire each link with some small probability β to a randomly chosen node “Shortcuts”shorten the mean path length by a large amount.

SLIDE 21

Watts–Strogatz Model: CC and Mean Path Length

Small-World region: short path length and high clustering coefficient

10

−4

10

−3

10

−2

10

−1

10 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Scaled Length Scaled Clustering

The P(k) remains Poissonian however, no degree-heterogeneity

Redrawn from Watts and Strogatz ’98

SLIDE 22

Barab´ asi–Albert Growing Scale-Free Networks

Start with a small clique (a completely connected graph) of

N0 nodes and M0 edges.

SLIDE 23

Barab´ asi–Albert Growing Scale-Free Networks

Start with a small clique (a completely connected graph) of

N0 nodes and M0 edges.

At each time step:
a new node is added such that its m ≤ N0 edges link it to m

nodes already in the graph.

SLIDE 24

Barab´ asi–Albert Growing Scale-Free Networks

Start with a small clique (a completely connected graph) of

N0 nodes and M0 edges.

At each time step:
a new node is added such that its m ≤ N0 edges link it to m

nodes already in the graph.

the probability π that a new node will be connected to node i

is such that highly connected nodes are more likely to be chosen: π(ki) =

ki P

j kj

SLIDE 25

Barab´ asi–Albert Growing Scale-Free Networks

Start with a small clique (a completely connected graph) of

N0 nodes and M0 edges.

At each time step:
a new node is added such that its m ≤ N0 edges link it to m

nodes already in the graph.

the probability π that a new node will be connected to node i

is such that highly connected nodes are more likely to be chosen: π(ki) =

ki P

j kj

such a growing graph evolves into a stationary scale-free

network with a power-law degree probability distribution P(k) ∼ k−γ, with γ ∼ 3.

SLIDE 26

A Small Computer-Generated BA Scale-Free Network

SLIDE 27

Barab´ asi–Albert Networks: Properties

Barab´

asi–Albert networks are based on preferential attachment which is an effect that has been observed in real networks, for example in references to web pages or to fundamental scientific articles among many others.

They are much better than WS networks as models of real

graphs

They are very robust against random attacks
However, they lack clustering and communities

SLIDE 28

Some references for this part

1. M. E. J. Newman, Networks: An Introduction, Oxford

University Press, 2010 (The most complete)

2. G. Caldarelli, Scale-Free Networks, Oxford University

Press, 2007 (Easier)

3. M. E. J. Newman, The structure and function of complex

networks, SIAM Review, 45, 167-256, 2003. (Shorter)

SLIDE 29

Complex Networks in Evolutionary Computation and Heuristics

We shall deal with two applications of complex networks in artificial evolution and search:

SLIDE 30

Complex Networks in Evolutionary Computation and Heuristics

We shall deal with two applications of complex networks in artificial evolution and search:

population structures, selection pressure, and evolutionary

algorithms

SLIDE 31

Complex Networks in Evolutionary Computation and Heuristics

We shall deal with two applications of complex networks in artificial evolution and search:

population structures, selection pressure, and evolutionary

algorithms

network representation of search spaces

SLIDE 32

Population Structures in Evolutionary Algorithms

Typically three types: single well-mixed, communicating well-mixed subpopulations, and cellular

SLIDE 33

Population Structures in Evolutionary Algorithms

Typically three types: single well-mixed, communicating well-mixed subpopulations, and cellular

cellular populations have always been low-degree regular

graphs such as rings and toroidal grids

SLIDE 34

Population Structures in Evolutionary Algorithms

Typically three types: single well-mixed, communicating well-mixed subpopulations, and cellular

cellular populations have always been low-degree regular

graphs such as rings and toroidal grids

here we shall explore other topologies drawn from the

world of complex networks

SLIDE 35

Population Structures in Evolutionary Algorithms

Typically three types: single well-mixed, communicating well-mixed subpopulations, and cellular

cellular populations have always been low-degree regular

graphs such as rings and toroidal grids

here we shall explore other topologies drawn from the

world of complex networks

in evolutionary algorithms social or biological credibility of

the contact structure is not strictly required and thus we can use what fits us best

SLIDE 36

Takeover Times on Watts–Strogatz Networks

50 100 150 200 250 300 100 200 300 400 500 600 700 800 900 1000

Time Steps Best Individual Copies

SW β = 0.8 SW β = 0.02 SW β = 0.005 SW β = 0.001 RING

One can indeed control the selection pressure (here binary tournament) from linear (ring) to exponential (random graph and well mixed population)

SLIDE 37

Takeover Times on Barab´ asi–Albert Networks

5 10 15 20 25 30 100 200 300 400 500 600 700 800 900 1000 Time Steps Best Individual Copies SF Synchronous SF Asynchr NRS SF Asynchr UC 5 10 15 20 25 30 100 200 300 400 500 600 700 800 900 1000 Time Steps Best Individual Copies

Left: best individual uniformly distributed; Right: best starts in a hub. Propagation of the best is extremely fast, especially when the

riginal individual sits on a hub

SLIDE 38

Some Experimental Results on Discrete Problems I

Success rates for small-world (left) and scale-free (right) topology

n the MMDP problem. Each point is the average of 100 runs

SLIDE 39

Some Experimental Results on Discrete Problems II

Success rates for small-world (left) and scale-free (right) topology

n the ECC problem. Each point is the average of 100 runs

SLIDE 40

Some Experimental Results on Discrete Problems III

Success rates for small-world (left) and scale-free (right) topology

n the P-PEAKS problem. Each point is the average of 100 runs

SLIDE 41

Summary of Results on Static Complex Networks

The results on static population graphs are somewhat mixed

SLIDE 42

Summary of Results on Static Complex Networks

The results on static population graphs are somewhat mixed

In general, Watts–Strogatz small worlds behave better

than complete graph (panmictic) populations for low rewiring probabilities (true small-worlds)

SLIDE 43

Summary of Results on Static Complex Networks

The results on static population graphs are somewhat mixed

In general, Watts–Strogatz small worlds behave better

than complete graph (panmictic) populations for low rewiring probabilities (true small-worlds)

Watts–Strogatz graphs with larger β approach the

random graph and panmictic population limit and are less effective

SLIDE 44

Summary of Results on Static Complex Networks

The results on static population graphs are somewhat mixed

In general, Watts–Strogatz small worlds behave better

than complete graph (panmictic) populations for low rewiring probabilities (true small-worlds)

Watts–Strogatz graphs with larger β approach the

random graph and panmictic population limit and are less effective

Scale-free structured populations do not seem to help

much: they converge too fast, even when the initial clique is small

SLIDE 45

Summary of Results on Static Complex Networks

The results on static population graphs are somewhat mixed

In general, Watts–Strogatz small worlds behave better

than complete graph (panmictic) populations for low rewiring probabilities (true small-worlds)

Watts–Strogatz graphs with larger β approach the

random graph and panmictic population limit and are less effective

Scale-free structured populations do not seem to help

much: they converge too fast, even when the initial clique is small

Dynamically varying population topologies might be the

answer

SLIDE 46

Some references for this part

1. E. Alba and B. Dorronsoro, Cellular Genetic Algorithms,

Springer, 2008

2. M. Tomassini, Spatially Structured Evolutionary Algorithms,

Springer, 2005

3. M. Giacobini, M. Tomassini, A. Tettamanzi, Takeover time

curves in random and small-world structured populations, GECCO ’05, ACM Press , 1333-1340, 2005.

4. M.Giacobini, M. Preuss, M. Tomassini, Effects of scale-free

and small-world topologies on binary-coded, self-adaptive CAS, EvoCop 2006, Lecture Notes in Computer Science, 3906, 86-98, Springer, 2006. More recent work in this direction, including dynamical and self-organized networked populations: J. Payne, J.-L. Jim´ enez-Laredo and cw, Whitacre and cw, plus a few other groups

SLIDE 47

Discrete Search Spaces

Fitness landscape (S, V, f ) :

S : set of admissible solutions,
V : S → 2S : neighborhood

function,

f : S → I

R : fitness function.

SLIDE 48

Basin of Attraction

Hill-Climbing (HC) algorithm

SLIDE 49

Basin of Attraction

Hill-Climbing (HC) algorithm

Choose initial solution s ∈ S repeat choose s

′ ∈ V(s) such that f (s ′) = maxx∈V(s) f (x)

if f (s) < f (s

′) then

s ← s

′

end if until s is a Local Optimum

SLIDE 50

Basin of Attraction

Hill-Climbing (HC) algorithm

Choose initial solution s ∈ S repeat choose s

′ ∈ V(s) such that f (s ′) = maxx∈V(s) f (x)

if f (s) < f (s

′) then

s ← s

′

end if until s is a Local Optimum Basin of attraction of s∗ : {s ∈ S | HillClimbing(s) = s∗}.

SLIDE 51

Local Optima Network (LON)

i j

arbitrary configuration local maximum

ij
ji

Nodes : set of local optima S∗ Edges : notion of connectivity between basins

eij between i and j if there is at

least a pair of neighbours si and sj ∈ V(si) such that si ∈ bi and sj ∈ bj

weights wij attached to edges →

transition probabilities between basins

SLIDE 52

Underlying Ideas and Goals

Bring the tools of complex networks analysis to the study
f the structure of combinatorial fitness landscapes

SLIDE 53

Underlying Ideas and Goals

Bring the tools of complex networks analysis to the study
f the structure of combinatorial fitness landscapes
Relate problem features such as fitness distribution,

basins number and size distribution etc. with network structure

SLIDE 54

Underlying Ideas and Goals

Bring the tools of complex networks analysis to the study
f the structure of combinatorial fitness landscapes
Relate problem features such as fitness distribution,

basins number and size distribution etc. with network structure

Use network information to design more effective

heuristic search algorithms

SLIDE 55

The NK Landscape Case: Outgoing weight distribution

0.1 1 0.001 0.01 0.1 P(wij>W) W K=2 K=4 K=10 K=12 K=15

Cumulative distribution of the network weights wij for outgoing edges with j = i in log-log scale, N = 16

Weights (transition prob. to

neighbouring basins) are small

The distributions are not

uniform or Poissonian, nor power laws

For high K the decay is

faster

Low K has longer tails (on

average the transition probabilities are higher for low K)

SLIDE 56

Average weights to remain in the same basin

0.1 0.2 0.3 0.4 0.5 0.6 0.7 2 4 6 8 10 12 14 16 18 average wii K N=14 N=16 N=18

Average weight wii according to the parameter N and K

Weights to remains in the

same basin wii, are large compared to wij with i = j

wii are higher for low K

(50% for K = 2, above 12% for high K),

It seems easier to leave the

basin for high K (high exploration), however, number the of local optima increases fast with K

SLIDE 57

Global optimum basin size

1e-05 0.0001 0.001 0.01 0.1 1 2 4 6 8 10 12 14 16 18 Normalized size of the global optima’s basin K N=16 N=18

Size of the basin corresponding to the global maximum for each K

Trend: the basin shrinks very

quickly with increasing K.

For higher K, it is more

difficult for a search algorithm to locate the basin

f attraction of the global
ptimum

SLIDE 58

Fitness vs. basin size

1 10 100 1000 10000 0.5 0.55 0.6 0.65 0.7 0.75 0.8 basin of attraction size fitness of local optima exp.

regr. line

Fitness of local optima vs. their corresponding basins sizes

Trend: clear positive

correlation between the fitness values of maxima and their basins’ sizes

On average, the global
ptimum seems easier to

find than another local

ptimum, however, the

number of local optima increases exponentially with increasing K

SLIDE 59

Community Structure in the LON of a Small Real-Like Instance of QAP

SLIDE 60

Some references for this part

1. G. Ochoa, M. Tomassini, S. V´

erel, C. Darabos, A Study of NK Landscapes’ Basins and Local Optima Networks, GECCO ’08, ACM Press, 555-562, 2008.

2. M. Tomassini,S. V´

erel, G. Ochoa, Complex Networks Analysis

f Combinatorial Spaces: the NK Landscape Case, Phys. Rev.

E, 78, 6, 066114, 2008.

3. S. V´

erel, G. Ochoa, M. Tomassini, Local Optima Networks of NK Landscapes with Neutrality, IEEE Transactions on Evolutionary Computation, to appear, 2011.

4. F. Daolio, M. Tomassini, S. V´