Modeling the Internet: stat. observables, dynamical approaches, - - PowerPoint PPT Presentation

modeling the internet stat observables dynamical
SMART_READER_LITE
LIVE PREVIEW

Modeling the Internet: stat. observables, dynamical approaches, - - PowerPoint PPT Presentation

Modeling the Internet: stat. observables, dynamical approaches, parameter proliferation.. A.Vespignani Collaborators Romualdo Pastor-Satorras Ignacio Alvarez-Hamelin Luca DallAsta Alain Barrat Vic


slide-1
SLIDE 1

Modeling the Internet: stat.

  • bservables, dynamical

approaches, parameter proliferation………………..

A.Vespignani

slide-2
SLIDE 2

Romualdo Pastor-Satorras Ignacio Alvarez-Hamelin Luca Dall’Asta Alain Barrat Vic Colizza Mark Meiss Filippo Menczer Mariangels Serrano Alexei Vazquez Collaborators

slide-3
SLIDE 3

Once upon a time there was the physical Internet….

slide-4
SLIDE 4

The beginning…..

Faloutsos et al. 1999

Degree distribution of the Internet graph (AS and Router level) Measurement infrastructures Passive/active measurements (CAIDA; NLANR; Lumeta…)

slide-5
SLIDE 5

Internet graphs…..

Skewed Heterogeneity and high

variability

Very large fluctuations

(variance>>average)

Various fits : power-

law+cut-off; Weibull etc.

slide-6
SLIDE 6
slide-7
SLIDE 7

Higher order statistical characterization…. Model validation…… Model construction…..

slide-8
SLIDE 8

Multi-point correlations P(k,k’)

  • 0-dimensional projection (pearson coefficient)
  • M. Newman (2002)
  • One-dimensional projection (average nearest neighbor degree)

Pastor-Satorras & A.V. (2001)

  • Three dimensional analysis

Maslov&Sneppen (2002)

slide-9
SLIDE 9

Average nearest neighbors degree

knn(i) = Σj kj

1 ki Correlation spectrum: Average over degree classes < knn(k)>

Multi-point correlations P(k,k’)

slide-10
SLIDE 10

Degree correlation function

< knn(k)> = Σk’ k’ p(k’|k)

k (k)

k

nn

Assortative Disassortative

k

  • Assortative behaviour: growing knn(k)

Example: social networks Large sites are connected with large sites

  • Disassortative behaviour: decreasing knn(k)

Example: internet Large sites connected with small sites

slide-11
SLIDE 11

Degree correlation function

< knn(k)> = Σk’ k’ p(k’|k)

Highly degree ASs connect to low degree ASs Low degree ASs connect to high degree ASs No hierarchy for the router map

Pastor Satorras, Vazquez &Vespignani, PRL 87, 258701 (2001)

slide-12
SLIDE 12
slide-13
SLIDE 13

Clustering coefficient = connected peers will

likely know each other

C =

# of links between 1,2,…k neighbors k(k-1)/2 1 2 3 n

Higher probability to be connected

slide-14
SLIDE 14

Clustering spectrum

Clustering spectrum

This is a kind of three-points correlation function…..

slide-15
SLIDE 15

Clustering Clustering Spectrum Spectrum in the Internet in the Internet

Clustering coefficient as a function of the vertex degree

Highly degree ASs bridge not connected regions of the Internet Low degree ASs have links with highly interconnected regions of the Internet

Vazquez et al. Physical Review E 65, 066130 (2002).

slide-16
SLIDE 16
slide-17
SLIDE 17

Rich-Club coefficient

Fraction of edges shared by nodes of degree >k with respect to the Maximum allowed number. Increasing interconnectivity for increasing k

Rich-club phenomenon??

slide-18
SLIDE 18
slide-19
SLIDE 19

Normalized rich-club coefficient

It is possible to show that for a completely uncorrelated network Coefficient of the maximally randomized equivalent graph

NO rich club phenomenon

V.Colizza et al.

slide-20
SLIDE 20

K-core decomposition

K-shell K-shell K-shell

slide-21
SLIDE 21

K-core structure…

slide-22
SLIDE 22

http://xavier.informatics.indiana.edu/lanet-vi

slide-23
SLIDE 23

Betweenness centrality = # of shortest paths traversing a vertex or edge

(flow of information ) if each individuals send a message to all other individuals

Non-local measure of centrality

slide-24
SLIDE 24

Beteweenness Probability distribution

Heavy-tailed and highly heterogeneous

slide-25
SLIDE 25

Scale-free topology generators INET (Jin, Chen, Jamin) BRITE (Medina & Matta) Classical topology generators

  • Waxman generator
  • Structural generators

Transit-stub Tiers

Exponentially Bounded Degree distributions Modeling of the Network structure with ad-hoc algorithms tailored on the properties we consider more relevant

slide-26
SLIDE 26

What about the degree distribution ?

Heavy tails ?

Static construction Molloy-reed Position model Hidden variables Etc. Generalized random graphs with pre-assigned degree distribution

slide-27
SLIDE 27

Shift of focus: Static construction Dynamical evolution Direct problem Evolution rules Emerging topology Inverse problem Given topology Evolution rules

slide-28
SLIDE 28

P(k) ~k-3 The rich-get-richer mechanism

(Barabasi& Albert 1999)

slide-29
SLIDE 29

Continuous approximations Average degree value that the node born at time s has a time t Evolution equation

{ }

Degree distribution

slide-30
SLIDE 30

BA is a conceptual model…. It has not been thought to specifically

model the Internet

More details/realism/ingredients needed

slide-31
SLIDE 31

COPY MODEL

Dynamical evolution Preferential attachment component Degree distribution

slide-32
SLIDE 32

More models

  • Generalized BA model

(Redner et al. 2000) (Mendes & Dorogovstev 2000) (Albert et al.2000)

≅ Π

j j j i i i

k k k η η ) ( Non-linear preferential attachment : Π(k) ~ kα Initial attractiveness : Π(k) ~ A+kα Rewiring

  • Highly clustered

(Eguiluz & Klemm 2002)

  • Fitness Model

(Bianconi et al. 2001)

  • Multiplicative noise

(Huberman & Adamic 1999)

slide-33
SLIDE 33

Heuristically Optimized Trade-offs (HOT)

Papadimitriou et al. (2002)

New vertex i connects to vertex j by minimizing the function Y(i,j) = α d(i,j) + V(j) d= euclidean distance V(j)= measure of centrality

Optimization of conflicting objectives

slide-34
SLIDE 34

Model validation……

Correlations Clustering Hierarchies (k-cores, modularity etc.) ………..

slide-35
SLIDE 35

Clustering spectrum Correlation spectrum

slide-36
SLIDE 36

Clustering spectrum Correlation spectrum

slide-37
SLIDE 37

Rich-club coefficient

slide-38
SLIDE 38

K-core structure

slide-39
SLIDE 39

http://xavier.informatics.indiana.edu/lanet-vi

slide-40
SLIDE 40

E E-

  • R model

R model

slide-41
SLIDE 41

B-A Model

slide-42
SLIDE 42

Wide spectrum of complications and complex features to include…

IP 1 IP 2 IP 3

Simple Realistic

Ability to explain (caveats) trends at a population level Model realism looses in transparency. Validation is harder.

slide-43
SLIDE 43

Wide spectrum of complications and complex features to include…

Conceptual/theoretical Data driven Value if data are realistic and parameters are physical IP 1 IP 2 IP 3

slide-44
SLIDE 44

Agent Based Modeling

The good…

Data driven:

Demographic, societal, census data from real experiments

Sensibility analysis / scenario evaluation

The bad…

Non-physical parameters

(non-measurable/fitness/unmotivated parameters

etc.)…

slide-45
SLIDE 45

Physical Parameters ??

Measurable quantity. Combination of measurable quantities. Parameters appearing from the symmetry and consistency of

equations. Hints..

Minimum number of free (measurable) parameters…. Falsifiable requisite for the model….

slide-46
SLIDE 46

A few examples….

BA model Rewiring/copy model Fitness model HOT

≅ Π

j j j i i i

k k k η η ) (

Y(i,j) = a d(i,j) + V(j)

slide-47
SLIDE 47

Census/societal data Geographical data Traffic data In the lack of that ……..topology generators!!

(Using measurement data)

slide-48
SLIDE 48

Effect of complex network topologies on physical processes

Epidemic models Resilience & robustness Avalanche and failure cascades Search and diffusion…..