Self-similar traffic 1 Self-similarity 2 Aggregate traffic - - - PowerPoint PPT Presentation

self similar traffic
SMART_READER_LITE
LIVE PREVIEW

Self-similar traffic 1 Self-similarity 2 Aggregate traffic - - - PowerPoint PPT Presentation

Self-similar traffic 1 Self-similarity 2 Aggregate traffic - exact self-similarity Intuition: self-similar processes look the same at all (i.e., over a wide range of) time scales Def.: A stationary process X = (X k : k > 1) is called


slide-1
SLIDE 1

1

Self-similar traffic

slide-2
SLIDE 2

2

Self-similarity

slide-3
SLIDE 3

3

Aggregate traffic - exact self-similarity

Intuition: self-similar processes “look the same” at all (i.e., over a wide range of) time scales Def.: A stationary process X = (Xk : k > 1) is called exactly self-similar (with self-similarity parameter H, 0 < H < 1), if for all m > 1, [LTWW94] LAN traffic is consistent with exact self-similarity

) ( 1 m H X

m X

=

slide-4
SLIDE 4

4

Aggregate traffic - exact self-similarity

Intuition: self-similar processes “look the same” at all (i.e., over a wide range of) time scales Def.: A stationary process X = (Xk : k > 1) is called exactly self-similar (self-similarity parameter H, 0 < H < 1), if for all m > 1,

) ( 1 m H X

m X

=

∞ →

− −

m as cm ~ ) X var(

2 H 2 ) m (

slide-5
SLIDE 5

5

Variance time plot

slide-6
SLIDE 6

6

Network topology 1989

slide-7
SLIDE 7

7

Network topology 1992

slide-8
SLIDE 8

8

slide-9
SLIDE 9

9

Self-similarity

Just a mathematical concept? What does it mean?

slide-10
SLIDE 10

10

Self-similarity via heavy tails

Math: Superposition of independent ON/OFF sources is self-similar, if durations of periods are heavy- tailed with infinite variance Superposition of independent ON/OFF sources is short-range dependent, if durations of periods are light-tailed

slide-11
SLIDE 11

11

Superposition of sources

time time time time

slide-12
SLIDE 12

12

Covariance

Given two random variables x, y with means µx and

µy, their covariance is:

Their correlation coefficient is the normalized

covariance

) y ( E ) x ( E ] xy [ E )] y )( x [( E ) y , x ( Cov

y x 2 xy

− = µ − µ − = σ =

y x 2 xy xy

) y , x ( Cor

σ σ σ

= ρ =

slide-13
SLIDE 13

13

Short-Range Dependence

A stationary process X = (Xk : k > 1) with mean y,

variance ρ2 and autocorrelation function X r(k), k > 1, is said to exhibit short-range dependence (SRD) if there exists 0 < ρ < 1 and τ > 0 with

Important feature: Autocorrelations decay (at least)

exponentially fast for large lags k

∞ → → τρ− k as ) k ( r

k

slide-14
SLIDE 14

14

Poisson process: a SRD processes

slide-15
SLIDE 15

15

Short-range dependence

The aggregated process X(m) = (X(m)(k); k > 1)

tends to second-order white noise, as for all k > 1, where r(m) denotes the autocorrelation function of X(m)

The variance-time function, i.e., the variance of

the sample mean, as a function of m, satisfies:

∞ → → k as ) k ( r

) m (

∞ → k

∞ →

m as cm ~ ) X var(

1 ) m (

slide-16
SLIDE 16

16

Short-range dependence

Key features

Short range dependence = finite correlation length Fluctuations over narrow range of time scales Plotting var(X(m)) vs. m on log-log scale shows linear

relationship for large m, with slope –1

slide-17
SLIDE 17

17

Light-tailed distributions

X random variable with distribution function F. F is said to be light-tailed if there exists c > 0 Important feature: tails decay exponentially fast

for large x; i.e.,

∞ → → − x as e )) X ( F 1 (

cx

∞ → − = >

x as e ~ ) X ( F 1 ] x X [ P

x

slide-18
SLIDE 18

18

Light-tailed distributions

Examples: Exponential, Normal, Poisson, Binomial Key features:

F has limited variability F is tightly concentrated around its mean F has finite moments P[X > x] vs. x on log-linear scale is linear for large x

slide-19
SLIDE 19

19

Summary of light-tails and SRD

Distributional assumptions

Light-tails imply limited variability in space

Assumptions about temporal dynamics

SRD implies limited variability over time

Common characteristics of traditional traffic

processes

Limited burstiness (in time and space)

slide-20
SLIDE 20

20

Long-range dependence

A stationary process X = (Xk : k > 1) with mean y,

variance ρ2 and autocorrelation function X r(k), k > 1, is said to exhibit long-range dependence (LRD) if for some 1/2 < H < 1 and H is called the Hurst parameter

Important features of LRD

Infinite correlation length Fluctuations over all time scales No characteristic time scale

∞ →

k as ck ~ ) k ( r

2 H 2

slide-21
SLIDE 21

21

Long-range dependence

The aggregated process X(m) = (X(m)(k); k > 1)

tends to non-degenerate limiting process, for for m, k sufficiently large

The variance-time function satisfies:

∞ → → k as ) k ( r ) k ( r

) m (

∞ →

m as cm X

H m 2 2 ) (

~ ) var(

slide-22
SLIDE 22

22

Heavy-tailed distributions

X random variable with distribution function F F is said to be heavy-tailed if there exists c > 0 Important features:

1 < α < 2, X has finite mean but infinite variance Heavy-tailed implies high variability Tail decays like a power, hence power-law dist. Plotting P[X > x] vs. x on log-log scale is linear for large x with slope α

∞ → > = −

α −

x as cx ~ ] x X [ P ) X ( F 1

slide-23
SLIDE 23

23

Detour Characteristics of modem calls (~ 1999)

slide-24
SLIDE 24

24

Interarrival times of modem calls

slide-25
SLIDE 25

25

Durations of modem calls

slide-26
SLIDE 26

26

What about pkts from modem calls

slide-27
SLIDE 27

27

Detour Characteristics of Web (~ 2000)

slide-28
SLIDE 28

28

General characteristics of WWW transfers

slide-29
SLIDE 29

29

General characteristics of WWW transfers

slide-30
SLIDE 30

30

General characteristics of WWW transfers

slide-31
SLIDE 31

31

# of TCP connections per session

slide-32
SLIDE 32

32

Flow durations

slide-33
SLIDE 33

33

Why is LAN traffic self-similar

Possible explanations:

Network? User behavior?

User behavior:

Examine characteristics of individual src-dst pairs Clustering of packets between src-dst pairs Define clusters as ON/OFF periods Distribution of ON/OFF periods

slide-34
SLIDE 34

34

SRC/DST traffic matrix

slide-35
SLIDE 35

35

Texture plot

slide-36
SLIDE 36

36

Tex- ture plot

slide-37
SLIDE 37

37

Grouping IP packets into flows

Group packets with the “same” address

Application-level: single transfer web server to client Host-level: multiple transfers from server to client Subnet-level: multiple transfers to a group of clients

Group packets that are “close” in time

60-second spacing between consecutive packets

flow 1 flow 2 flow 3 flow 4

slide-38
SLIDE 38

38

ON/OFF periods

slide-39
SLIDE 39

39

ON/OFF periods are heavy-tailed

slide-40
SLIDE 40

40

Self-Similarity via heavy tails

Math:

Superposition of independent ON/OFF sources is self- similar, if durations of periods are heavy-tailed with infinite variance

Statistical analysis of LAN traffic traces:

Users are ON/OFF ON periods are heavy-tailed (file sizes) OFF periods are heavy-tailed (think times) Distributions of ON/OFF-periods show heavy tails

with infinite variance

slide-41
SLIDE 41

41

slide-42
SLIDE 42

42

Wide area network traffic

How are WANs different from LANs

Network effects matter: roundtrip delays, queuing, flow control Many more source destination pairs (not continuously active)

WAN traffic is not exactly self-similar [PF95, FGWK98]

Generalize notion of self-similarity Examine nature of traffic at application/connection layer Beyond self-similarity (where are the network effects)

slide-43
SLIDE 43

43

Asymptotic self-similarity

Def.: A stationary process X = (Xk : k > 1) is called asymptotically self-similar (with self-similarity parameter H, 0 < H < 1), if for all large enough m, Observations:

Asymptotic self-similarity is equivalent to long-range

dependence of infinite correlation length

Asymptotic self-similarity does not specify the small-time

scale behavior of a process

) ( 1 m H X

m X

slide-44
SLIDE 44

44

Structural model of WAN traffic

Cox‘s construction

M/G/oo model or birth-immigration process

Poisson session arrivals Session durations or session sizes are heavy tailed with

infinite variance (i.e., 1 < = alpha < 2)

Traffic within session is generated at constant rate The resulting process is (asymptotically second-order)

self-similar with self-similarity parameter

2 / ) 3 ( H α − =

slide-45
SLIDE 45

45

Structural model of WAN

Telnet and FTP sessions

Extract session-level information from WAN traces Test if arrivals are consistent with Poisson Test if arrivals are consistent with independence

slide-46
SLIDE 46

46

Dataset WAN traffic LBL/WRL

slide-47
SLIDE 47

47

Test for Poisson arrivals

slide-48
SLIDE 48

48

Test for heavy tail

slide-49
SLIDE 49

49

Implications (shaded 2% ,black 0.5% )

slide-50
SLIDE 50

50

Self-similar?

slide-51
SLIDE 51

51

Self-similar?

slide-52
SLIDE 52

52

Mathematical results

LAN:

Superposition of independent ON/OFF sources ON/OFF periods are heavy-tailed with infinite variance

Packets per unit time is exactly self-similar WAN:

Sessions arriving in a Poisson manner sizes (# packets) are heavy-tailed with infinite variance

Packets per unit time is asymptotically self-similar

slide-53
SLIDE 53

53

Statistical analysis of WEB

Before Web (1994): Self-similarity at packets per time unit

Poisson arrivals at application layer (FTP, Telnet) Heavy-tailed session durations/sizes

Since Web (1995)????

Arrivals of User session # of Web requests per session

  • Dist. of # of bytes, pkts, duration per request?
slide-54
SLIDE 54

54

Web client trace analysis 1995

Modified Web browser (Mosaic) Population: students at BU Duration: 21 Nov 94 to 8 May 95

Sessions 4,700 Users 591 URLs Requested 575,775 Files Transferred 130,140 Unique Files Requested 46,830 Bytes Requested 2,713 MB Bytes Transferred 1,849 MB Unique Bytes Requested 1,088 MB

slide-55
SLIDE 55

55

What about WEB traffic

slide-56
SLIDE 56

56

Durations of WEB transfers???

slide-57
SLIDE 57

57

File size of WEB transfers???

slide-58
SLIDE 58

58

Unique files vs. files transfered?

slide-59
SLIDE 59

59

What about the available files

slide-60
SLIDE 60

60

What about off times?

Web page TCP 1 TCP 2 TCP 3 TCP 4 HTTP Request 1 HTTP Request 2 HTTP Request 3 HTTP Request 4 HTTP Request 4 Users ….

slide-61
SLIDE 61

61

What about the WEB

slide-62
SLIDE 62

62

Interarrival times of URL requests

slide-63
SLIDE 63

63

Statistical analysis of WAN traffic Traces

Before Web (1994): Self-similarity at packets per time unit

Poisson arrivals at application layer (FTP, Telnet) Heavy-tailed session durations/sizes

Since Web (1995): Self-similarity at # of TCP connections per time unit

Poisson arrivals of User session (modem session) Heavy-tailed # of TCP connections per session