Observed structure of addresses in IP traffic
- Eddie Kohler, Jinyang Li, Vern Paxson, Scott Shenker
ICSI Center for Internet Research Thanks to David Donoho and Dick Karp
1
Observed structure of addresses in IP traffic Eddie Kohler, - - PowerPoint PPT Presentation
Observed structure of addresses in IP traffic Eddie Kohler, Jinyang Li, Vern Paxson, Scott Shenker ICSI Center for Internet Research Thanks to David Donoho and Dick Karp 1 Problem How can we model the set of destination IP
Observed structure of addresses in IP traffic
ICSI Center for Internet Research Thanks to David Donoho and Dick Karp
1
Problem
some link? (And does it matter?) Example from a 4-hour trace at a university access link:
255.255.255.255 192.0.0.0 128.0.0.0 64.0.0.0 0.0.0.0
In particular, can we model how the addresses aggregate? We call this address structure.
aggregate-based congestion control, realistic sets of addresses for simulations, . . .
2
Results
prefix aggregates, such as /16s.
modeled by a multifractal Cantor set construction with two parameters. The model captures both fractal metrics and metrics we developed for address structures.
Structural metrics differ between sites. At a given site, these metrics are stable over short time scales. New communication dynamics, such as worm propagation, show up in the metrics.
3
Outline
4
Terminology
N ≤ 232 by definition; N ≪ 232 for all our traces
prefix (0 ≤ p ≤ 32) Also called a /p 1.0.0.0 and 1.99.130.14 are in the same /8, but different /10s
5
Traces
Description
∆T
# pkts N U1 large university access link ∼ 4 h 62M 69,196 U2 large university access link ∼ 1 h 101M 144,244 A1 ISP ∼ 0.6 h 34M 82,678 A2 ISP 1 h 29M 154,921 R1 link from regional ISP 1 h 1.5M 168,318 § R2 link from regional ISP 2 h 1M 110,783 § W1 large Web site access link ∼ 2 h 5M 124,454
Most anonymized while preserving prefix and class relationships § means sampled (1 in 256)
6
Does address structure matter?
Accounting, fairness, congestion control . . .
Packet counts per address: probably a heavy-tailed distribution Addresses per aggregate = address structure Correlation
count distribution Medium scales are most interesting (/16s and thereabouts)
7
R1 packet count distributions
10-5 10-4 10-3 10-2 0.1 1 106 105 104 1000 100 10 1 Complementary CDF Packet count slope -1.46 slope -1.16 slope -1.13 R1 16-aggregates R1 addresses R1 flows
8
Semi-experiments
factors impact aggregate packet counts
Replace the (heavy-tailed) per-address packet count distribution with a uniform distribution over [0, 17.54]
Replace address structure with a uniform random distribution over the entire IP address space
Permute per-address packet counts among the active addresses
9
Address structure matters most
10-3 10-2 0.1 1 106 105 104 1000 100 10 1 Complementary CDF 16-aggregate packet count R1 Permuted counts Random counts Random addresses
10
Tour of U1’s address structure
192.0.0.0 128.0.0.0 64.0.0.0 0.0.0.0 200.0.0.0 198.0.0.0 196.0.0.0 194.0.0.0 192.0.0.0 195.192.0.0 195.176.0.0 195.160.0.0 195.144.0.0 195.128.0.0 195.190.0.0 195.189.128.0 195.189.0.0 195.188.128.0 195.188.0.0
11
Self-similarity?
Visually “self-similar” characteristics
Treat an address structure as a subset of the unit interval Fractal dimension D ∈ [0, 1]?
12
Fractal dimension for address structure
Corresponds nicely to prefix aggregation
n32 = N np ≤ np+1 ≤ 2np each /p contains and is covered by 2 disjoint /(p + 1)s
p→∞
log np p log 2 But p ≤ 32 here, and expect sampling effects for high p Examine medium p to see if the limit exists
13
log np is linearly related to p at medium scales
105 104 103 100 10 1 4 8 12 16 20 24 28 32 np Prefix length p D = 0.79 R1 A2 U1
14
Multifractality
Same scaling behavior everywhere Not what we saw in the tour
(different local scaling behavior) Binned approximation (Histogram Method) If multifractal, spectrum will cover a wide range of scaling exponents
15
Address structure is multifractal at /16
0.4 0.6 0.8 1 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 f16(x) Scaling exponent R1 Cantor dust with D = 0.79
16
Multifractal model
Repeatedly remove middle subinterval with proportion h = 1 − 21−1/D
Distribute a unit of “mass” between subintervals; left gets m0, middle gets 0 (removed), right gets m2 = 1 − m0 Produces a sequence of measures µk that weakly converge to µ Sample an address with probability equal to its measure Result: different local scaling behavior
17
The model fits well
0.4 0.6 0.8 1 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 f16(x) Scaling exponent R1 Cantor dust (D = 0.79, m0 = 0.5) R1 Model (D = 0.79, m0 = 0.80)
18
The model fits well
0.4 0.6 0.8 1 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 f16(x) Scaling exponent A2 A2 Model (D = 0.80, m0 = 0.70)
19
Why multifractal?
Recursive subdivision plus a rule for distributing mass
Pure speculation! ICANN allocates short prefixes to providers Providers allocate longer prefixes to their customers All parties might allocate basically from left to right
20
Does the multifractal spectrum matter?
U1
255.255.255.255 192.0.0.0 128.0.0.0 64.0.0.0 0.0.0.0
U1 Model
255.255.255.255 192.0.0.0 128.0.0.0 64.0.0.0 0.0.0.0
How do we know whether we’ve captured relevant properties?
Contrast metrics among traces Compare with model
21
Active aggregate counts: np and γp
If N = 216 and n16 = 1, addresses are closely packed If N = 216 and n16 = 216, addresses are well spread out Useful for algorithms keeping track of aggregates—shows how many aggregates there tend to be
N =
1≤p<32 γp
22
γp
1.2 1.4 1.6 1.8 2 4 8 12 16 20 24 28 32 γp Prefix length p R1 A2 U1 W1
23
Models’ γp
1.2 1.4 1.6 1.8 2 4 8 12 16 20 24 28 32 γp Prefix length p R1 R1 model A2 A2 model
24
Discriminating prefixes
namely a. Example with 4-bit addresses:
Prefix length 4 4 3 2 4 4 1 2 3 4
If many addresses have d.p. < 20, say, then addresses are well separated How depopulated do aggregates become?
25
Discriminating prefixes: πp
πp = N
Turns discriminating prefixes into a metric
26
πp
10-5 10-4 10-3 10-2 0.1 1 4 8 12 16 20 24 28 32 CDF of πp Prefix length p R1 A2 U1 W1
27
Models’ πp
10-5 10-4 10-3 10-2 0.1 1 4 8 12 16 20 24 28 32 CDF of πp Prefix length p R1 R1 Model A2 A2 Model
28
Aggregate population distribution
active addresses per aggregate Expect a wide range of variation, just as with the other metrics
29
Aggregate population distribution
10-3 10-2 0.1 1 105 104 1000 100 10 1 Complementary CDF Aggregate population /16s /8s R1 A2 U1 W1
30
Models’ aggregate population distribution
10-3 10-2 0.1 1 105 104 1000 100 10 1 Complementary CDF Aggregate population /16s /8s R1 R1 Model A2 A2 Model
31
A tough metric
R1, W1 match well, A2, U1 do not Significant aggregation in A2, U1 at long prefixes . . . ?
Heck, match “generalized discriminating prefixes”—d.p.s for aggregates—as well Call this the “Match-DP” model How well does this do?
32
Match-DP fails aggregate population distribution
10-3 10-2 0.1 1 105 104 1000 100 10 1 Complementary CDF Aggregate population /16s /8s R1 R1 Model R1 DP-Model A2 A2 Model A2 DP-Model
33
Another tough metric: The multifractal spectrum
0.4 0.6 0.8 1 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 f16(x) Scaling exponent R1 R1 Model R1 DP-Model
34
Properties of γp: Sampling effects?
N is effectively a sample size How does the shape of the γp curve depend on N?
24 hours → 6 minutes; N = 161,560 → 11,838
35
Shape of γp similar for wide range of sample sizes
1.2 1.4 1.6 1.8 2 4 8 12 16 20 24 28 32 γp Prefix length p 24 hours, N = 161,560 U1: 4 hours, N = 69,196 40 minutes, N = 26,108 6 minutes, N = 11,838
36
Shape of γp similar for wide range of sample sizes
1.2 1.4 1.6 1.8 2 4 8 12 16 20 24 28 32 γp Prefix length p 24 hours, N = 161,560 U1: 4 hours, N = 69,196 40 minutes, N = 26,108 6 minutes, N = 11,838 R1 A2
37
Short-term stability?
Plot maximum, minimum, and mean γp over all sections R1, A2, and U2; sections last about 6–7 minutes each
38
Shape of γp relatively stable over short time scales
1.2 1.4 1.6 1.8 2 4 8 12 16 20 24 28 32 γp Prefix length p R1 sections A2 sections U2 sections
39
New communication dynamics?
as worm propagation? Expect worm propagation to significantly change the destination addresses visible at an access link, since every possible internal address will be contacted. Not the best detection metric . . .
after Code Reds 1 and 2 Consider γp and aggregate population distribution
40
Shape of γp changes during worm propagation
1.2 1.4 1.6 1.8 2 4 8 12 16 20 24 28 32 γp Prefix length p 18 Jul, pre-Code Red 19 Jul, Code Red 1 3 Aug, pre-Code Red 2 4 Aug, Code Red 2
41
10-4 10-3 10-2 0.1 1 104 1000 100 10 1 Complementary CDF Packet count of 24-aggregates 18 Jul, pre-Code Red 19 Jul, Code Red 1 3 Aug, pre-Code Red 2 4 Aug, Code Red 2
42
Address stability
. . . in sections 1, 2, and 3? and so forth Indicates how quickly address sets change
Every section contains nS short-lived and nL long-lived Addresses survive into the next section with probabilities pS and pL (where pL > pS) How well does this model match?
43
U2, 6-minute sections
10000 15000 20000 25000 30000 35000 40000 1 2 3 4 5 6 7 8 9 Surviving addresses Number of sections real data 9779.9*0.937x-1 + 22260.1*0.255x-1
44
Other time scales
45
Conclusions
multifractal Captures some aggregation behavior better than models built using metrics from real data
Metrics differ between sites, are stable over short time scales
46
Future work
47
Analysis details
n[A] is number of active addresses in intersection of sections A.
48