Improving Accuracy in End-to-end Packet Loss Measurement * Joel - - PowerPoint PPT Presentation

improving accuracy in end to end packet loss measurement
SMART_READER_LITE
LIVE PREVIEW

Improving Accuracy in End-to-end Packet Loss Measurement * Joel - - PowerPoint PPT Presentation

Improving Accuracy in End-to-end Packet Loss Measurement * Joel Sommers, Paul Barford, Nick Duffield, Amos Ron University of Wisconsin-Madison * AT&T Labs-Research Background Understanding its basic characteristics is important


slide-1
SLIDE 1

Improving Accuracy in End-to-end Packet Loss Measurement

Joel Sommers, Paul Barford, Nick Duffield, Amos Ron University of Wisconsin-Madison AT&T Labs-Research

* *

slide-2
SLIDE 2

Background

  • Understanding its basic characteristics is important
  • Transport protocol design, throughput modeling, overlay

monitoring and optimization

  • Standard ways to measure packet loss
  • Passive (SNMP

, tcpdump)

  • Active (ping, Poisson modulated probes)

2

slide-3
SLIDE 3

Loss characteristics of interest

mean loss episode duration: ((b-a) + (d-c)) / 2 loss episode frequency (fraction of time queue is congested): ((b-a) + (d-c)) / T

NB: Packets are still transmitted during congestion periods

Q time capacity buffer length queue c d b a

T

3

slide-4
SLIDE 4

Focus of our study

  • How well does traditional Poisson sampling work?
  • What are its limitations? What can be done better?
  • Design new sampling process
  • Theory and heuristics
  • Controlled laboratory evaluation
  • Compare with Poisson sampling

4

slide-5
SLIDE 5

How well does traditional Poisson sampling work?

30 32 34 36 38 40 0.00 0.02 0.04 0.06 0.08 0.10 time (seconds) queue length (seconds) 34 36 38 40 42 44 0.00 0.02 0.04 0.06 0.08 0.10 time (seconds) queue length (seconds) 10 12 14 16 18 20 0.00 0.02 0.04 0.06 0.08 0.10 time (seconds) queue length (seconds)

5

  • Evaluate frequency and duration

estimates

  • Controlled laboratory setting
  • Three kinds of cross traffic
  • Probe rates and packet sizes as

[ZPDS01]

  • Experiment duration (15 min)

should allow frequency estimates to be close to true frequency

CBR Infinite TCP Web-like, self-similar

slide-6
SLIDE 6

Evaluation of traditional Poisson sampling

  • CBR
  • Frequency estimate off by 40%

Duration estimate off by 85%

  • Infinite TCP
  • Very poor frequency estimates

Duration estimates are 0

  • Web-like (table to right)

6

frequency duration (sec) true values

0.0093 0.136

Poisson (10 Hz)

0.0014 0.000

Poisson (20 Hz)

0.0012 0.022

slide-7
SLIDE 7

Lessons and hypotheses

7

  • Poisson sampling is relatively ineffective for

estimating congestion frequency and duration

➡ use multi-packet probes

  • Single packet probes often do not experience loss episodes

➡ use loss and delay correlation heuristics ➡ create sampling process to improve duration estimates

slide-8
SLIDE 8

Multi-packet probes

8

  • Single packet miss

congestion episodes

  • Probes with a few

packets are more likely to see congestion episodes

  • Too many probes distort

measurements

11.10 11.12 11.14 11.16 11.18 11.20 0.0980 0.0990 0.1000 0.1010

  • cross traffic packet

cross traffic loss probe probe loss 15.20 15.22 15.24 15.26 15.28 15.30 0.0980 0.0990 0.1000 0.1010

  • cross traffic packet

cross traffic loss probe probe loss 14.80 14.82 14.84 14.86 14.88 14.90 0.0980 0.0990 0.1000 0.1010 time (seconds)

  • cross traffic packet

cross traffic loss probe probe loss

slide-9
SLIDE 9

Probe process model

  • At the sender
  • Send two multi-packet (3) probes in succession, initiated with

probability r at discrete time slot i

  • Individual probe gives instantaneous measure of congestion
  • Probe pairs used to determine congestion dynamics
  • At the receiver
  • Record time slots as congested (1) or uncongested (0), using

actual packet loss and one-way delay heuristics

  • yi records congestion as two-digit binary number
  • Yi denotes true congestion along the path

9

slide-10
SLIDE 10
  • Assume probes don’t lie ... usually
  • If there is truly congestion (Yi), the probes see the effect
  • If yi is incorrect, assume it is a false negative (yi = 00)
  • yi equals Yi with probability pk, which is independent of i and

depends only on the number k of 1-digits in Yi

  • For basic algorithm, assume
  • p{01,10} = p{11} for consistent estimation of duration
  • p{01,10} = p{11} = 1 for consistent and unbiased frequency

estimation

Key assumptions

10

slide-11
SLIDE 11

One-way delay and congestion heuristics

  • Improve single probe measurement of congestion
  • Probes within τ seconds of true loss ⇒ congestion
  • Probes with OWD ≥ (1-α) OWDmax ⇒ congestion
  • Observations from sensitivity experiments
  • Relationship between larger parameter value and more

congestion inferred

  • Tradeoff between probe rate and parameter settings

11

slide-12
SLIDE 12

New probe model example

12

Red line denotes α OWD threshold heuristic Green areas denote τ loss proximity heuristic 00 1111 0000 00 0111 00 00 yi

time→ time→

slide-13
SLIDE 13
  • zi is a random variable whose value is the first digit
  • f yi
  • M is the total number of probe pairs
  • Estimator is unbiased, and under mild conditions,

consistent

Estimating congestion frequency

13

ˆ F =

  • i

zi/M

slide-14
SLIDE 14
  • Assume we have knowledge of the path at all

possible time slots in our discretization

  • For k=1,2,..., there were exactly jk congestion episodes of

length k

  • Congestion occurred over total of A time slots, A = ∑kjk
  • Total number of congestion episodes is B = ∑ jk
  • Average duration D of a congestion episode is therefore D := A/B

Estimating congestion duration (1)

14

slide-15
SLIDE 15

Estimating congestion duration (2)

Define R:=#{i:yi ∈ {01,10,11}} and S:=#{i:yi ∈ {01,10}} Note that there are B time slots i for which Yi = 01, and also B time slots i for which Yi = 10 Note also that there are exactly A+B time slots i for which Yi ≠ 00 Assuming p{01,10} = p{11}, the estimator for the mean congestion duration is therefore

15

ˆ D := 2 × R S − 1

E(R)/E(S) = p2(A − B) + 2p1B 2p1B

We arrive at E(R)/E(S) = p2(A − B) + 2p1B

2p1B

slide-16
SLIDE 16

Validation of output

  • Monitor results in real-time to check whether

assumptions have been violated and to increase confidence in results

  • Probability of yi = 01 is assumed to be same as yi = 10 —

monitor these rates of occurrence

  • p{01,10} = p{11} for consistent estimation of duration
  • p{01,10} = p{11} = 1 for consistent and unbiased frequency

estimation

16

slide-17
SLIDE 17

Laboratory results summary

  • Implemented new sampling model in a tool called

badabing

  • Experiments in a controlled testbed using a range
  • f probe rates and range of thresholds for inferring

congestion

  • Estimates are often within 25% of actual congestion

frequency and duration values; many within 10%

  • A significant improvement over traditional Poisson sampling

for both frequency and duration estimation

17

slide-18
SLIDE 18

badabing evaluation (CBR, single episode type)

18

loss frequency loss duration r

true badabing true badabing

0.1

0.0069 0.0016 0.068 0.054

0.3

0.0069 0.0065 0.068 0.073

0.5

0.0069 0.0060 0.068 0.051

0.7

0.0069 0.0070 0.068 0.051

0.9

0.0069 0.0078 0.068 0.053

slide-19
SLIDE 19

badabing evaluation (web-like, self-similar traffic)

19

loss frequency loss duration r

true badabing true badabing

0.1

0.0044 0.0017 0.060 0.071

0.3

0.0011 0.0011 0.113 0.143

0.5

0.0114 0.0117 0.079 0.074

0.7

0.0043 0.0039 0.071 0.076

0.9

0.0031 0.0038 0.073 0.062

slide-20
SLIDE 20

Comparing badabing with Poisson probes

20

  • With same probe stream rate for Poisson and

badabing

  • Constant bit rate cross traffic
  • Both frequency and duration estimates are within 7% for badabing;

Frequency estimate off by 40% and duration estimate off by 85% for Poisson

  • Web-like cross traffic
  • Badabing correctly estimates frequency and duration estimate is within 25%;

Each estimate derived from Poisson-modulated probes is at least 80% off

slide-21
SLIDE 21

Summary

  • Simple Poisson sampling is relatively ineffective for

measuring congestion frequency and duration

  • Badabing provides more accurate estimation of

congestion frequency and duration

  • Estimator performance depends only on total number of

probes sent, not on sending rate

  • Simple validation methods for measurement output
  • Accuracy improvements (and basic assumptions) validated in

a laboratory testbed

21

slide-22
SLIDE 22

the end

http://wail.cs.wisc.edu/