1
Modeling and Characterization of Large-Scale Wi-Fi Traffic in Public Hot-Spots
Amitabha Ghosh*, R. Jana+, V. Ramaswami+, J. Rowland+, N.K. Shankar+
*Electrical Engineering, Princeton University +AT&T Labs - Research
Large-Scale Wi-Fi Traffic in Public Hot-Spots Amitabha Ghosh * , R. - - PowerPoint PPT Presentation
Modeling and Characterization of Large-Scale Wi-Fi Traffic in Public Hot-Spots Amitabha Ghosh * , R. Jana + , V. Ramaswami + , J. Rowland + , N.K. Shankar + * Electrical Engineering, Princeton University + AT&T Labs - Research 1 Outline
1
*Electrical Engineering, Princeton University +AT&T Labs - Research
2
3
Increasing number of WLAN deployments to meet the growing demand of (mobile) users for wireless access
4
Coffee shops, fast food chains, book stores, hotels,
Data contains:
Session arrivals Connection duration distribution Simultaneously present customer distribution
5
6
7
Arrival rates vary drastically within the same business type Characteristic peaks in means across all categories within
Significantly different weekday and weekend patterns
3 am 6 am 9 am 12 pm 3 pm 6 pm 9 pm 12 am 3 am 6 am 9 am 12 pm 3 pm 6 pm 9 pm 12 am 2 4 6 8 10 12
Two weekdays (15 min bins) Average number of arrivals
Tiny Small Medium Large
Coffee Shops
3 am 6 am 9 am 12 pm 3 pm 6 pm 9 pm 12 am 3 am 6 am 9 am 12 pm 3 pm 6 pm 9 pm 12 am 3 6 9 12 15
Two days (15 min bins) Average number of arrivals
Weekday Weekend
Bookstore/Hotels
8
Coffee shops: typically download a few KB Enterprises: typically download a few MB to a few GB Long tails
9
10
MMPP fails
Polynomial curve fitting to the observed mean
Standard Poisson regression fails
11
Time-dependent deterministic arrival rate Divide time into 3 hour bins I: 8 bins per day Divide each bin into 15 min slots J: 12 slots per bin
12
Polynomial type dependence on bin and slot numbers
First term
Over-a-day mean behavior
Sum terms
Differential effects of specific cluster and slots within it
Last term
Interaction term – differential effect of slot J does not have to be
13
K-means clustering:
Automatic 24 hour wrap-around in clustering
14
3 am 6 am 9 am 12 pm 3 pm 6 pm 9 pm 12 am 1 2 3 4 5 6 7 8 9 One weekday (15 min bins) Average number of arrivals Observed mean arrival rate Model mean arrival rate
Coffee shops: Observed mean arrival rate plotted against the model mean arrival rate; these provide intra--day patterns for a cluster by averaging over its members
15
Mon Tue Wed Thu Fri 2 4 6 8 10 12 14 5 weekdays (15 min bins) Number of arrivals
Observed data Model mean 2.5% quantile 97.5% quantile
Coffee Shops: Model mean arrival rate along with the 97.5% quantile and 2.5% quantile bands plotted against 5 days of validation data for an example coffee shop.
16
17
Distribution time to absorption in a Markov Process Dense in the class of all distributions Exponentially decaying tail asymptotically going to 0 as
Captures both tails and heads, as opposed to Pareto and
18
50 100 150 200 250 300 350 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Connection duration (min) CDF
Observed Model
Phase type distributions were
fit using the EM algorithm A fit of order 5 was found to be adequate Coffee Shops: CDF plot of durations for coffee shops and data (truncated at 6 hours)
19
Non-homogeneous Poisson process (time-dependent
PH-type distribution
Queuing model
20
21
Does not require the system to be empty at some
Simple, transparent, and general
22
No arrivals in (u,t] First arrival occurs at some v in (u,t] Q(u): num of customers who arrive in (u,t] and are still
G(z,u): PGF of Q(u,t) Expected number of arrivals in (0,t] Let
23
24
Mon Tue Wed Thu Fri 3 6 9 12 15 5 weekdays (15 min bins) Number of simultaneously present customers Observed data Model mean, m(t) 2.5% quantile 97.5% quantile
Coffee Shops: Expected number of simultaneously present customers along with the 97.5% quantile and 2.5% quantile bands plotted against 5 days of validation data for an example coffee shop.
25
Examined salient differences w.r.t.
Arrival counts, temporal variations, connection durations, byte
Modeling
Arrival count modeling using statistical clustering and non-
Use of Phase-Type r.v. to model the logarithm of long-tailed
Simultaneously present customer modeling using a
New proof on semi-regenerative argument for the number
26