Robust Traffic Matrix Estimation with Imperfect Information: Making - PowerPoint PPT Presentation

Robust Traffic Matrix Estimation with Imperfect Information: Making Use of Multiple Data Sources Qi Zhao, Georgia Institute of Technology Zihui Ge, AT&T Labs-Research Jia Wang, AT&T Labs-Research Jun (Jim) Xu, Georgia Institute of Technology SIGMETRICS/PERFORMANCE 2006 1

Traffic Matrix (TM) and its usefulness • The aggregate traffic volume for every origin/destination (OD) pair T ij , i, j = ... , useful for – Capacity planning and forecasting – Routing configuration – Network fault/reliability diagnoses – Provisioning for service level agreements (SLA) 2

Existing Approaches • Indirect inference from SNMP link counts and the routing matrix by making statistical assumptions about the traffic matrix elements to be estimated, such as [Vardi:2006, ZRDG:2002, ZRLD:2003, SNCLT:2004] • Direct measurement through – Sampled NetFlow, such as [Feldmann et. al. 2000] – Data streaming algorithms, such as [ZKWX:2005] 3

What inspires this work? • TM can be (and had been) estimated from each of the following two data sources: – Traffic volume at each link reported by SNMP and routing matrix (which router path does an OD flow take?) – Sampled NetFlow records at (possibly a subset of) network ingress points • Our question: how to combine the information at both data sources to obtain more accurate TM estimations? 4

Additional challenges addressed by this work • Partial NetFlow deployment problem: Netflow is available at only a subset of of ingress points. Our solution is an Equiva- lent Ghost Observation (EGO) method that helps blend the gravity model with our statistical model. • Dirty data problem: both the traffic volume and sampled NetFlow data can be dirty or missing. Our idea is to use both data sources as “error correction codes” to each other. • Routing change problem: routing tables change in the middle of a measurement interval. 5

TM estimation with clean and complete data I  X = X + ε X �  Y = HX + N B = AX + ε B �  A is the routing matrix. � B is the link counts; B is the corresponding SNMP link count measurement. X is the traffic matrix organized as a vector; � X is its estimation obtained from sampled NetFlow records. ε X is the measurement noise of sampled NetFlow data. ε B is the measurement noise of SNMP link counts. 6

TM estimation with clean and complete data II • The measurement noises ε X and ε B can faithfully modeled as N (0 , σ 2 i ) and N (0 , µ 2 i ), respectively. • The least-squares (LS) estimator is to minimize || X − � || 2 + || AX − � X B || 2 Σ Γ where Σ 2 = ( σ 2 n ) T and Γ 2 = ( µ 2 1 , σ 2 2 , · · · , σ 2 1 , µ 2 2 , · · · , µ 2 m ) T • The LS estimator is equal to X = ( H T K − 1 H ) − 1 H T K − 1 Y where K is the covariance matrix of N . It is also the best linear unbiased estimator (BLUE) by Gauss-Markov Theo- rem. 7

Technique to Reduce Computational Complexity • Singular-Value Decomposition (SVD) is used to compute the pseudo-inverse. • The number of OD flows could be very large (e.g., several tens of thousands). We want to reduce the dimension of the problem. – only focus on the subvector X L of X where the corresponding OD flows estimation is larger than a predefined threshold T (e.g., 0 . 01% of the total traffic) – treating the remaining subvector X S as known 8

Evaluation • Data gathering method – Traffic matrices : Sampled NetFlow data – Routing matrices : Simulate OSPF routing – Link counts: Project the above traffic matrices on a routing matrix • Performance metric: mean relative error (MRE) equal to � � � � � � � 1 x i − x i � where N T is the number of matrix elements i : x i >T N T x i that are greater than a threshold value T , i.e., N T = |{ x i | x i > T, i = 1 , 2 , · · · , N }| . 9

Noise in NetFlow measurement 0.045 0.04 0.035 0.03 0.025 MRE 0.02 0.015 0.01 raw NetFlow TM 0.005 estimated TM estimated TM w/ complexity reduction 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 scaling factor of NetFlow noise 10

Noise in SNMP measurement 0.0092 0.009 0.0088 0.0086 0.0084 MRE 0.0082 0.008 0.0078 raw NetFlow TM 0.0076 estimated TM estimated TM w/ complexity reduction 0.0074 0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05 noise level of SNMP link counts 11

TM estimation with partial NetFlow coverage • With partial NetFlow coverage, while the same LS and BLUE estimator can still be estimated, it is not a good estimator due to the fact that the probability model is severely under- populated (already observed in [ZRDG:2002]) • Our idea: populate our probability model with the gravity model in [ZRDG:2002], i.e., using estimations from the gravity model as a starting point for TM elements that are not covered by NetFlow observations • Challenge: the gravity model is not a probability model 12

Overview of the Generalized Gravity Model [ZRDG:2002] • Simple gravity model: T i,j ∝ T i, ∗ · T ∗ ,j , resulting in a default estimation T ( g ) to be corrected by SNMP link count observations. Generalized Gravity model = Simple Gravity Model + Side Information (e.g., link classification and routing policy). • The probability model of the gravity model can be implicitly characterized as “the probability model under which the following Tomogravity constrained optimization problem pro- duces a good estimator”: � || ( T − T ( g ) ) / T ( g ) || 2 minimize subject to || AT − B || being minimized We discovered the explicit probability model underlying the gravity model. 13

Equivalent Ghost Observation (EGO) � X = T ( g ) and � • Let � x i − x i ∼ N (0 , v 2 T ( g ) . The i ) where v i ∝ least-squares (LS) estimator of X , which minimizes || X − � || 2 + || AX − � X B || 2 V Γ is exactly the Tomogravity constrained optimization result. • In other words, EGO’s � X are statistically equivalent to the implicit beliefs of the gravity model. 14

Blending EGO’s with NetFlow observations i ∼ N (0 , σ 2 • If a TM element X i is covered by NetFlow, ε X i ); ∼ N (0 , λσ 2 i ) where σ 2 • Otherwise, ε X is the corresponding i i element in T ( g ) • The parameter λ is a normalization factor that captures the relative credibility of an EGO to a NetFlow observation. 15

MRE under different values of λ (20% NetFlow coverage) 0.14 scaling factor of NetFlow noise = 1.0 scaling factor of NetFlow noise = 4.0 0.13 0.12 0.11 0.1 MRE 0.09 0.08 0.07 0.06 0.05 0.1 1 10 100 λ 16

The Weighted CDF of the relative error (20% NetFlow coverage). 1 0.9 0.8 0.7 Fraction of traffic 0.6 0.5 0.4 0.3 estimated TM 0.2 estimated TM w/ complexity reduction EGO amended NetFlow tomogravity 0.1 generalized gravity 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 relative error 17

Impact of partial deployment of NetFlow on traffic matrix estimation (20% NetFlow coverage). 0.2 order actual 0.18 order NetFlow order gravity order |acutal-gravity| 0.16 order |NetFlow-gravity| random 0.14 0.12 MRE 0.1 0.08 0.06 0.04 0.02 0 0 0.2 0.4 0.6 0.8 1 ratio of the known traffic matrix rows 18

Removal of Dirty Data • Dirty Data: Measurement error in SNMP or NetFlow or both due to hardware, software or transmission faults • We can rewrite the previous equations about the observations of NetFlow and link counts X + ε X + ξ X X = � B + ε B + ξ B B = � • We expect � � ε X + ξ X | ξ X i | ≫ | ε X | ξ B j | ≫ | ε B j | = i | , ⇒ ξ ≡ ε B + ξ B 19

Sparsity Maximization • We expect there are only a small number of dirty data. • Minimize || δ || 0 subject to the observation • L 0 norm is not convex and hence hard to minimize – Greedy heuristic algorithm – L 1 norm minimization • Comparing the computed results with 3 . 09 times of the stan- dard deviation of the Gaussian measurement noise to identify and remove the dirty data. 20

Traffic Matrix Estimation with and without Dirty Data 0.035 0.35 0.03 0.3 0.025 0.25 0.02 0.2 MRE MRE 0.015 0.15 0.01 0.1 0.005 0.05 0 0 clean opt dirty prior dirty opt greedy alg L1 min clean opt dirty prior dirty opt greedy alg L1 min 21

Handling of Routing Changes • Assume the routing only changes once. • A 1 X 1 + A 2 X 2 = B •  X 1 = X 1 + ε X 1 �     X 2 = X 2 + ε X 2 � Y = HX + N    B = A 1 X 1 + A 2 X 2 + ε B �  � � X 1 where X = . X 2 22

Weighted CDFs of the relative errors 1 0.9 0.8 0.7 Fraction of traffic 0.6 0.5 0.4 0.3 0.2 SNMP only with routing change SNMP only without routing change 0.1 30% NetFlow with routing change 30% NetFlow without routing change 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 relative error 23

Conclusion: Strength of Combining Multiple Information Sources • Provide a comprehensive formulation and design an algorithm for estimating traffic matrices • Extend the formulation and algorithm to the case where sampled NetFlow only covers partial ingress points • Design two algorithms to identify and remove dirty data in measurements • Develop algorithm to estimate traffic matrices upon routing changes 24

Robust Traffic Matrix Estimation with Imperfect Information: Making - PowerPoint PPT Presentation

Robust Traffic Matrix Estimation with Imperfect Information: Making Use of Multiple Data Sources Qi Zhao, Georgia Institute of Technology Zihui Ge, AT&T Labs-Research Jia Wang, AT&T Labs-Research Jun (Jim) Xu, Georgia Institute of

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

[3] The Matrix What is a matrix? Traditional answer Neo: What is the Matrix? Trinity: The answer

Matrix Multiplication Matrix Multiplication via Matrix-Vector Mult Defn. If matrix A is m n

Collaborating in an Imperfect World: Collaborating in an Imperfect World: Understanding Category

Chapter 29: The Imperfect Subjunctive Chapter 29 covers the following: the formation of the

Imperfect competition with exit/entry Session 13 Imperfect competition Slide 1 P1 SepOct

Imperfect Information Extensive Form Games CMPUT 654: Modelling Human Strategic Behaviour

Traffic Shaping, Traffic Policing Peter Puschner, Institut fr Technische Informatik Traffic

Traffic signal optimization and traffic assignment Traffic signals Traffic signal optimization

A Tale of Two Theories: A Tale of Two Theories: Reconciling Reconciling random matrix theory

Building an IoT Platform with Matrix matthew@matrix.org http://www.matrix.org What is Matrix?

Introductory Matrix Operations Matrix Entries Defn. For matrix A , notation a ij means the en-

Gov 2000: 10. Multiple Regression in Matrix Form Matthew Blackwell Fall 2016 1 / 64 1. Matrix

Liberating Communication with Matrix matthew@matrix.org http://www.matrix.org What is Matrix?

Robust Location and Scatter Estimators Outline for Multivariate Data Analysis Background

Robust Estimation and Generative Adversarial Networks Weizhi ZHU Hong Kong University of Science

Thresh a Data-Directed SNMP Threshold Poller John Sellens Certainty Solutions Inc.

Target system maintenance experience in hot cell at J-PARC Hidetaka Kinoshita, Katsuhiro Haga,

E. Elnahrawy, X. Li, and R. Martin Rutgers U. WLAN-Based Localization Localization in

MIGRATION TO EMVCO 3.0 NFC TECHNICAL III PABLO FUENTES JULY 2019 PUBLIC Agenda

Meeting 20/01/2010 - Outcome PVSS servers (PVSS project + Linux System) + create a

A Knowledge Proxy bridging between SNMP and ACMP Formatvorlage des Untertitelmasters durch

Open Programmable Architecture for Java-enabled Network Devices Tal Lavian Technology Center

No Need to Marry to Change your Name! Attacking Profinet IO Automation Networks Using DCP S