data driven learning to predict wide area network traffic
play

Data-driven Learning to Predict Wide Area Network Traffic Nandini - PowerPoint PPT Presentation

Data-driven Learning to Predict Wide Area Network Traffic Nandini Krishnaswamy Lawrence Berkeley National Lab SNTA 2020 1 62% Year- on-Year Growth Log scale 103 PB Mar 19 Network Traffic Growth This diagram illustrates the


  1. Data-driven Learning to Predict Wide Area Network Traffic Nandini Krishnaswamy Lawrence Berkeley National Lab SNTA 2020 1

  2. 62% Year- on-Year Growth Log scale 103 PB – Mar ‘19 Network Traffic Growth – This diagram illustrates the growth rate of traffic on ESnet backbone (The Department of Energy’s dedicated science network). – Projected 62% growth every year. 2

  3. Year 2019 Bandwidth Usage Normalized shows only up to 40% used DOE Networks Link Utilization – Links are designed to be used at 40% capacity for unanticipated traffic surges. – How can we improve utilization? – Proposed solution: Predict future network traffic. 3

  4. Challenges posed by Traffic Prediction – Noisy data – Missing data – Multiple hour forecasts 4

  5. – SNMP data collected at router interfaces – Traffic volume in GBs – 30 second intervals (aggregated to 1 hour intervals) – 1 year in total – 4 Bidirectional links (8 traces) – ESnetTrans-Atlantic links Traffic Data Used 5

  6. – Fourier analysis – Correlation heat map – file:///Users/nandinik/Desktop/2018-Jan-Dec(1).gif Justification of Chosen Links 6

  7. – ARIMA – Autoregressive Integrated Moving Average – Requires stationary series as input (can make series Classical stationary through differencing) – Holt-Winters Time Series – Triple exponential smoothing Algorithms – Smoothing equations correspond to: – Level – Trend – Seasonality 7

  8. Traditional Recurrent Neural Networks (RNNs) – Feedback loop -> during training, RNN will unfold into deep feedforward network – Vanishing gradient problem -> cannot capture long-term dependencies 8

  9. Long Short- Term Memory (LSTM ) Network – Variant of RNN – Memory to track long time period – Can learn long-term dependencies 9

  10. Stacked LSTM (two LSTM Simple LSTM (one LSTM layers) Three LSTM layer) Variants Seq2Seq LSTM 10

  11. – ARIMA: – Inspect AC and PAC plots – Holt-Winters Model – Trial-and-error/grid search Parameters – LSTM – Tested different # of nodes in hidden layers – Tested different activation functions 11

  12. Performance Comparison • Af-Lnd has strong seasonality • Wash-Cern problematic data collection • All LSTM approaches are better • Each link has different behavior 12

  13. – Deploy prediction tools to inform network engineering. Conclusion – Further research: – Extend prediction periods – Experiment with different NN architectures 13

  14. Email me at nk2869@columbia.edu with Thank you! any questions! 14

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend