sheriff a regional pre alert management scheme in data
play

Sheriff: A Regional Pre-Alert Management Scheme in Data Center - PowerPoint PPT Presentation

Sheriff: A Regional Pre-Alert Management Scheme in Data Center Networks Xiaofeng Gao, Wen Xu, Fan Wu, Guihai Chen and Ding-Zhu Du Shanghai Jiao Tong University Background Data Center Networks Background Goal of a data center network


  1. Sheriff: A Regional Pre-Alert Management Scheme in Data Center Networks Xiaofeng Gao, Wen Xu, Fan Wu, Guihai Chen and Ding-Zhu Du Shanghai Jiao Tong University

  2. Background Data Center Networks

  3. Background • Goal of a data center network management system is to be: – Stable – Effective – Robust • Several problems of existing management schemes: – Weakness of a centralized controller – Short-sighted mechanism

  4. Introduction Centralization vs Distribution • Drawbacks of Centralization – Sharply increased response time – Upgraded or heterogeneous components • What we need – Distributed managers – Regional self-automatic control

  5. Introduction Contingency vs Pre-Control • Drawbacks of Contingency – Working after detecting errors – Harmful to device prevention and system maintenance • What we need – Take early warnings – React in advance to avoid congestions

  6. Introduction • Sheriff – Distributed (at end host side) – Pre-alert – Regional self-automatic • Two phases – PREDICTION • ALERT message – MANAGEMENT • VM migration

  7. System Design Two kinds of graphs in a DCN 1. Wired Network Graph 2. Dependency Graph

  8. Problem Formulation Problems & Solutions • Overloaded servers – Migrate VMs – Reshuffle VMs • Congestion – Check QCN – Modify the rate at end host

  9. Problem Formulation Pre-alert & Actions • Monitored servers by shim – Information collection – Prediction by ARIMA model and NN model – Report ALERT value once it exceeds THRESHOLD • Monitored switches by shim – Flow congestion detection – Signal congestion flows

  10. Problem Formulation Pre-alert & Actions • Alert from servers or from ToR switches – VM migration • Alert from outer switches – Flow reroute • Implement flow reroute first • VM migration – More expensive – Slower

  11. Problem Formulation Cost of VM Migration • Six steps – Initialization – Reservation – Iterative copy – Stop&Copy – Commitment – Activation

  12. Problem Formulation Cost of VM Migration • Cost of initialization • Cost of transmission – – Transmission time: – Utilization rate of the bandwidth: • Cost of dependency – – Unit cost per distance: – Physical distance of e:

  13. Problem Formulation Cost of VM Migration • Total cost: • Gz: • Goal: minimize

  14. Problem Formulation Pre-alert Mechanism • Collecting Necessary Information – Workload profile – Normalized to [0, 1]

  15. Problem Formulation Pre-alert Mechanism • Time series prediction – Autoregressive Integrated Moving Average (ARIMA) • Modeling linear, dynamic signals – Nonlinear Autoregressive Neural Network (NARNET) • Modeling nonlinear, dynamic and chaotic • Dynamic Model Selection – For each method f – Choose method f which has the minimum value

  16. Problem Formulation Alert Scheme • Seriousness of the condition • Collect ALERTs – VM Migration

  17. Alert-Migration Algorithm Simplification of VM Migration Algorithm • Cost of migration • Simplification – First step – Second step • All pair shortest path problem • K-median problem – Only related with the source and destination of the migration

  18. Alert-Migration Algorithm Framework • Running periodically T time • Each round – Collect alerts – Select a group of candidate VMs (as sources) – VM Migration • Not all VMs are migrated – Parameters as portion for migration

  19. Alert-Migration Algorithm Select Subroutine • Remove delay-sensitive flows • Pick up as many VMs with lowest value – Dynamic Knapsack Algorithm • If the priority parameter is one – Pick VM with highest ALERT value

  20. Alert-Migration Algorithm Migration • Find optimal pairs: K-center problem – Local Search Algorithm – It has an approximation ratio 3 + 2 / p with time complexity O(n^p) • ACKs from the destination’s delegation node – Enough capacity • REJECTs from the destination’s delegation node – Recalculate possible migration destinations

  21. Evaluation Network Trace Training • Data from ZopleCloud Corp. • Combined model has a smaller minimum square error.

  22. Evaluation Simulation For VM Migration • Fat-tree & Bcube • Workload balancing

  23. Evaluation Simulation For VM Migration • Result & Time Complexity

  24. Conclusion • Sheriff: A fast distributed pre-alert manage- ment scheme in data center network – Monitor locally – Predict possible ALERTs – Apply Flow Reroute / VM migration • Evaluation – Accuracy of the prediction – Efficiency of the migration algorithm

  25. • Thanks for your attention!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend