 
              Lehrstuhl Netzarchitekturen und Netzdienste Institut für Informatik Technische Universität München Seminar Innovative Internet Technologien und Mobilkommunikation Self-Healing in Self-Organising Networks Oliver Scheit
Self-Organising Networks  Self-Organising Networks in Mobile Communications  Motivation  Self-Configuration  Self-Optimization  Self-Protection  Self-Healing  Detection  Diagnosis  Problems Self-Healing in Self-Organising Networks 2
Motivation  Tendencies from macro to micro/pico/femto cells in networks  To overcome the bandwidth problems  More and smaller cells needed for coverage  Paradigm change for management purposes: • Network configuration: – Move from centralized planning and configuration to decentralized Self Organizing Networks • Network maintenance and optimization: – Cells autonomously gather information and provide it to a management system – This cannot be done manually by personnel any longer Self-Healing in Self-Organising Networks 3
Motivation  Heterogenous networks with GSM, HSPA, LTE  Interference  Knowledge of the network often only by experience  Dependency on individual experts  Typical operater use cases:  Planning  Deployment  Optimisation  Maintenance  Find a cost efficient way to manage the network tasks Self-Healing in Self-Organising Networks 4
Self-Management Domains • Configuration • Resource of network monitoring parameters • Parameter tuning • Network planning Self- Self- Configurati Optimisatio on n Self- Self-Healing Protection • Protection • Error detection against attacks and recovery • Faulty operation Self-Healing in Self-Organising Networks 5
Self-Configuration  System can add and configure new elements/features in run-time  Reduce time until first operation  Improves deployment of new network elements  Automatically install new software  Reduce human involvement Self-Healing in Self-Organising Networks 6
Self-Optimisation  Improve efficiency of the network over manual configuration  Adapting to changing environment  Handover parameters, cell-individual parameters  Neighbor cell list  Monitor the system parameters  Automated parameter tuning  Mobility Load-Balancing Self-Healing in Self-Organising Networks 7
Self-Healing  Detection  Sleeping cells  Performance indicators  Diagnosis  Use network knowledge  Prevent false alarms  Recover/reduce impact on network performance  Cell Outage Compensation  Reconfigure neighbor cells to compensate cell outage Self-Healing in Self-Organising Networks 8
Network Architecture Network NM Management Domain DM DM Management Network NE NE NE Element Where to implement which SON function ? Self-Healing in Self-Organising Networks 9
Challenges of SON-Architecture  Centralised vs distributed functions  Processing power vs bandwidth  Centralised Decision making (more knowledge) vs distributed (deadlocks, scaling)  Reliability & availability  Management Self-Healing in Self-Organising Networks 10
Self-Healing  Mobile networks are large, complicated systems  Prone to faults and inefficient behaviour  Most critical part is the Radio Access Network  Little to no redundancy in base stations  No service for users if one station fails completely  High amount of network elements  Degradation possible in each element  Identification of root cause for triggered alarms is hard  Often requires manual troubleshooting  Long time period of degradation  Significant costs Self-Healing in Self-Organising Networks 11
3GPP Self-Healing  Use-cases derived by 3rd Generation Partnership Project (3GPP)  Self-recovery of NE-software  Self-healing of board faults  Cell Outage Detection  Cell Outage Recovery  Cell Outage Compensation  Return of Cell Outage Compensation Self-Healing in Self-Organising Networks 12
3GPP Self-Healing Process  Self-Healing process  Input monitoring checks for pre-defined conditions, triggers self-healing process  Self-healing process gathers additional system information  Diagnosis of the root cause with the provided information  If root cause can be solved automatically: • recovery action performed • Back up configuration data  Evalution of the new state • Attempt new self-healing iteration • Report the result of the self-healing process • Fall back to backup configuration Self-Healing in Self-Organising Networks 13
Cell Degradation Management  Detection of Cell degradations partly automated  Check for alarms and faults  Reset the cell  After too many resets, operational personnel investigates the cell  If it can‘t be fixed remotely, engineers investigate on site  Often multiple visits requiered  KPI based detection  Investigate key performance indicators (KPI) • E.g. Top 10 cells with dropped calls, change of dropped call rate  KPIs often indicate external influences like interference  Faults also typically related to hardware Self-Healing in Self-Organising Networks 14
Cell Degradation Detection  Detecting unusual behavior of performance indicator  No diagnosis yet  Decide if performance is „healthy“  Performance indicators aquired from  Base station  User equipment  Neighbor cells  Core network elements  KPIs are stochastic variables  Absolute decision what is „healthy“ or not impractical  Use profiles to evaluate operation Self-Healing in Self-Organising Networks 15
Profiles  Absolute Threshold  Indicator should not exceed or fall below certain threshold [1] Figure 1: absolute Threshold Self-Healing in Self-Organising Networks 16
Profiles  Statistical  Values that should be around the statistical mean of the indicater +/- a threshold defined by the operator [2] Figure 2: statistical thresholds Self-Healing in Self-Organising Networks 17
Profiles  Time dependend  Indicators fluctuate time dependend, usually these are dependend of user behaviors [3] Figure 3: absolute Threshold Self-Healing in Self-Organising Networks 18
Cell Degradation Detection  The performance of the Detection method can be evaluated by  The delay between a degradation and its detection  The accurancy false postive/false negative  The processing Overhead Self-Healing in Self-Organising Networks 19
Cell Degradation Diagnosis  Diagnosis usually done manually  Requires knowledge of typical symptom-cause relation  This is often refered to as expert knowledge  SONs have to map the observations to the most possible root causes similar to a way a human solves this problem  different approaches to this problem possible  Rule based systems  Bayesian Networks  Case based reasoning  Neural Networks Self-Healing in Self-Organising Networks 20
Rule Based Systems  If(a)-then(b) style of implementation  Easy to implement  Easy to train/add new rules  Only effective in small deterministic networks  Complex networks require a complex set of rule  Uncertainty to performance indicators Self-Healing in Self-Organising Networks 21
Bayesian Networks  Includes the uncertainty of the performance indicators into the analysis of the root cause  Based on Bayes Theorem P(A|B) = (P(B|A)*P(A)) / P(B)  Can be depicted as an acyclic graph with nodes on different hierarchies  Conditions (global properties)  Root causes  Sypmptoms  Weighted edges represent the probability that a fault causes an observed symptom  Can be build top down with knowledge of the operator  Can evolve over time, starts with limited knowledge Self-Healing in Self-Organising Networks 22
Bayesian Networks Condition Root Root Cause Cause Symptom Symptom Symptom  Faults are rare, their statistical distribution hard to define  Size of conditional probability table grows exponentially with the number of nodes in the network Self-Healing in Self-Organising Networks 23
Case Based Reasoning  Map KPI anomalies to Fault cases  Initially requires expert knowledge  To solve a new problem, knowledge based on the similarity to other already known faults  If the fault could be resolved, add this new fault with the corresponding KPI mapping to the database  If it can‘t be solved, manual troubleshooting is done  The new case will be added to the knowledge base  CBR does not rely on probability Self-Healing in Self-Organising Networks 24
Conclusion  SON are an efficient way to deal with increasingly complex networks  SON Domains cover deployment, configuration and self- healing of network elements  The SON Task can be done centralized by an higher Network Management element or decentralized by the cells themselves  Detection is linked to alarms and performance indicators  Dependent on probabilities  Finding the root cause of a problem is a machine learning problem Self-Healing in Self-Organising Networks 25
Recommend
More recommend