ABSENCE: Usage-based Failure Detection in Mobile Networks Binh - - PowerPoint PPT Presentation

absence usage based failure detection in mobile networks
SMART_READER_LITE
LIVE PREVIEW

ABSENCE: Usage-based Failure Detection in Mobile Networks Binh - - PowerPoint PPT Presentation

ABSENCE: Usage-based Failure Detection in Mobile Networks Binh Nguyen , Zihui Ge, Jacobus Van der Merwe, He Yan, Jennifer Yates Mobicom 2015 1 Silent failures EPC core core RAN 2 Silent failures EPC core core RAN Silent failures:


slide-1
SLIDE 1

ABSENCE: Usage-based Failure Detection in Mobile Networks

Binh Nguyen, Zihui Ge, Jacobus Van der Merwe, He Yan, Jennifer Yates Mobicom 2015

1

slide-2
SLIDE 2

Silent failures

2

EPC core core RAN

slide-3
SLIDE 3

Silent failures

  • Silent failures: service disruptions/outages that are not detected by current

monitoring systems.

  • New features rolled out, bugs on devices, or combination of both.

2

EPC core core RAN

slide-4
SLIDE 4

Silent failures

  • Silent failures: service disruptions/outages that are not detected by current

monitoring systems.

  • New features rolled out, bugs on devices, or combination of both.

2

EPC core core RAN

slide-5
SLIDE 5

Silent failures

  • Silent failures: service disruptions/outages that are not detected by current

monitoring systems.

  • New features rolled out, bugs on devices, or combination of both.

2

EPC core core RAN

Detecting silent failures is challenging!

slide-6
SLIDE 6

Detecting silent failures is difficult - passive network monitoring

3

slide-7
SLIDE 7

Detecting silent failures is difficult - passive network monitoring

  • Drops in traffic/usage on network elements do not imply service disruptions:
  • Load balancing/maintenance activities.
  • Dynamic routing/Self-Organizing Network (SON).

3

Load balancing event

Load Time

expected load actual load

slide-8
SLIDE 8

Detecting silent failures is difficult - passive network monitoring

  • Drops in traffic/usage on network elements do not imply service disruptions:
  • Load balancing/maintenance activities.
  • Dynamic routing/Self-Organizing Network (SON).
  • Key Performance metric Indicators (KPI) may not reflect service issues:
  • E.g., accessibility KPI looks good even when only a subset of users can access the network.

3

Load balancing event

Load Time

expected load actual load

slide-9
SLIDE 9

Detecting silent failures is difficult - passive network monitoring

  • Drops in traffic/usage on network elements do not imply service disruptions:
  • Load balancing/maintenance activities.
  • Dynamic routing/Self-Organizing Network (SON).
  • Key Performance metric Indicators (KPI) may not reflect service issues:
  • E.g., accessibility KPI looks good even when only a subset of users can access the network.

3

Load balancing event

Load Time

expected load actual load

slide-10
SLIDE 10

Detecting silent failures is difficult - active service monitoring

4

EPC core RAN

slide-11
SLIDE 11

Detecting silent failures is difficult - active service monitoring

  • Sending test traffic across the network on all service paths.

4

EPC core RAN

slide-12
SLIDE 12

Detecting silent failures is difficult - active service monitoring

  • Sending test traffic across the network on all service paths.
  • Many types of customer devices, applications, huge geographic environment

to probe.

4

EPC core RAN

Active monitoring does not scale!