Classification with Partial Labels Weibin Meng , Ying Liu, Shenglin - - PowerPoint PPT Presentation

classification with partial labels
SMART_READER_LITE
LIVE PREVIEW

Classification with Partial Labels Weibin Meng , Ying Liu, Shenglin - - PowerPoint PPT Presentation

Device-Agnostic Log Anomaly Classification with Partial Labels Weibin Meng , Ying Liu, Shenglin Zhang, Dan Pei Hui Dong, Lei Song, Xulong Luo 2018/6/23 weibin 1 Motivation Architecture of Datacenter Networks Inter-DC Network Core Core


slide-1
SLIDE 1

Device-Agnostic Log Anomaly Classification with Partial Labels

2018/6/23 1 weibin

Weibin Meng, Ying Liu, Shenglin Zhang, Dan Pei Hui Dong, Lei Song, Xulong Luo

slide-2
SLIDE 2

Motivation

2018/6/23 2 weibin

Inter-DC Network ToR Switch Server Aggregation Switch Access Router Core Router IDPS Firewall VPN Load balancer IDPS Firewall VPN Load balancer L3 L2 Core

Architecture of Datacenter Networks

slide-3
SLIDE 3
  • Logs describe some events that KPI curve can’t, such as the root cause.

2018/6/23 weibin 3

Motivation

traffic flow

CPU

utilization

  • Traditional anomaly detection methods usually monitor KPI curves.
  • KPI need network operators select manually.
  • KPI methods can only find anomalous behaviors
  • Logs are most valuable data sources for device management.
slide-4
SLIDE 4

Device logs

2018/6/23 weibin 4

  • Examples of device(switch) log :

Detailed Messages are Semi- structured natural languages provided by device developers

Message types are ambiguous for accurate classification

slide-5
SLIDE 5

Drawbacks in Regular Expression

2018/6/23 weibin 5

Match

Syslog

Ignore Type 1

Configure anomalous regular expressions

Yes No

Operators

Type n

  • Regular Expression is the popular technique for anomalous log classification.
  • Drawbacks:
  • Low generality
  • Labor intensity

RE for Manufacturer A Manufacturer B logs

slide-6
SLIDE 6

Problem Definitions

2018/6/23 weibin 6

slide-7
SLIDE 7

Challenges

2018/6/23 weibin 7

  • Device-agnostic vocabulary
  • Device logs are type- specific and manufacturer- specific.
  • It is hard to fit one classification model for all different device types.
  • Partial labels
  • Network operators only label partial anomalous logs they encountered.
  • Difficult to train a traditional classification model.
slide-8
SLIDE 8

LogClass Design Overview

2018/6/23 8

Historical Logs Real-time Logs Filtering Parameters PU Binary Classifier Vocabulary Feature Vector Multiclass Classifier Anomaly Records Top-n Keywords Filtering Parameters Feature Vector Detect Anomalous Logs Classify Anomalous Logs Alarm Offline Learning Component Online Classification Component

weibin

  • 1. Log Preprocessing
  • 2. Feature vector
  • 3. Anomaly detection
  • 4. Anomaly classification
slide-9
SLIDE 9

Text feature vector

2018/6/23 weibin 9

The universal method to construct a text feature vector is the bag-of-words model.

𝑀1 Interface te-1/1/59 changed state to down 𝑀2

VlanInterface

vlan22 changed state to up 𝑀3 Neighbour vlan23 changed state from Exchange to Loading

Interface changed state to down

VlanInterface Neighbour

from Exchange Loading up 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

𝑀1 𝑀2 𝑀3

bag-of-words vectors: logs:

Vocabulary

Assign weighting values to each component in vectors. (e.g., TF-IDF)

slide-10
SLIDE 10

PU learning

unlabeled data

PU Learning

2018/6/23 weibin 10

  • Different from tradition classification.
  • In our scnario, labelling all existing

anomalous logs is not natural.

  • PU Learning input:
  • Positive set P (Anomalous logs)
  • Unlabeled set U (Unlabeled logs)

: positive data (Gang Niu et al. NIPS’16)

slide-11
SLIDE 11

Evaluation

2018/6/23 weibin 11

Dataset

  • Real-world Switch logs
  • 58 switches types
  • Two-week period
  • 1,758,456 anomalous logs
  • 16,702,547 unlabeled logs

Benchmark methods

  • Labeled-LDA
  • Regular Expression
slide-12
SLIDE 12

Evaluation on PU Learning

2018/6/23 weibin 12

Sampled anomalous logs randomly cross all switch types and assumed they have no labels. PU Learning classifier is more stable than traditional classifier.

slide-13
SLIDE 13

Evaluation on Anomalous Log Classification

2018/6/23 weibin 13

LogClass is more accurate. The overheads of L-LDA and RE are larger than LogClass

slide-14
SLIDE 14

Conclusion

2018/6/23 weibin 14

  • Device-Agnostic vocabulary
  • Partial anomalous logs have labels

Challenges

  • PU learning
  • Simple NLP techniques

LogClass

  • Real-world switch logs.

Evaluation

slide-15
SLIDE 15

Thank you!

mwb16@mails.tsinghua.edu.cn

2018/6/23 weibin 15