SLIDE 1

Practical Anomaly Detection based on Classifying Frequent Traffic Patterns

Ignasi Paredes-Oliva¹, Ismael Castell-Uroz¹, Pere Barlet-Ros¹, Xenofontas Dimitropoulos², Josep Solé-Pareta¹

¹ UPC BarcelonaTech, Spain ({iparedes,icastell,pbarlet,pareta}@ac.upc.edu)
² ETH Zürich, Switzerland (fontas@tik.ee.ethz.ch)

15th IEEE Global Internet Symposium (GI), Orlando, FL, United States
March 30th, 2012

SLIDE 2

Introduction Related Work Our Proposal Performance Evaluation Conclusions

Outline

1. Introduction
2. Related Work
3. Our Proposal
4. Performance Evaluation
5. Conclusions


SLIDE 4

The problem

• Growth of cyber-attacks¹
• Anomaly detection systems are not widely deployed (e.g., too many false positives, complex black boxes)
• Anomaly classification and root-cause analysis are still open issues (e.g., manual analysis is error-prone, complex, slow and expensive²)

Our goal

• Simple system for automatic anomaly detection and classification
• High classification accuracy and low false positives
• Conceptually simple working scheme

¹ Kim-Kwang Raymond Choo, "The cyber threat landscape: Challenges and future research directions," Computers & Security, 2011.
² M. Molina et al., "Anomaly Detection in Backbone Networks: Building a Security Service Upon an Innovative Tool," TNC 2010.



SLIDE 7

Related work and contributions

• Many proposals exist on anomaly detection
• Anomaly classification has been only marginally studied

Contributions of this paper:

• Novel approach for automatic anomaly detection and classification based on classifying frequent traffic patterns
• Evaluated using data from two large networks
• High classification accuracy and low false-positive ratio
• System deployed in the Catalan NREN



SLIDE 10

System Overview

Two phases:

• Offline: build a model to classify anomalies (Frequent Item-Set Mining → Feature Extraction → Machine Learning → Model)
• Online: use the model to classify incoming traffic (Frequent Item-Set Mining → Feature Extraction → Classification)
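The two-phase scheme can be sketched as a pair of functions (a minimal Python sketch; the function and parameter names are illustrative, not from the paper):

```python
# Minimal sketch of the two-phase scheme. The stage functions (mine,
# extract, learn) are placeholders for the components named above;
# none of these names come from the authors' implementation.

def offline_phase(training_flows, mine, extract, learn):
    """Offline: mine frequent item-sets from labeled traffic, extract
    their features, and train a classification model."""
    itemsets = mine(training_flows)            # frequent item-set mining
    features = [extract(i) for i in itemsets]  # feature extraction
    return learn(features)                     # machine learning -> model

def online_phase(live_flows, mine, extract, model):
    """Online: run the same mining and feature extraction on incoming
    traffic, then classify each item-set with the trained model."""
    itemsets = mine(live_flows)
    return [(i, model(extract(i))) for i in itemsets]
```

The key design point is that both phases share the mining and feature-extraction stages, so the model sees the same representation at training and classification time.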

SLIDE 11

Frequent Item-Set Mining

• Originally used in market-basket analysis to find products that are frequently bought together and make appealing offers (e.g., beer and chips)
• What is an item-set? A compact summarization of elements occurring together
• Why is it useful for anomaly detection? Many attacks involve a high volume of flows with common features (e.g., a Port Scan produces many flows with the same sIP and dIP)
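A toy illustration of the mining step over flow records (an exhaustive enumeration for clarity; real miners such as Apriori or FP-growth prune the search space, and nothing here is the paper's implementation):

```python
from collections import Counter
from itertools import combinations

def frequent_itemsets(flows, min_support):
    """Naive frequent item-set mining over flow records.

    Each flow is a dict of header fields; an item is a (field, value)
    pair, and an item-set is any combination of items that occurs in
    at least `min_support` flows. Exhaustive enumeration is exponential
    in the number of fields; this is a toy for clarity only."""
    counts = Counter()
    for flow in flows:
        items = sorted(flow.items())  # canonical order, so subsets match
        for r in range(1, len(items) + 1):
            for subset in combinations(items, r):
                counts[subset] += 1
    return {s: c for s, c in counts.items() if c >= min_support}
```

On a scan-like batch of flows sharing sIP and dIP, the pair {sIP, dIP} surfaces as a frequent item-set while the varying dPort values do not.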


SLIDE 15

Frequent Item-Set Mining

Port Scan example:

               sIP          dIP            sPort   dPort
  1st flow     X.77.17.59   Y.88.243.209   41393   21209
  2nd flow     X.77.17.59   Y.88.243.209   41393   54766
  3rd flow     X.77.17.59   Y.88.243.209   41393   31448
  4th flow     X.77.17.59   Y.88.243.209   41393   58514
  ...
  2911th flow  X.77.17.59   Y.88.243.209   41393   48732

  item-set     X.77.17.59   Y.88.243.209   41393   *

Further information is needed per item-set in order to classify it.
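Collapsing such a group of flows into one item-set amounts to keeping the fields with a single common value and wildcarding the rest (illustrative sketch; `summarize` is not a function from the paper):

```python
def summarize(flows, fields=("sIP", "dIP", "sPort", "dPort")):
    """Collapse a group of flows into one item-set: a field shared by
    every flow keeps its value, a varying field becomes a wildcard '*'."""
    itemset = {}
    for f in fields:
        values = {flow[f] for flow in flows}
        itemset[f] = values.pop() if len(values) == 1 else "*"
    return itemset
```

Applied to the Port Scan flows above, sIP, dIP and sPort survive while dPort collapses to `*`, exactly as in the item-set row of the table.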


SLIDE 18

Feature Extraction

Features computed for each frequent item-set:

  Feature                    Value if defined    Value if undefined
  Src IP / Dst IP            True                False
  Src / Dst Port             port number         NaN
  Protocol                   protocol number     NaN
  URG/ACK/PSH/RST/SYN/FIN    True                False
  Bytes per Packet (bpp)     #Bytes / #Packets
  Packets per Flow (ppf)     #Packets / #Flows
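A sketch of this feature computation for one item-set (the TCP-flag features are omitted for brevity; the function name and signature are illustrative, not the paper's):

```python
import math

def extract_features(itemset, n_bytes, n_packets, n_flows):
    """Feature vector for one frequent item-set, per the table above.
    A field the item-set wildcards ('*') or lacks is 'undefined':
    boolean features become False, numeric ones become NaN."""
    def defined(field):
        return itemset.get(field, "*") != "*"

    return {
        "sIP_defined": defined("sIP"),
        "dIP_defined": defined("dIP"),
        "sPort": itemset["sPort"] if defined("sPort") else math.nan,
        "dPort": itemset["dPort"] if defined("dPort") else math.nan,
        "proto": itemset["proto"] if defined("proto") else math.nan,
        "bpp": n_bytes / n_packets,   # bytes per packet
        "ppf": n_packets / n_flows,   # packets per flow
    }
```

The ratios bpp and ppf are always defined since they come from the aggregate counters of the flows matching the item-set.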

SLIDE 19

Building the classifier (offline)

• Goal: build a model from manually labeled frequent item-sets
• Output classes:
  - Anomalous: DoS (DDoS, SYN/ACK/UDP/ICMP floods), Network Scans (ICMP/Other Network Scans), Port Scans (SYN/ACK/UDP Port Scans)
  - Normal: legitimate traffic
  - Unknown: not normal and did not fit any anomalous class
• Labeled item-sets + features + output classes are given to the C5.0 machine-learning algorithm → output: classification model
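The paper trains with C5.0; as a self-contained illustration of the same idea, here is a toy learner that fits a single-split decision stump on labeled feature vectors (the real system grows a full decision tree, and none of these names come from the paper):

```python
from collections import Counter

def train_stump(samples):
    """Toy stand-in for C5.0: given labeled samples
    [(features_dict, label), ...] with numeric features, pick the
    (feature, threshold) split that classifies the most training
    samples correctly, and return it as a classifier function."""
    def majority(rows):
        return Counter(lbl for _, lbl in rows).most_common(1)[0][0]

    best = None
    for feat in samples[0][0]:
        for thr in sorted({f[feat] for f, _ in samples})[:-1]:
            left = [s for s in samples if s[0][feat] <= thr]
            right = [s for s in samples if s[0][feat] > thr]
            score = sum(lbl == majority(left) for _, lbl in left) + \
                    sum(lbl == majority(right) for _, lbl in right)
            if best is None or score > best[0]:
                best = (score, feat, thr, majority(left), majority(right))

    _, feat, thr, lo_class, hi_class = best
    return lambda features: lo_class if features[feat] <= thr else hi_class
```

C5.0 applies this split-selection step recursively (using information gain rather than raw accuracy), which yields trees like the excerpt on the next slide.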


SLIDE 22

Classifying an item-set (online)

The model is used to classify each incoming item-set.

[Decision-tree excerpt: from the root, successive splits on bpp (≤ 29 vs. > 29), proto (≤ 6 vs. > 6), sIP_defined (false vs. true) and ppf (≤ 1.04 vs. > 1.04) lead to leaf classes such as Port Scan and DDoS; other subtrees elided.]
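One plausible reading of the tree excerpt as nested tests (the split ordering and leaf assignments are guesses from the slide; the trained model's actual structure may differ, and elided subtrees fall through to "Unknown" here):

```python
def classify_itemset(f):
    """Hand-coded rendering of the decision-tree excerpt above.
    `f` is a feature dict (bpp, proto, sIP_defined, ppf)."""
    if f["bpp"] <= 29:                 # tiny packets: scan/flood territory
        if f["proto"] <= 6:            # TCP (protocol 6) or below
            if f["sIP_defined"]:
                if f["ppf"] <= 1.04:   # about one packet per flow
                    return "Port Scan"
                return "DDoS"
            return "Unknown"           # subtree elided on the slide
        return "Unknown"               # subtree elided on the slide
    return "Unknown"                   # subtree elided on the slide
```

Evaluating one item-set is a handful of comparisons, which is what makes the online phase cheap and the resulting decisions easy to explain.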


SLIDE 24

Datasets

1. GÉANT
   • European backbone NREN
   • Connects 34 European NRENs, 12 non-European NRENs and 2 commercial providers
   • Sampled NetFlow

2. Anella Científica
   • Catalan NREN
   • Connects more than 80 research institutions
   • Unsampled NetFlow
   • Our system is currently deployed in this scenario


SLIDE 26

Building the Ground Truth

1. Run frequent item-set mining on GÉANT NetFlow data
2. Manually analyze and classify the returned item-sets as Anomalous, Normal or Unknown

The resulting ground truth consists of 1249 labeled item-sets.


SLIDE 30

Results in GÉANT

[Bar chart: per-class precision and recall for ACK Flood, ACK Port Scan, DDoS, ICMP Flood, ICMP Scan, Network Scan, Normal, SYN Flood, SYN Port Scan, UDP Flood, UDP Port Scan and Unknown]

• Unbalanced model → overall accuracy is good (95.7%) but poor for ACK Port Scans and ICMP Floods
• Balanced model (representativeness of those classes was increased) → great improvement: 98% overall accuracy

SLIDE 32

Results in the Catalan NREN

[Bar chart: per-class precision and recall for ACK Port Scan, DDoS, ICMP Scan, Network Scan, SYN Flood, SYN Port Scan and Unknown]

• Decision tree trained on GÉANT data
• Initially: 18 false positives out of 310 anomalies in 10 days (overall accuracy: 94.11%)
• Low precision for DDoS and ACK Port Scans → 80% of these false positives were misclassified replies to Network Scans and SYN Floods
• After improving the system to account for this: 4 false positives out of 310 anomalies in 10 days (overall accuracy: 99.1%)


SLIDE 35

Conclusions

• Novel system to detect and classify anomalies in network traffic
• Conceptually simple approach → easy to comprehend and reason about detected anomalies
• High classification accuracy (> 98%)
• Low number of false positives (≈ 1%)
• Classification model trained on GÉANT and successfully reused in the Catalan NREN
• System deployed in the Catalan NREN

SLIDE 36

Practical Anomaly Detection based on Classifying Frequent Traffic Patterns

Ignasi Paredes-Oliva¹, Ismael Castell-Uroz¹, Pere Barlet-Ros¹, Xenofontas Dimitropoulos², Josep Solé-Pareta¹

¹ UPC BarcelonaTech, Spain ({iparedes,icastell,pbarlet,pareta}@ac.upc.edu)
² ETH Zürich, Switzerland (fontas@tik.ee.ethz.ch)

15th IEEE Global Internet Symposium (GI), Orlando, FL, United States

We thank DANTE and CESCA for providing us access to GÉANT and Anella Científica, respectively. This work was partially funded by the Spanish Ministry of Education under contract TEC2011-27474 and the Catalan Government under contract 2009SGR-1140.