Outside the Closed World: On Using Machine Learning for Network - PowerPoint PPT Presentation

Outside the Closed World: On Using Machine Learning for Network Intrusion Detection Robin Sommer Vern Paxson International Computer Science Institute, & International Computer Science Institute, & University of California, Berkeley Lawrence Berkeley National Laboratory IEEE Symposium on Security and Privacy May 2010

Network Intrusion Detection IEEE Symposium on Security and Privacy 2

Network Intrusion Detection NIDS IEEE Symposium on Security and Privacy 2

Network Intrusion Detection NIDS Detection Approaches: Misuse vs. Anomaly IEEE Symposium on Security and Privacy 2

Anomaly Detection Session Duration Session Volume IEEE Symposium on Security and Privacy 3

Anomaly Detection Training Phase: Building a profile of normal activity. Session Duration Session Volume IEEE Symposium on Security and Privacy 3

Anomaly Detection Training Phase: Building a profile of normal activity. Detection Phase: Matching observations against profile. Session Duration Session Volume IEEE Symposium on Security and Privacy 3

Anomaly Detection (2) • Assumption: Attacks exhibit characteristics that are different than those of normal traffic. • Originally introduced by Dorothy Denning in1987. • IDES: Host-level system building per-user profiles of activity. • Login frequency, password failures, session duration, resource consumption. IEEE Symposium on Security and Privacy 4

Anomaly Detection (2) · Technique Used Section References Statistical Profiling Section 7.2.1 NIDES [Anderson et al. 1994; Anderson et al. 1995; using Histograms Javitz and Valdes 1991], EMERALD [Porras and Neumann 1997], Yamanishi et al [2001; 2004], Ho et al. [1999], Kruegel at al [2002; 2003], Mahoney et al [2002; 2003; 2003; 2007], Sargor [1998] Parametric Statisti- Section 7.1 Gwadera et al [2005b; 2004], Ye and Chen [2001] cal Modeling Non-parametric Sta- Section 7.2.2 Chow and Yeung [2002] tistical Modeling Bayesian Networks Section 4.2 Siaterlis and Maglaris [2004], Sebyala et al. [2002], Valdes and Skinner [2000], Bronstein et al. [2001] Neural Networks Section 4.1 HIDE [Zhang et al. 2001], NSOM [Labib and Ve- muri 2002], Smith et al. [2002], Hawkins et al. [2002], Kruegel et al. [2003], Manikopoulos and Pa- pavassiliou [2002], Ramadas et al. [2003] Support Vector Ma- Section 4.3 Eskin et al. [2002] chines Rule-based Systems Section 4.4 ADAM [Barbara et al. 2001a; Barbara et al. 2003; Barbara et al. 2001b], Fan et al. [2001], Helmer et al. [1998], Qin and Hwang [2004], Salvador and Chan [2003], Otey et al. [2003] Clustering Based Section 6 ADMIT [Sequeira and Zaki 2002], Eskin et al. [2002], Wu and Zhang [2003], Otey et al. [2003] Nearest Neighbor Section 5 MINDS [Ertoz et al. 2004; Chandola et al. 2006], based Eskin et al. [2002] Spectral Section 9 Shyu et al. [2003], Lakhina et al. [2005], Thottan and Ji [2003],Sun et al. [2007] Information Theo- Section 8 Lee and Xiang [2001],Noble and Cook [2003] retic Source: Chandola et al. 2009 IEEE Symposium on Security and Privacy 4

Anomaly Detection (2) · Features used Technique Used Section References packet sizes Statistical Profiling Section 7.2.1 NIDES [Anderson et al. 1994; Anderson et al. 1995; using Histograms Javitz and Valdes 1991], EMERALD [Porras and IP addresses Neumann 1997], Yamanishi et al [2001; 2004], Ho ports et al. [1999], Kruegel at al [2002; 2003], Mahoney header fields et al [2002; 2003; 2003; 2007], Sargor [1998] Parametric Statisti- Section 7.1 Gwadera et al [2005b; 2004], Ye and Chen [2001] timestamps cal Modeling inter-arrival times Non-parametric Sta- Section 7.2.2 Chow and Yeung [2002] session size tistical Modeling Bayesian Networks Section 4.2 Siaterlis and Maglaris [2004], Sebyala et al. [2002], session duration Valdes and Skinner [2000], Bronstein et al. [2001] session volume Neural Networks Section 4.1 HIDE [Zhang et al. 2001], NSOM [Labib and Ve- payload frequencies muri 2002], Smith et al. [2002], Hawkins et al. [2002], Kruegel et al. [2003], Manikopoulos and Pa- payload tokens pavassiliou [2002], Ramadas et al. [2003] payload pattern Support Vector Ma- Section 4.3 Eskin et al. [2002] ... chines Rule-based Systems Section 4.4 ADAM [Barbara et al. 2001a; Barbara et al. 2003; Barbara et al. 2001b], Fan et al. [2001], Helmer et al. [1998], Qin and Hwang [2004], Salvador and Chan [2003], Otey et al. [2003] Clustering Based Section 6 ADMIT [Sequeira and Zaki 2002], Eskin et al. [2002], Wu and Zhang [2003], Otey et al. [2003] Nearest Neighbor Section 5 MINDS [Ertoz et al. 2004; Chandola et al. 2006], based Eskin et al. [2002] Spectral Section 9 Shyu et al. [2003], Lakhina et al. [2005], Thottan and Ji [2003],Sun et al. [2007] Information Theo- Section 8 Lee and Xiang [2001],Noble and Cook [2003] retic Source: Chandola et al. 2009 IEEE Symposium on Security and Privacy 4

The Holy Grail ... IEEE Symposium on Security and Privacy 5

The Holy Grail ... • Anomaly detection is extremely appealing. • Promises to find novel attacks without anticipating specifics. • It’s plausible : machine learning works so well in other domains. IEEE Symposium on Security and Privacy 5

The Holy Grail ... • Anomaly detection is extremely appealing. • Promises to find novel attacks without anticipating specifics. • It’s plausible : machine learning works so well in other domains. • But guess what’s used in operation ? Snort. • We find hardly any machine learning NIDS in real-world deployments. IEEE Symposium on Security and Privacy 5

The Holy Grail ... • Anomaly detection is extremely appealing. • Promises to find novel attacks without anticipating specifics. • It’s plausible : machine learning works so well in other domains. • But guess what’s used in operation ? Snort. • We find hardly any machine learning NIDS in real-world deployments. • Could using machine learning be harder than it appears? IEEE Symposium on Security and Privacy 5

Why is Anomaly Detection Hard? The intrusion detection domain faces challenges that make it fundamentally different from other fields. IEEE Symposium on Security and Privacy 6

Why is Anomaly Detection Hard? The intrusion detection domain faces challenges that make it fundamentally different from other fields. Outlier detection and the high costs of errors ! How do we find the opposite of normal? Interpretation of results ! What does that anomaly mean ? Evaluation ! ! How do we make sure it actually works? Training data ! What do we train our system with? Evasion risk ! Can the attacker mislead our system? IEEE Symposium on Security and Privacy 6

Machine Learning for Classification Feature Y Feature X IEEE Symposium on Security and Privacy 7

Machine Learning for Classification Feature Y B A C Feature X IEEE Symposium on Security and Privacy 7

Machine Learning for Classification Feature Y B Classification Problems A Optical Character Recognition Google’s Machine Translation Amazon’s Recommendations Spam Detection C Feature X IEEE Symposium on Security and Privacy 7

Outlier Detection Feature Y Feature X IEEE Symposium on Security and Privacy 8

Outlier Detection Feature Y Closed World Assumption Specify only positive examples. Adopt standing assumption that the rest is negative. Can work well if the model is very precise, or mistakes are cheap. Feature X IEEE Symposium on Security and Privacy 8

What is Normal? • Finding a stable notion of normal is hard for networks. • Network traffic is composed of many individual sessions. • Leads to enormous variety and unpredictable behavior. • Observable on all layers of the protocol stack. IEEE Symposium on Security and Privacy 9

Outside the Closed World: On Using Machine Learning for Network - PowerPoint PPT Presentation

Outside the Closed World: On Using Machine Learning for Network Intrusion Detection Robin Sommer Vern Paxson International Computer Science Institute, & International Computer Science Institute, & University of California, Berkeley

outside the Gospels Sayings of Jesus outside the Gospels Sayings of Jesus outside the Gospels

Classification of curves Simple, not closed Simple, closed Closed, not simple Not simple, not

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Supporting Open and Supporting Open and Closed World Reasoning Closed World Reasoning in the

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

Impact Evaluation of a Cluster Program: An Application of Synthetic Control Methods Diego Aboal*,

Paper Summaries Any takers? Light and Color Plan for today Computer Graphics as Virtual

International Material Resource Dependency in an Input-Output Framework Maaike C. Bouwmeester

COMMON CORE IN CHINA Curriculum mapping American Standards from an International Perspective

Unit-based Simulation for the Bedside Registered Nurse Jocelyn Disher, BSN, MSN, RN Anisha

Ast stro Pi Pi: P Pyt ython o n on the he In Internationa nal Sp Space ce S Station n

Unit 3 Part 1 Introduction to Military Component Planning Process UN Peacekeeping PDT Standards,

Fast Analytics on Big Data with H20 0xdata.com, h2o.ai Tomas Nykodym, Petr Maj Team About H2O