slide-1
SLIDE 1

Cloudy with a Chance of Breach: Forecasting Cyber Security Incidents

Yang Liu§, Armin Sarabi§, Jing Zhang§, Parinaz Naghizadeh§ Manish Karir♯, Michael Bailey∗, Mingyan Liu§,♯

§ EECS Department, University of Michigan, Ann Arbor ♯ QuadMetrics, Inc. ∗ ECE Department, University of Illinois, Urbana-Champaign

http://grs.eecs.umich.edu

Y.Liu (U. Michigan) Forecasting Cyber Security Incidents 1 / 28

slide-2
SLIDE 2

Introduction

Motivation

Increasingly frequent and high-impact data breaches

◮ Target, JP Morgan Chase, Home Depot, to name a few
◮ Increasing social and economic impact of such cyber incidents


slide-3
SLIDE 3

Introduction

Limitation of current approaches

◮ Heavily detection based
◮ Fail to detect, or too late by the time a breach is detected
◮ Not suited for cost/damage control
◮ Urgent need for more proactive measures



slide-5
SLIDE 5

Introduction

Detection
◮ Analogous to diagnosing a patient who may already be ill (e.g., via a biopsy).
◮ [Qian et al., NDSS '14; Wang et al., USENIX Security '14]

Prediction
◮ Predicting whether a presently healthy person may become ill, based on a variety of relevant factors.
◮ [Soska & Christin, USENIX Security '14]

Our goal:
◮ Understand the extent to which one can forecast incidents on an organizational level.



slide-8
SLIDE 8

Introduction

Objective

To develop the ability to forecast security incidents.

◮ Applicability: we rely solely on externally observed data; we do not require information on the internal workings of a network or its hosts.
◮ Robustness: we do not have control over, or direct knowledge of, the error embedded in the data.

Key idea:
◮ Tap into a diverse set of data that captures different aspects of a network's security posture, ranging from the explicit to the latent.



slide-13
SLIDE 13

Introduction

Why prediction?

Forecasting enables entirely new classes of applications that are otherwise not feasible.
◮ Prediction allows proactive policies and measures to be adopted, rather than reactive measures following detection.

Forecasting enables effective risk management schemes.
◮ Internal to an org.: more informed decisions on resource allocation.
◮ External to an org.: incentive mechanisms such as cyber insurance.


slide-14
SLIDE 14

Introduction

Outline of the talk

◮ Data and Preliminaries

  • Description of the data
  • Data pre-processing

◮ Forecasting methods

  • Construction of the predictor

◮ Forecasting results

  • Main prediction results & analysis


slide-15
SLIDE 15

Data: Methodology

Datasets at a glance

Category                  Collection period   Datasets
Mismanagement symptoms    Feb'13 - Jul'13     Open Recursive Resolvers, DNS Source Port, BGP misconfiguration, Untrusted HTTPS, Open SMTP Mail Relays
Malicious activities      May'13 - Dec'14     CBL, SBL, SpamCop, UCEPROTECT, WPBL, SURBL, PhishTank, hpHosts, Darknet scanners list, Dshield, OpenBL
Incident reports          Aug'13 - Dec'14     VERIS Community Database, Hackmageddon, Web Hacking Incidents

◮ Mismanagement and malicious activities are used to extract features.
◮ Incident reports are used to generate labels for training and testing.



slide-17
SLIDE 17

Data: Methodology

Security posture data

Mismanagement symptoms
◮ Deviations from known best practices; indicators of a lack of policy or expertise:
  • Misconfigured HTTPS certificates, DNS (resolver + source port), mail servers, BGP.
◮ Collected around mid-2013 (pre-incidents).

Malicious activity data: a set of 11 reputation blacklists (RBLs)
◮ Daily collections of IPs seen engaged in some malicious activity.
◮ Three malicious activity types: spam, phishing, scan.
◮ Uses data between May 2013 and December 2014.


slide-18
SLIDE 18

Data: Methodology

Security incident Data

Three incident datasets

◮ Hackmageddon
◮ Web Hacking Incidents Database (WHID)
◮ VERIS Community Database (VCDB)

Incident type   SQLi   Hijacking   Defacement   DDoS
Hackmageddon    38     9           97           59
WHID            12     5           16           45

Incident type   Crimeware   Cyber Esp.   Web app.   Else
VCDB            59          16           368        213



slide-21
SLIDE 21

Data: Pre-processing

Data Pre-processing

Incident cleaning
◮ Remove irrelevant cases, e.g., "robbery at liquor store", "something happened", etc.

Data diversity presents a challenge of alignment in time and space.
◮ Security posture data records information at the host (IP-address) level.
◮ Cyber incident reports are associated with an organization.
◮ Such alignment is not trivial: address reallocation makes boundaries unclear.

A mapping process:
◮ Summarize owner IDs from RIR databases.
◮ 4.4 million prefixes listed under 2.6 million owner IDs: a finer granularity than the routing table.
◮ Sample an IP from the organization + search for it in the above table.
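The lookup step above amounts to a longest-prefix match against the owner table. A minimal sketch using Python's ipaddress module, with made-up prefixes and owner IDs standing in for the RIR-derived table:

```python
import ipaddress

# Hypothetical prefix -> owner-ID table, as would be summarized from RIR databases.
PREFIX_OWNERS = {
    ipaddress.ip_network("192.0.2.0/24"): "OWNER-A",
    ipaddress.ip_network("198.51.100.0/22"): "OWNER-B",
    ipaddress.ip_network("198.51.100.128/25"): "OWNER-C",
}

def owner_of(ip_str):
    """Map an IP to its owner ID via longest-prefix match."""
    ip = ipaddress.ip_address(ip_str)
    matches = [net for net in PREFIX_OWNERS if ip in net]
    if not matches:
        return None
    # The most specific (longest) matching prefix wins.
    best = max(matches, key=lambda net: net.prefixlen)
    return PREFIX_OWNERS[best]
```

With millions of prefixes, a trie or a sorted prefix structure would replace the linear scan, but the matching rule is the same.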




slide-25
SLIDE 25

Forecast: Methodology

Approach at a glance

Feature extraction
◮ 258 features extracted from the datasets: primary + secondary features.

Label generation
◮ 1,000+ incident reports from the three incident datasets.

Classifier training and testing
◮ Random Forest (RF) classifier trained with the features and labels.
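This training step can be sketched with scikit-learn's RandomForestClassifier; the random matrix below is only a stand-in for the real 258-dimensional feature vectors and incident labels:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Stand-in data: 200 organizations x 258 features, with binary incident labels.
X = rng.normal(size=(200, 258))
y = rng.integers(0, 2, size=200)

# Time-ordered split: train on the "past" half, test on the "future" half.
X_train, X_test = X[:100], X[100:]
y_train, y_test = y[:100], y[100:]

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)

# Probabilistic output lets one choose an operating point on the ROC curve
# rather than committing to a fixed 0/1 decision.
scores = clf.predict_proba(X_test)[:, 1]
```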


slide-26
SLIDE 26

Forecast: Methodology

Primary features: raw data

Mismanagement symptoms (5)
◮ Five symptoms; each measured as a fraction.
◮ Predictive power of these symptoms:

[Figure: CDFs of % untrusted HTTPS and % open resolvers, for victim vs. non-victim organizations]



slide-28
SLIDE 28

Forecast: Methodology

Malicious activity time series (60 × 3)
◮ Three time series over a period: spam, phishing, scan.
◮ Recent-60 vs. Recent-14 day windows.

[Figure: example 60-day time series of daily listed IP counts for spam, phishing, and scan]

Size: number of IPs in an aggregation unit (1)
◮ To some extent captures the likelihood of an organization becoming a target of, or reporting, intentional attacks.


slide-29
SLIDE 29

Forecast: Methodology

Secondary features

Quantization and feature extraction

[Figure: 60-day time series of the number of IPs listed, quantized into regions, with a persistent region marked]

◮ Measure security efforts and responsiveness.
◮ In each quantized region, measure average magnitude, average duration, and frequency.
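A minimal sketch of secondary-feature extraction of this kind, under the assumption that days are quantized into bands and consecutive days in the same band form a region; the quantization threshold and the focus on the highest ("bad") band are illustrative, not the paper's exact definitions:

```python
import numpy as np
from itertools import groupby

def secondary_features(series, bins):
    """Quantize a daily time series and summarize its worst-band regions.

    Returns (average magnitude, average duration in days, frequency)
    over the regions that fall in the highest quantization band.
    """
    series = np.asarray(series, dtype=float)
    levels = np.digitize(series, bins)   # band index for each day
    worst = levels.max()

    # Group consecutive days with the same band into regions.
    regions, start = [], 0
    for level, run in groupby(levels):
        length = len(list(run))
        regions.append((level, start, length))
        start += length

    bad = [(lvl, s, n) for lvl, s, n in regions if lvl == worst]
    if not bad:
        return 0.0, 0.0, 0
    avg_mag = float(np.mean([series[s:s + n].mean() for _, s, n in bad]))
    avg_dur = float(np.mean([n for _, _, n in bad]))
    return avg_mag, avg_dur, len(bad)
```

For example, a series with two short spikes above the threshold yields a frequency of 2 and a short average duration, which is exactly the kind of responsiveness signal the slide describes.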


slide-30
SLIDE 30

Forecast: Methodology

A look at their predictive power (using data from Nov-Dec’13):

[Figure: CDFs of un-normalized "bad" magnitude, normalized "good" magnitude, "bad" duration, and "bad" frequency, for victim vs. non-victim organizations]



slide-32
SLIDE 32

Forecast: Overview of method

Training subjects

A subset of victim organizations: Group (1), or the incident group.
◮ Training-testing ratio, e.g., a 70-30 or 50-50 split.
◮ Split strictly according to time: use the past to predict the future.

            Hackmageddon     VCDB             WHID
Training    Oct 13 - Dec 13  Aug 13 - Dec 13  Jan 14 - Mar 14
Testing     Jan 14 - Feb 14  Jan 14 - Dec 14  Apr 14 - Nov 14

A random subset of non-victims: Group (0), or the non-incident group.
◮ Random sub-sampling is necessary to avoid class imbalance; the procedure is repeated over different random subsets.
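The sub-sampling step can be sketched as follows, assuming victims and non-victims are held in two lists; repeating the draw over different seeds corresponds to repeating over different random subsets:

```python
import random

def balanced_sample(victims, non_victims, seed):
    """Draw one balanced training set: all victims plus an equally
    sized random subset of non-victims."""
    rng = random.Random(seed)
    sampled = rng.sample(non_victims, k=len(victims))
    data = [(org, 1) for org in victims] + [(org, 0) for org in sampled]
    rng.shuffle(data)
    return data

# Repeating over different random subsets, e.g. one draw per seed:
# draws = [balanced_sample(victims, non_victims, seed=s) for s in range(10)]
```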



slide-34
SLIDE 34

Results: Main results

Prediction procedure



slide-37
SLIDE 37

Results: Main results

Prediction performance

[Figure: ROC curves (true positive vs. false positive rate) for VCDB, Hackmageddon, WHID, and ALL]

Example of desirable operating points of the classifier:

Accuracy              Hackmageddon   VCDB   WHID   All
True Positive (TP)    96%            88%    80%    88%
False Positive (FP)   10%            10%    5%     4%
Overall Accuracy      90%            90%    95%    96%
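Operating points like these come from thresholding the classifier's scores. A minimal sketch with scikit-learn's roc_curve on stand-in labels and scores, picking the best true-positive rate subject to a 10% false-positive budget:

```python
import numpy as np
from sklearn.metrics import roc_curve

# Stand-in classifier scores and ground-truth incident labels.
y_true = np.array([0, 0, 0, 0, 1, 1, 1, 1])
scores = np.array([0.10, 0.30, 0.35, 0.80, 0.40, 0.70, 0.80, 0.90])

fpr, tpr, thresholds = roc_curve(y_true, scores)

# Choose the operating point with the highest TP rate subject to FP <= 10%.
candidates = np.where(fpr <= 0.10)[0]
best = candidates[np.argmax(tpr[candidates])]
operating_threshold = thresholds[best]
```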


slide-38
SLIDE 38

Results: Other observations

Split ratio

[Figure: ROC curves for VCDB with 50-50 vs. 70-30 training-testing splits (short-term prediction)]

More training data yields better performance.


slide-39
SLIDE 39

Results: Other observations

Long term prediction


slide-40
SLIDE 40

Results: Other observations

Short-term vs. long-term prediction

[Figure: ROC curves for VCDB with a 50-50 split, short-term vs. long-term prediction]

Temporal features become outdated.



slide-44
SLIDE 44

Results: Other observations

Importance of the Features

Top feature descriptor          Value
Untrusted HTTPS Certificates    0.1531
Frequency                       0.1089
Organization size               0.0976
Open recursive resolver         0.0928

◮ Two mismanagement features rank in the top 4.

Feature category                Normalized importance
Mismanagement                   0.3229
Time series data                0.2994
Recent-60 secondary features    0.2602

◮ Secondary features are almost as important as time series data.
◮ Dynamic features > static features.
◮ Separate data sources do NOT achieve comparable results.
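Importance values of this kind can be read directly off a fitted Random Forest. A minimal scikit-learn sketch on synthetic data; the feature names are illustrative, and the label is constructed to depend mostly on the first feature so that it ranks highest:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(1)

# Illustrative feature names, not the paper's exact descriptors.
feature_names = ["untrusted_https", "bad_frequency", "org_size", "open_resolver"]
X = rng.normal(size=(300, 4))
# Label depends mostly on the first feature, plus a little noise.
y = (X[:, 0] + 0.2 * rng.normal(size=300) > 0).astype(int)

clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Mean-decrease-in-impurity importances; they are normalized to sum to 1.
ranking = sorted(zip(feature_names, clf.feature_importances_),
                 key=lambda t: t[1], reverse=True)
```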


slide-45
SLIDE 45

Results: Other observations

Case study: Data Breaches of 2014

[Figure: CDF of predictor output for a randomly selected non-victim set vs. the VCDB victim set. Example scores around the 0.85 threshold: AXTEL 0.87, Home Depot 0.85, BJP Junagadh 0.77, Target 0.84, ACME 0.85, Sony Pictures 0.90, OnlineTech 0.92, eBay 0.88]

◮ High-profile data breaches from 2014: Sony (0.90), eBay (0.88), Home Depot (0.85), Target (0.84), OnlineTech/JP Morgan Chase (0.92)



slide-49
SLIDE 49

Discussion

Discussions

Errors in the data.
Robustness against adversarial data.
Prediction by incident type.

◮ O. Thonnard, L. Bilge, A. Kashyap, and M. Lee, "Are You At Risk? Profiling Organizations and Individuals Subject to Targeted Attacks," Financial Cryptography and Data Security, 2015.
◮ A. Sarabi, P. Naghizadeh, Y. Liu, and M. Liu, "Prioritizing Security Spending: A Quantitative Analysis of Risk Distributions for Different Business Profiles," WEIS 2015.

Quality of reported data.
◮ Part of our data can be downloaded at http://grs.eecs.umich.edu.


slide-50
SLIDE 50

Discussion

Q & A

Acknowledgement

◮ We thank NSF and DHS for funding.

Project webpage (part of data being available)

◮ http://grs.eecs.umich.edu
◮ http://www.umich.edu/~youngliu
