Matthieu Jimenez Mike Papadakis Yves Le Traon
Vulnerability Prediction Models: A case study on the Linux Kernel
Jimenez et al. “Vulnerability Prediction Models: A Case Study on the Linux Kernel” SCAM’16
A vulnerability
"An information security 'vulnerability' is a mistake in software that can be directly used by a hacker to gain access to a system or network." ~ CVE website ~
Vulnerabilities are special
More important and critical. There are more bugs than vulnerabilities. They are uncovered differently: defects can easily be noticed, while vulnerabilities often cannot.
Vulnerabilities are everywhere
Example: a web server used to remotely control a glassware-cleaning machine. There is a CVE for that…
Prediction Models
Models analysing current and historical events to make predictions about future and/or unknown events!
Vulnerability Prediction
Take advantage of the knowledge on some parts of a software system to automatically classify software entities as vulnerable or not!
Granularity
It is possible to work at different levels of granularity*. In this work, we stay at the file level!
*Morrison et al. "Challenges with applying vulnerability prediction models," HotSoS'15.
Goal: replicating and comparing the main VPM approaches.
Exact replication
The procedures of an experiment are followed as closely as possible, e.g. here we replicate using the same machine-learning settings.
Independent replication
Deliberately vary one or more major aspects of the conditions, e.g. here we use our own dataset.
Include & Function calls
Introduced by Neuhaus et al. at CCS'07. Intuition: vulnerable files share a similar set of imports and function calls. The approach builds a model based on either the includes or the function calls of a file.
Overview
Preprocessing: retrieve all includes and function calls of each file.
Learning: SVM with a linear kernel.
Two models are built, one per feature set.
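As a concrete illustration, here is a minimal sketch of the includes-based model in Python with scikit-learn; the file contents and labels are invented toy data, and this is a hedged stand-in, not the authors' actual pipeline:

```python
# Sketch: includes-based VPM. Each "document" is the set of #include
# directives of one file; toy data, illustrative labels only.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

files = [
    "stdio.h stdlib.h string.h",
    "linux/module.h linux/kernel.h",
    "string.h linux/slab.h",
    "stdio.h math.h",
]
labels = [0, 1, 1, 0]  # 1 = vulnerable, 0 = not (toy labels)

# Binary presence/absence of each include
vectorizer = CountVectorizer(token_pattern=r"\S+", binary=True)
X = vectorizer.fit_transform(files)

model = LinearSVC()  # SVM with a linear kernel
model.fit(X, labels)

print(model.predict(vectorizer.transform(["string.h linux/slab.h"])))
```

A second model would be built the same way from function-call tokens instead of includes.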
Software Metrics
Several works on using metrics to predict vulnerabilities, mostly by Shin et al. Software metrics are also used in defect prediction. The approach builds a model based on software metrics (complexity, code churn, …).
Overview
Preprocessing: compute the complexity metrics of each function (keeping sum, average and max), plus the code churn and the number of authors of every file.
Learning: logistic regression.
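A hedged sketch of the learning step on file-level metrics; the columns and values below are illustrative, not the paper's exact features or data:

```python
# Sketch: metrics-based VPM with logistic regression.
# Columns (illustrative): [complexity_sum, complexity_avg, complexity_max,
#                          code_churn, n_authors]; toy values per file.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([
    [120, 4.0, 30, 500, 12],
    [ 15, 1.5,  4,  20,  1],
    [ 90, 6.0, 25, 300,  8],
    [ 10, 1.0,  2,  10,  2],
])
y = np.array([1, 0, 1, 0])  # 1 = vulnerable (toy labels)

clf = LogisticRegression(max_iter=1000)
clf.fit(X, y)

# Estimated probability that a new file is vulnerable
print(clf.predict_proba([[100, 5.0, 28, 400, 10]])[0, 1])
```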
Text Mining
Suggested by Scandariato et al. Aim: building a model that requires no human intuition for feature selection. The approach builds a model based on a bag of words extracted from a file.
Overview
Preprocessing: create a bag of words for every file (splitting the code according to the language grammar), discretise the features (making them boolean) and drop features considered useless.
Learning: random forest with 100 trees.
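The pipeline above might be sketched as follows; the token streams and labels are toys, and the whitespace tokeniser and variance-based pruning are simplified stand-ins for the grammar-based splitting and feature filtering:

```python
# Sketch: text-mining VPM. Boolean bag-of-words over source tokens,
# constant features dropped, random forest with 100 trees.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_selection import VarianceThreshold
from sklearn.ensemble import RandomForestClassifier

sources = [
    "int main ( void ) { return 0 ; }",
    "char * p = malloc ( n ) ; strcpy ( p , s ) ;",
    "memcpy ( dst , src , len ) ; free ( p ) ;",
    'printf ( "hello" ) ; return 0 ;',
]
labels = [0, 1, 1, 0]  # toy labels

vec = CountVectorizer(token_pattern=r"\S+", binary=True)  # boolean features
X = vec.fit_transform(sources).toarray()
X = VarianceThreshold().fit_transform(X)  # drop constant (useless) features

forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X, labels)
```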
Introducing the dataset
Based on commits and not releases.
Overall dataset statistics: vulnerabilities from CVE reports, covering 2006 to June 2016.
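One way such a commit-based dataset can be linked to CVE reports is by scanning commit messages for CVE identifiers. This is an assumption about how the linking could work, not the authors' exact tooling, and the commit messages below are hypothetical:

```python
# Sketch: map commits to the CVE ids mentioned in their messages.
# Real data would come from `git log`; these messages are made up.
import re

commits = [
    ("a1b2c3", "fix: KEYS: potential uninitialized variable (CVE-2016-4470)"),
    ("d4e5f6", "net: refactor socket setup"),
    ("0f9e8d", "CVE-2014-9322: x86_64 espfix for 16-bit SS"),
]

CVE_RE = re.compile(r"CVE-\d{4}-\d{4,}")

vuln_commits = {
    sha: CVE_RE.findall(msg) for sha, msg in commits if CVE_RE.search(msg)
}
print(vuln_commits)  # prints {'a1b2c3': ['CVE-2016-4470'], '0f9e8d': ['CVE-2014-9322']}
```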
Research Questions
✦ RQ1: Can we distinguish between buggy and vulnerable files?
✦ RQ2: Can we distinguish between vulnerable and non-vulnerable files?
✦ RQ3: Can we predict using past data?
Buggy vs Vulnerable files: experimental dataset
Can we distinguish between buggy and vulnerable files? Vulnerable files come from vulnerability patches.
Vulnerable vs Non-Vulnerable files: realistic dataset
Can we distinguish between vulnerable and non-vulnerable files? The dataset contains different categories of files.
RQ1 - Bugs vs Vulnerabilities
[Results figure]
RQ2 - Vulnerable vs Non-Vulnerable
[Results figure]
RQ3 Time - Bugs vs Vulnerabilities
[Figure: precision, recall and MCC per release for Includes, Software Metrics and Text Mining]
RQ3 Time - Vulnerable vs Non-Vulnerable
[Figure: precision, recall and MCC per release for Includes, Software Metrics and Text Mining]
Discussion - Findings
VPMs work well with historical data. Good precision is observed even with unbalanced data. In the practical case, the best trade-off is in favour of include & function calls; in the general case, or when favouring precision, the best approach is text mining.
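For reference, the precision, recall and MCC values compared throughout follow the standard confusion-matrix definitions; a minimal sketch, with made-up counts:

```python
# Sketch: evaluation metrics from a confusion matrix (counts are made up).
import math

def precision_recall_mcc(tp, fp, fn, tn):
    """Precision, recall and Matthews correlation coefficient."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return precision, recall, mcc

p, r, m = precision_recall_mcc(tp=70, fp=30, fn=45, tn=855)
print(round(p, 2), round(r, 2))  # prints 0.7 0.61
```

MCC is less forgiving than precision or recall alone on unbalanced data, which is why it is reported alongside them.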
Previous studies: Include and Function calls
Neuhaus et al. "Predicting vulnerable software components" CCS'07.
Reported: Precision 70%, Recall 45%. We found: Precision 70%, Recall 64%.
There is no comparison with Metrics or Text Mining, and there are no results related to time.
In the context of Linux we have similar results…
Previous studies: Software Metrics
Shin et al. "Evaluating Complexity, Code Churn, and Developer Activity Metrics as Indicators of Software Vulnerabilities" TSE'11. Shin et al. "Can traditional fault prediction models be used for vulnerability prediction?" ESE'13. Walden et al. "Predicting Vulnerable Components: Software Metrics vs Text Mining" ISSRE'14.
10-fold cross validation. Reported: Precision 3-5%, 9%, 2-52%; Recall 87-90%, 91%, 66-79%. We found: Precision 65%, Recall 22%.
Results based on time. Reported: Precision 3%, Recall 79-85%. We found: Precision 42:39%, Recall 16:24%.
In the context of Linux there are significant differences…
Previous studies: Text Mining
Scandariato et al. "Predicting Vulnerable Software Components via Text Mining" TSE'14. Walden et al. "Predicting Vulnerable Components: Software Metrics vs Text Mining" ISSRE'14.
10-fold cross validation. Reported: Precision 90%, 2-57%; Recall 77%, 74-81%. We found: Precision 76%, Recall 58%.
Results based on time. Reported: Precision 86%, Recall 77%. We found: Precision 74:93%, Recall 37:27%.
In the context of Linux there are again significant differences.
Dataset, replication package and additional results will be available soon…
Please contact Matthieu Jimenez ( Matthieu.Jimenez@uni.lu )
Thank you for your attention !