Evaluating Software Sensors for Actively Profiling Windows 2000 - PowerPoint PPT Presentation



SLIDE 1

Evaluating Software Sensors for Actively Profiling Windows 2000 Computer Users

Jude Shavlik, Mark Shavlik, Michael Fahland

SLIDE 2

Motivation and General Approach

• Identify unique characteristics of each user/server's behavior
• Every second, measure 100's of Windows 2000 properties: in/out network traffic, programs running, keys pressed, kernel usage, etc.
• Predict Prob( normal | measurements )
• Raise alarm if recent measurements seem unlikely for this user/server

SLIDE 3

Goal: Choose “Measurement Space” that Widely Separates User from General Population

[Figure: probability over possible measurements, comparing a specific user's distribution with the general population's]

SLIDE 4

Initial Experiment

• Subjects: 10 users at Shavlik Technologies
• Unobtrusively collected data for 6 weeks (7 GBytes archived)
• Task: Are current measurements from user X?
• Initial focus: keystroke data (which key pressed, time key down, time since previous key press)

SLIDE 5

Training, Tuning, and Testing Sets

• Very important in machine learning to not use testing data to optimize parameters!
• Train set: first two weeks of data (build a statistical model)
• Tune set: middle two weeks of data (choose good parameter settings)
• Test set: last two weeks of data (evaluate the "frozen" model)
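The chronological three-way split above can be sketched as follows. This is a minimal illustration, not the authors' code; the timestamps, field names, and `split_by_weeks` helper are all hypothetical.

```python
# Chronological train/tune/test split, as on this slide: the first two weeks
# build the model, the middle two tune parameters, and the last two evaluate
# the frozen model. Dates and event contents are illustrative placeholders.
from datetime import datetime, timedelta

def split_by_weeks(events, start):
    """Partition (timestamp, measurement) pairs into train/tune/test."""
    two_weeks = timedelta(weeks=2)
    train, tune, test = [], [], []
    for ts, measurement in events:
        if ts < start + two_weeks:
            train.append(measurement)
        elif ts < start + 2 * two_weeks:
            tune.append(measurement)
        else:
            test.append(measurement)
    return train, tune, test

start = datetime(2001, 1, 1)
events = [(start + timedelta(days=d), f"m{d}") for d in range(42)]
train, tune, test = split_by_weeks(events, start)
print(len(train), len(tune), len(test))  # 14 14 14
```

Splitting by time rather than at random matters here: a random split would leak a user's later habits into the training data.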

SLIDE 6

Our Intrusion-Detection Template

• If prob(current keystroke) < T then raise "mini" alarm
• If # "mini" alarms in window > F then predict intrusion

[Diagram: sliding window over the last W (window width) keystrokes along the time axis]

Use tuning set to choose good values for T and F
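The two-rule template above can be sketched directly. This is a hedged illustration, assuming the window slides one keystroke at a time; the probability values, and T, F, W settings below are placeholders (the slide says real values come from the tune set).

```python
# Sketch of the intrusion-detection template: raise a "mini" alarm whenever a
# keystroke's estimated probability falls below T, and predict an intrusion
# when more than F mini alarms occur within the last W keystrokes.
from collections import deque

def predict_intrusion(keystroke_probs, T, F, W):
    """Return True if any sliding window of W keystrokes holds > F mini alarms."""
    window = deque(maxlen=W)        # 1 = mini alarm, 0 = no alarm
    for p in keystroke_probs:
        window.append(1 if p < T else 0)
        if sum(window) > F:
            return True
    return False

probs = [0.9, 0.01, 0.8, 0.02, 0.03, 0.9]  # illustrative estimates
print(predict_intrusion(probs, T=0.05, F=2, W=4))  # True
```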

SLIDE 7

Alarm #1 - Probability We Estimate

Prob( current keystroke = K3 and
      previous keystroke = K2 and
      two-ago keystroke = K1 and
      time between K2 and K3 = Interval23 and
      time between K1 and K2 = Interval12 and
      time K3 was down = Downtime3 )

SLIDE 8

Visualizing Alarm #1

During training, count how often each path is taken (per user).

[Diagram: keystrokes K1, K2, K3 classified as alpha / digit / punct, with inter-key intervals Interval12 and Interval23 binned from very short to very long]
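The per-user counting model in the diagram can be sketched as below. The key classes and interval names come from the slide; the bin boundaries, the smoothing scheme, and every function name are assumptions for illustration only.

```python
# Minimal sketch: discretize keys into coarse classes and intervals into bins,
# then count how often each (K1, K2, K3, Interval12, Interval23) path occurs.
from collections import Counter

def key_class(key):
    if key.isalpha():
        return "alpha"
    if key.isdigit():
        return "digit"
    return "punct"

def interval_bin(ms):
    # Illustrative boundaries only: very short < 100 ms ... very long >= 800 ms
    for label, bound in [("very short", 100), ("short", 250),
                         ("medium", 500), ("long", 800)]:
        if ms < bound:
            return label
    return "very long"

def train_counts(keystrokes):
    """keystrokes: list of (key, ms_since_previous) pairs for one user."""
    counts = Counter()
    for i in range(2, len(keystrokes)):
        (k1, _), (k2, i12), (k3, i23) = keystrokes[i - 2:i + 1]
        path = (key_class(k1), key_class(k2), key_class(k3),
                interval_bin(i12), interval_bin(i23))
        counts[path] += 1
    return counts

def path_prob(counts, path, smoothing=1.0):
    # Add-one smoothing over the observed path types (a simplifying assumption)
    total = sum(counts.values())
    return (counts[path] + smoothing) / (total + smoothing * len(counts))

ks = [("a", 0), ("b", 120), ("1", 300), (".", 90)]
counts = train_counts(ks)
```

Thresholding `path_prob` against T then plugs directly into the mini-alarm template of the previous slide.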

SLIDE 9

Testset Results – Alarm #1

“Intrusion” Detection Rates

(with < 1 false alarm per day per user)

[Chart: detection rate (0% to 100%) on the test set vs. window width W (10 to 640), absolute-probability alarm]

SLIDE 10

Using Relative Probabilities

[Chart: detection rate on the test set vs. window width W (10 to 640), comparing relative-probability and absolute-probability alarms]

Alarm #2: Prob( keystrokes | machine owner ) / Prob( keystrokes | population )
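The ratio defining Alarm #2 is simple but worth spelling out; a sketch (with entirely hypothetical probability values):

```python
# Alarm #2 thresholds the probability of the keystrokes under the machine
# owner's model divided by their probability under a general-population model.
def relative_prob(prob_given_owner, prob_given_population):
    return prob_given_owner / prob_given_population

# A sequence that is merely rare for everyone scores near 1;
# one that is rare for this user specifically scores well below 1.
print(relative_prob(0.001, 0.001))  # 1.0 (rare for everyone)
print(relative_prob(0.001, 0.1))    # ~0.01 (rare for this user)
```

This normalization is what lets the alarm distinguish "unusual for this user" from "unusual for anyone", as the conclusion slide emphasizes.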

SLIDE 11

Using Two Best Alarm Types

(Chosen on Tuning Set)

[Chart: detection rate on the test set vs. window width W (10 to 640), comparing the best two alarms, relative prob, and absolute prob]

We are also investigating other keystroke-related alarms (e.g., length of words, sentences, etc.)

SLIDE 12

Cascading Window Sizes

• Alarm in window size = W
• Also alarm if alarm in any smaller window (W/2, W/4, W/8)
• (To Do: re-choose thresholds for this scenario)
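The cascading check can be sketched as below. The per-size thresholds are placeholders (the slide itself notes they still need to be re-chosen for this scenario), and the function name is hypothetical.

```python
# Sketch of cascading window sizes: when testing a full window of W
# keystrokes, also raise the alarm if any nested window of size W/2, W/4,
# or W/8 over the most recent keystrokes already exceeds its threshold.
def cascaded_alarm(mini_alarms, W, thresholds):
    """mini_alarms: list of 0/1 flags, most recent last.
    thresholds: dict mapping window size -> max mini alarms allowed."""
    size = W
    while size >= W // 8 and size >= 1:
        if sum(mini_alarms[-size:]) > thresholds[size]:
            return True
        size //= 2
    return False

flags = [0] * 70 + [1] * 10             # burst of alarms at the very end
thresholds = {80: 20, 40: 12, 20: 8, 10: 4}
print(cascaded_alarm(flags, 80, thresholds))  # True: a smaller window fires
```

A concentrated burst that is diluted inside the full window W still trips one of the smaller nested windows, which is why cascading can detect intrusions before window W is completely full.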

SLIDE 13

Cascading Window Sizes - Results

[Chart: detection and false-alarm rates on the test set vs. window width W (10 to 640), comparing cascaded vs. uncascaded Alarm #2, with false alarms held to one per day]

Can detect intrusions before window W completely full

SLIDE 14

Tradeoff between False Alarms and Detected Intrusions (ROC Curve)

[ROC curve: detection rate vs. false-alarm rate on the test set, for W=80 and W=160; one false alarm per day marked for reference]

Note: left-most values result from ZERO tune-set false alarms
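One way such an ROC curve can be traced is by sweeping the alarm-count threshold F and recording both rates at each setting. This is only a sketch of the idea; the score lists and the `roc_points` helper are hypothetical, not the authors' evaluation code.

```python
# Sweep the mini-alarm count threshold F; for each setting record the
# false-alarm rate on the owner's own windows and the detection rate on
# windows typed by other users. Scores = mini alarms per window.
def roc_points(owner_scores, intruder_scores, thresholds):
    points = []
    for F in thresholds:
        false_alarms = sum(s > F for s in owner_scores) / len(owner_scores)
        detections = sum(s > F for s in intruder_scores) / len(intruder_scores)
        points.append((false_alarms, detections))
    return points

owner = [1, 2, 2, 3, 4]        # owner windows: few mini alarms
intruders = [3, 5, 6, 8, 9]    # intruder windows: many mini alarms
for fa, det in roc_points(owner, intruders, thresholds=[2, 4, 8]):
    print(f"false-alarm rate {fa:.0%}, detection rate {det:.0%}")
```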
SLIDE 15

Current Work

• Extend to non-keystroke data
• Condition probabilities on other measurements: Prob( keystrokes | MS Office running ), Prob( keystrokes | browser running ), …
• Combine additional alarms
• Approximate full joint probability distribution (Bayes nets)
• Focus on user's measurements most divergent from general population
• Train standard machine learners to distinguish user X from general population
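Conditioning on other measurements amounts to keeping a separate keystroke model per context. A minimal sketch, assuming the context is just a label for the foreground application (the class name, labels, and smoothing scheme are all hypothetical):

```python
# One keystroke model per context, so Prob(keystrokes | MS Office running)
# and Prob(keystrokes | browser running) are estimated independently.
from collections import defaultdict, Counter

class ConditionalKeystrokeModel:
    def __init__(self):
        self.counts = defaultdict(Counter)   # context -> keystroke counts

    def observe(self, context, keystroke):
        self.counts[context][keystroke] += 1

    def prob(self, context, keystroke, smoothing=1.0):
        ctx = self.counts[context]
        total = sum(ctx.values())
        vocab = len(ctx) + 1                 # crude smoothing assumption
        return (ctx[keystroke] + smoothing) / (total + smoothing * vocab)

model = ConditionalKeystrokeModel()
for k in "the quick brown fox":             # illustrative "office" typing
    model.observe("office", k)
for k in "http://www":                      # illustrative "browser" typing
    model.observe("browser", k)
print(model.prob("office", "o") > model.prob("browser", "o"))  # True
```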

SLIDE 16

Some Related Work

• Machine learning for intrusion detection: Ghosh et al. (1999), Lane & Brodley (1998), Lee et al. (1999), Warrender et al. (1999); typically Unix-based, with system calls & TCP traffic analyzed
• Analysis of keystroke dynamics: Monrose & Rubin (1997), for authenticating passwords

SLIDE 17

Conclusion

• Can accurately characterize individual user behavior using simple models
• Separate data into train, tune, and test sets; "let the data decide" good parameter settings, on a per-user basis
• Normalize probabilities by general-population probabilities: separate "rare for this user/server" from "rare for everyone"