Time Intervals as a Behavioral Biometric John (Vinnie) Monaco - - PowerPoint PPT Presentation

time intervals as a behavioral biometric
SMART_READER_LITE
LIVE PREVIEW

Time Intervals as a Behavioral Biometric John (Vinnie) Monaco - - PowerPoint PPT Presentation

Introduction Data Modeling Conclusions Time Intervals as a Behavioral Biometric John (Vinnie) Monaco Seidenberg School of CSIS, Pace University November 11, 2015 http://vmonaco.com/dissertation John (Vinnie) Monaco Time Intervals as a


slide-1
SLIDE 1

Introduction Data Modeling Conclusions

Time Intervals as a Behavioral Biometric

John (Vinnie) Monaco

Seidenberg School of CSIS, Pace University

November 11, 2015

http://vmonaco.com/dissertation

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-2
SLIDE 2

Introduction Data Modeling Conclusions

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-3
SLIDE 3

Introduction Data Modeling Conclusions Motivation Background

“You are what when you eat”

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-4
SLIDE 4

Introduction Data Modeling Conclusions Motivation Background

Newell’s time scale.

Newell’s time scale of human action

Scale (sec) Time Units System World (theory) 107 Months SOCIAL BAND 106 Weeks 105 Days 104 Hours Task RATIONAL BAND 103 10 min Task 102 Minutes Task 101 10 sec Unit task COGNITIVE BAND 100 1 sec Operations 10-1 100 ms Deliberate act 10-2 10 ms Neural circuit BIOLOGICAL BAND 10-3 1 ms Neuron 10-4 100 µs Organelle

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-5
SLIDE 5

Introduction Data Modeling Conclusions Motivation Background

Behavioral biometrics.

The measure of human behavior for the purpose of identification or verification.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-6
SLIDE 6

Introduction Data Modeling Conclusions Motivation Background

Timestamped events and time intervals.

Timestamped events: keystrokes, touchscreen gestures, financial transactions, source code contributions... Given a series of events that occur at times t0,t1,...,tN Time interval between events τn = tn −tn−1

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-7
SLIDE 7

Introduction Data Modeling Conclusions Motivation Background

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-8
SLIDE 8

Introduction Data Modeling Conclusions Motivation Background

Why focus on timestamps?

Timestamps are truly ubiquitous Timestamps are persistent Timestamps are resilient to encryption and masking Timestamps can generally be collected without cooperation Timestamps can be incorporated into domain-specific models

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-9
SLIDE 9

Introduction Data Modeling Conclusions Motivation Background

Problems.

Identification Given a sequence of events, decide who they belong to (1 out of N) Verification Given a sequence of events with claimed responsibility, decide whether the claim is legitimate (binary classification) Prediction Given a sequence of events, predict the time of a future event

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-10
SLIDE 10

Introduction Data Modeling Conclusions Motivation Background

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-11
SLIDE 11

Introduction Data Modeling Conclusions Motivation Background

Bursts of activity in human behavior.

Random process (Poisson process, exponential inter-event times) Bursty process (power-law inter-event times) Barabasi, 2005

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-12
SLIDE 12

Introduction Data Modeling Conclusions Motivation Background

Time intervals of a random vs. bursty process.

Random process Bursty process

Barabasi, 2005 John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-13
SLIDE 13

Introduction Data Modeling Conclusions Motivation Background

Psychology of human timing.

Implicit and explicit timing

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-14
SLIDE 14

Introduction Data Modeling Conclusions Motivation Background

Neurophysiology of human timing.

Praamstra, 2006 Wiener, 2011

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-15
SLIDE 15

Introduction Data Modeling Conclusions Description Empirical patterns

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-16
SLIDE 16

Introduction Data Modeling Conclusions Description Empirical patterns

Datasets.

Dataset Source Size Freq.(Hz) Keystroke fixed-text Monaco et al. (2013) 24k keystrokes, 60 users 4.4 Keystroke free-text Villani et al. (2006) 251k keystrokes, 56 users 3.8 Mobile Jain et al. (2014) 11k gestures, 52 users 3.1 Keypad Bakelman et al. (2013) 6.6k keystrokes, 30 users 2.9 Bitcoin transactions Reid et al. (2013) 239k transactions, 61 users 2.8×10−4 Linux kernel commits Passos et al. (2014) 16k commits, 52 authors 2.6×10−6 White House visits Hudson (2015) 2.7k visits, 18 people 1.4×10−6 Terrorist events LaFree et al. (2007) 1.8k events, 10 groups 2.8×10−7 John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-17
SLIDE 17

Introduction Data Modeling Conclusions Description Empirical patterns

Keystroke.

Non-overlapping and overlapping keystrokes

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-18
SLIDE 18

Introduction Data Modeling Conclusions Description Empirical patterns

Bitcoin transaction.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-19
SLIDE 19

Introduction Data Modeling Conclusions Description Empirical patterns

Terrorist activity.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-20
SLIDE 20

Introduction Data Modeling Conclusions Description Empirical patterns

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-21
SLIDE 21

Introduction Data Modeling Conclusions Description Empirical patterns

Heavy tails.

101 102 103 104

τ

10−2 10−1 100

P(τ)

Keystroke (fixed) 101 102 103 104 105

τ

10−2 10−1 100 Keystroke (free) 101 102 103 104 105

τ

10−2 10−1 100 Bitcoin 100 101 102

τ

10−2 10−1 100

P(τ)

Kernel commits 101 102 103 104 105 106

τ

10−2 10−1 100 White House visits 100 101 102 103 104

τ

10−2 10−1 100 Terrorist activity John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-22
SLIDE 22

Introduction Data Modeling Conclusions Description Empirical patterns

Preference for a log-normal.

Power law vs log-normal loglikelihood ratio tests

Dataset Power law Log-normal Keystroke (free) 0.00 (0.00) 1.00 (1.00) Keystroke (fixed) 0.00 (0.00) 1.00 (1.00) Bitcoin 0.00 (0.00) 1.00 (1.00) Kernel commits 0.75 (0.56) 0.25 (0.08) White House visits 0.00 (0.00) 1.00 (1.00) Terrorist activity 0.70 (0.20) 0.30 (0.00)

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-23
SLIDE 23

Introduction Data Modeling Conclusions Description Empirical patterns

Time dependence.

0.0 0.2 0.4 0.6 0.8 1.0

Time of word

0.0 0.5 1.0 1.5 2.0

Density

Keystroke (fixed) 0.0 0.2 0.4 0.6 0.8 1.0

Time of word

0.0 0.5 1.0 1.5 2.0

Density

Keystroke (free) 00:00 06:00 12:00 18:00 24:00

Time of day

0.0 0.5 1.0 1.5 2.0

Density

Bitcoin M T W Th F Sa Su

Day of week

0.00 0.05 0.10 0.15 0.20 0.25 0.30

Density

Kernel commits M T W Th F Sa Su

Day of week

0.00 0.05 0.10 0.15 0.20 0.25 0.30

Density

White House visits M T W Th F Sa Su

Day of week

0.00 0.05 0.10 0.15 0.20 0.25 0.30

Density

Terrorist activity John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-24
SLIDE 24

Introduction Data Modeling Conclusions Description Empirical patterns

Non-stationarity.

1 2 3 4

Train sample

1 2 3 4

Predict sample

Keystroke (fixed) 1 2 3 4 5 6

Train sample

1 2 3 4 5 6

Predict sample

Keystroke (free) 1 2 3 4 5 6

Train sample

1 2 3 4 5 6

Predict sample

Bitcoin 1 2 3 4

Train sample

1 2 3 4

Predict sample

Kernel commits 1 2 3 4

Train sample

1 2 3 4

Predict sample

White House visits 1 2 3 4

Train sample

1 2 3 4

Predict sample

Terrorist activity John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-25
SLIDE 25

Introduction Data Modeling Conclusions Description Empirical patterns

Temporal clustering.

100 101 102 103 104

T (millisecond)

10−1 100 101 102

AF(T)

Keystroke (fixed) 102 103 104 105 106

T (millisecond)

10−1 100 101 102 Keystroke (free) 103 104 105 106 107

T (second)

10−1 100 101 102 Bitcoin 10−5 10−4 10−3 10−2 10−1

T (hour)

10−1 100 101 102

AF(T)

Kernel commits 10−3 10−2 10−1 100 101

T (day)

10−1 100 101 102 White House visits 10−5 10−4 10−3 10−2 10−1

T (day)

10−1 100 101 102 Terrorist activity John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-26
SLIDE 26

Introduction Data Modeling Conclusions Model specification Experimental results

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-27
SLIDE 27

Introduction Data Modeling Conclusions Model specification Experimental results

Modeling approaches.

Windowed observations and event intensity Time interval observations and event count

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-28
SLIDE 28

Introduction Data Modeling Conclusions Model specification Experimental results

Time interval distribution.

Log-normal f (τ;µ,σ) = 1 τσ √ 2π exp −(lnτ − µ)2 2σ2

  • τ > 0

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-29
SLIDE 29

Introduction Data Modeling Conclusions Model specification Experimental results

Transitioning between hidden states.

zt=0 zt=1

Active Passive

1−a0 1−a1 a1 a0

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-30
SLIDE 30

Introduction Data Modeling Conclusions Model specification Experimental results

Hidden Markov model.

z1 z2 zT−1 zT

1 2 T−1 T

Hidden state Observations

x1 x2 xT −1 xT

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-31
SLIDE 31

Introduction Data Modeling Conclusions Model specification Experimental results

Partially-Observable Hidden Markov Model.

z1 z2 zT−1 zT ω1 ω2 ωT −1 ωT

Partially-observable state Hidden state Observations

1 2 T−1 T

x1 x2 xT −1 xT

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-32
SLIDE 32

Introduction Data Modeling Conclusions Model specification Experimental results

POHMM as an extension to the HMM.

Introduces a dependency into the HMM to account for event types (e.g., key names). Can handle missing or incomplete observations by using the marginal distributions. Avoids overfitting through parameter mixing (or smoothing).

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-33
SLIDE 33

Introduction Data Modeling Conclusions Model specification Experimental results

Consistency.

To be consistent the model must be: Convergent

Will our estimator always converge to a value?

Asymptotically unbiased

Given a sample generated from a model with known parameters, can we recover the model parameters as the size

  • f the sample increases?

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-34
SLIDE 34

Introduction Data Modeling Conclusions Model specification Experimental results

Residuals.

µ1|. µ2|. µ1|a µ2|a µ1|b µ2|b µ1|c µ2|c

Parameter

−1.0 −0.5 0.0 0.5 1.0

Scaled residual

N 10 100 1000 10000 John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-35
SLIDE 35

Introduction Data Modeling Conclusions Model specification Experimental results

Outline

1

Introduction Motivation Background

2

Data Description Empirical patterns

3

Modeling Model specification Experimental results

4

Conclusions

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-36
SLIDE 36

Introduction Data Modeling Conclusions Model specification Experimental results

Evaluation criteria.

Identification: rank-1 classification accuracy (ACC). Verification: equal error rate (EER), the point on the ROC curve where P(false accept) = P(false reject). Continuous verification: average maximum rejection time (AMRT), the average number of events before an impostor is detected without falsely rejecting the genuine user.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-37
SLIDE 37

Introduction Data Modeling Conclusions Model specification Experimental results

Evaluation procedure.

Folds Reference Query 1 2 3 k A B A C A B A C A B A C A B A C

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-38
SLIDE 38

Introduction Data Modeling Conclusions Model specification Experimental results

Fitted model example.

250 500 750 1000

τ

0.000 0.001 0.002 0.003 0.004 0.005 0.006 0.007

Density

Active state Passive state 250 500 750 1000

τ

0.000 0.001 0.002 0.003 0.004 0.005 0.006 0.007 0.008

Density

Active state Passive state

Fixed-text Free-text

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-39
SLIDE 39

Introduction Data Modeling Conclusions Model specification Experimental results

Keystroke experimental results.

Folds Dichotomy POHMM p-value Nursery rhymes 4 0.11 (0.04) 0.00 (0.01) 0.003 Keystroke (fixed) 4 0.13 (0.02) 0.08 (0.04) 0.041 Keystroke (free) 6 0.02 (0.01) 0.06 (0.01) 8.9×10−5 Keypad 20 0.11 (0.03) 0.05 (0.02) 1.3×10−8 Mobile (w/o sensors) 20 0.20 (0.03) 0.10 (0.02) 2.7×10−14 Mobile (w/ sensors) 20 0.01 (0.01) 0.01 (0.01) 0.500

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-40
SLIDE 40

Introduction Data Modeling Conclusions Model specification Experimental results

Continuous verification.

20 40 60 80 Event 100 200 300 400 500 600 700 800 900 Penalty Impostor Genuine Threshold 100 200 300 400 500 600 700 800 Event 100 200 300 400 500 600 700 800 Penalty Impostor Genuine Threshold

Fixed-text Free-text

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-41
SLIDE 41

Introduction Data Modeling Conclusions Model specification Experimental results

Bitcoin experimental results.

Hidden states are partially observable through the transaction direction (incoming or outgoing). 0.42 ACC 0.14 EER 139 AMRT

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-42
SLIDE 42

Introduction Data Modeling Conclusions Model specification Experimental results

Linux kernel commit experimental results.

Hidden states are partially observable through the commit intention (bug fix or feature addition). 0.17 ACC 0.36 EER 41 AMRT

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-43
SLIDE 43

Introduction Data Modeling Conclusions Model specification Experimental results

White House visit experimental results.

Hidden states are partially observable through the size of the group (small or large). 0.31 ACC 0.28 EER 19 AMRT

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-44
SLIDE 44

Introduction Data Modeling Conclusions Model specification Experimental results

Terrorist activity experimental results.

Hidden states are partially observable through the group intention. 0.15 ACC 0.45 EER 37 AMRT

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-45
SLIDE 45

Introduction Data Modeling Conclusions Model specification Experimental results

What about anonymity?

Timestamps can reveal your identity. Encryption, VPN, TOR, etc., cannot prevent that.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-46
SLIDE 46

Introduction Data Modeling Conclusions Model specification Experimental results

Masking temporal behavior.

Alice and Bob want to be anonymous.

MIX Bob Alice MIX Eve

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-47
SLIDE 47

Introduction Data Modeling Conclusions Model specification Experimental results

Masking strategy properties.

Finite The expected delay between the user and the arrival process should not grow unbounded. Anonymous The mix should make it difficult to identify the user. Unpredictable The mix should make it difficult to predict future behavior.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-48
SLIDE 48

Introduction Data Modeling Conclusions Model specification Experimental results

Proposed mixing strategies experimental results.

Masking capability increases as the tolerable lag increases.

100 200 300 400 500 600

¯ δ

0.0 0.2 0.4 0.6 0.8 1.0

ACC

Keystroke (fixed) 0 100 200 300 400 500 600 700 800

¯ δ

0.0 0.2 0.4 0.6 0.8 1.0

ACC

Keystroke (free) 0.0 0.5 1.0 1.5 2.0 2.5 3.0

¯ δ

×104 0.0 0.2 0.4 0.6 0.8 1.0

ACC

Bitcoin 1 2 3 4 5 6 7 8 9

¯ δ

0.0 0.2 0.4 0.6 0.8 1.0

ACC

Kernel commits 0.0 0.5 1.0 1.5 2.0 2.5

¯ δ

×104 0.0 0.2 0.4 0.6 0.8 1.0

ACC

White House visits 10 20 30 40 50 60 70

¯ δ

0.0 0.2 0.4 0.6 0.8 1.0

ACC

Terrorist activity Delay mix Interval mix John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-49
SLIDE 49

Introduction Data Modeling Conclusions

Conclusions.

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric

slide-50
SLIDE 50

Introduction Data Modeling Conclusions

Questions.

Thank you

John (Vinnie) Monaco Time Intervals as a Behavioral Biometric