P-E-R-S-I-S-T-E-N-C-E and DISTINCTIVENESS of
In Inter er-ev event Time Distributions
in Online Human Behavior
Jiwan Jeong and Sue Moon
School of Computing, KAIST In TempWeb ’17 (WWW ’17 Companion) April 3, 2017
In Inter er-ev event Time Distributions in Online Human Behavior - - PowerPoint PPT Presentation
P-E-R-S-I-S-T-E-N-C-E and D I S T I N C T I V E N E S S of In Inter er-ev event Time Distributions in Online Human Behavior Jiwan Jeong and Sue Moon School of Computing, KAIST In TempWeb 17 (WWW 17 Companion) April 3, 2017 What is
P-E-R-S-I-S-T-E-N-C-E and DISTINCTIVENESS of
In Inter er-ev event Time Distributions
in Online Human Behavior
Jiwan Jeong and Sue Moon
School of Computing, KAIST In TempWeb ’17 (WWW ’17 Companion) April 3, 2017
What is inter-event time?
Our definition of inter-event time
Previous studies focused on
We focus on individual-level
Individuals have in
inter erval al patter erns
that are pe
persistent over time,
but distinct
ctive from others.
6Tweets by El
Ellen n DeGene neres
Twitter timeline ✂
✂ ✂
7Tweets by Ji
Jimmy y Fallon
8Tweets by Su
Sue Mo Moon
9Tweets by Al
Albe bert-Lá László Ba Barabási si
10Tweets by Ey
Eytan Ada Adar
11Tweets by Aa
Aaron n Cl Clause set
12Tweets by Ni
Nicolas C Christakis
13Tweets by Al
Alex x Ve Vespagini
14Tweets by Andr
Andrew w Ng
15Tweets by Ed
Ed Chi
16Tweets by Bru
Bruno Go Gonçalv alves
17Tweets by Hae
Haewoon Kw Kwak
18Tweets by Ca
Carl rlos s Ca Castillo
19Tweets by Pe
Peter Do Dodds
20In this work
Datasets for this study
Estimate interval patterns Compare interval patterns Design computation framework
23Estimate interval patterns Compare interval patterns Design computation framework
24Convert di
discrete e in inter ervals als to co continuous PDF
Gaussian kernel density estimation
For multi-modal distributions, we use Sheather and Jones’ bandwidth [Sheater J R Stat Soc B 1991]
26Now, we can estimate interval patterns!
Estimate interval patterns Compare interval patterns Design computation framework
28Calculate di
distanc nce between interval patterns
Jensen-Shannon distance
Now, we can compare interval patterns!
Estimate interval patterns Compare interval patterns Design computation framework
32Define se
self-di distanc nce and re refere rence di distanc nce
dself dref
33Experimental settings for longitudinal analysis
W1 W2 … W9 W10
34P-E-R-S-I-S-T-E-N-C-E & DISTINCTIVENESS
35Persistence and distinctiveness are relative
𝑒1234 vs 𝑒624
37How long do interval patterns persist?
Wi Wj
38Persistence over time
Binned into 6 groups
39Persistence over time
40Persistence over time
41Do interval patterns persist after long inactivity?
Wi Wj
42Persistence after inactivity
43Persistence after inactivity
44Do interval patterns persist through changing daily routine?
Wi Wj 24 24 12 12
Circadian distance
45Persistence through changing daily routine
46In summary,
User Identification Using Interval Signatures
APPLICATION
48User identification: Problem definition
WA WB
49A very simple identifier
WA WB
Calculate the distance d
If d < threshold, Else,
50Identification performance (1 − 𝐹𝑟𝑣𝑏𝑚 𝐹𝑠𝑠𝑝𝑠 𝑆𝑏𝑢𝑓)
Wikipedia me2day Twitter Enron Consecutive 80% 87% 83% 76% > 1 year gap 71% 78% 76% 71%
51Follow-up questions
P-E-R-S-I-S-T-E-N-C-E and DISTINCTIVENESS of
In Inter er-ev event Time Distributions
in Online Human Behavior
Dataset statistics
# of users Wikipedia me2day Twitter Enron With >25 actions 521K 587K 921K 937K With >100 actions 165K 203K 768K 542K With >500 actions 47K 43K 334K 65K
54𝑒1234 vs 𝑒624 at different window sizes
55K-means clustering of interval patterns
56Joint probability matrix for transition 𝑋
D → 𝑋 DF+
57