TOPTRAC: Topical Trajectory Pattern Mining Source: KDD 2015 - - PowerPoint PPT Presentation

toptrac topical
SMART_READER_LITE
LIVE PREVIEW

TOPTRAC: Topical Trajectory Pattern Mining Source: KDD 2015 - - PowerPoint PPT Presentation

TOPTRAC: Topical Trajectory Pattern Mining Source: KDD 2015 Advisor: Jia-Ling Koh Speaker: Hsiu-Yi,Chu Date: 2018/1/21 Outline Introduction Method Experience conclusion Introduction Introduction Goal Topical trajectory


slide-1
SLIDE 1

TOPTRAC: Topical Trajectory Pattern Mining

Source: KDD 2015 Advisor: Jia-Ling Koh Speaker: Hsiu-Yi,Chu Date: 2018/1/21

slide-2
SLIDE 2

Outline

Introduction Method Experience conclusion

slide-3
SLIDE 3

Introduction

slide-4
SLIDE 4

Introduction

Goal

Topical trajectory mining problem: Given a collection of geo-tagged message trajectories, it’s to find topical transition pattern and the top-k transition snippets which best represent each transition pattern

slide-5
SLIDE 5

Introduction

Transition pattern:

“Statue of Liberty” ”Time Square”

Transition snippet:

(m1,1, m1,2)in s1 (m4,1, m4,2)in s2

slide-6
SLIDE 6

Introduction

Definition

Trajectory(st)

geo-tagged message (mt,i)

Geo-tag Gt,i : 2-dim vector(Gt,i,x,Gt,i,y) Bag-of-word wt,i : N words{wt,i,1,…, wt,i,n}

slide-7
SLIDE 7

Introduction

Definition

Latent semantic region: a geographical location where messages are posted with the same topic preference Topical transition pattern: a movement from one semantic region to another frequently

slide-8
SLIDE 8

Outline

Introduction Method Experience conclusion

slide-9
SLIDE 9

Method

Generative Model

Assume there are M latent semantic regions K hidden topics in the collection of geo-tagged messages

slide-10
SLIDE 10

Method

Variables

slide-11
SLIDE 11

Method

Generative process

slide-12
SLIDE 12

Method

Select Geo-tag Gt,i according to a 2- dimensional Gaussian probability function:

slide-13
SLIDE 13

Method

Likelihood

slide-14
SLIDE 14

Method

Variational EM Algorithm

Maximum likelihood estimation

slide-15
SLIDE 15

Method

Finding the Most Likely Sequence

Notations:

slide-16
SLIDE 16

Method

Compute :

Compute

:

 case1: St,i-1 = 0 ; case2 : St,i-1 = 1 

slide-17
SLIDE 17

Method

Finding Frequent Transition Patterns

st’ = {(st,1, rt,1, zt,1),…,(st,n, rt,n, zt,n)}

Transition Patterns = {( r1, z1)(r2, z2)} Start with (1, r1, z1) and ends with (1, r2, z2)

τ : minimum support

slide-18
SLIDE 18

Method

Example

s1’={(0,1,1)(1,1,2)(1,2,1)}, s2’={(1,1,2)(0,2,1)(1,2,1)} with τ = 2 → {(1,2)(2,1)} is a transition pattern

Top-k transition snippets  k largest probabilities of

slide-19
SLIDE 19

Outline

Introduction Method Experience conclusion

slide-20
SLIDE 20

Experience

Data sets

NYC

9070 trajectories, 266808 geo-tagged messages M = 30, K = 30, τ = 100

SANF

809 trajectories,19664 geo-tagged messages M = 20, K = 20, τ = 10

slide-21
SLIDE 21

Experience

Baseline

LGTA

Run the inference algorithm and find frequent trajectory patterns similar in page15,16

NAÏVE

First groups messages using EM clustering Cluster the messages in each group with LDA

slide-22
SLIDE 22

Experience

slide-23
SLIDE 23

Experience

slide-24
SLIDE 24

Experience

slide-25
SLIDE 25

Experience

slide-26
SLIDE 26

Experience

slide-27
SLIDE 27

Outline

Introduction Method Experience conclusion

slide-28
SLIDE 28

Conclusion

Propose a trajectory pattern mining algorithm, called TOPTRAC, using probabilistic model to capture the spatial and topical patterns of users. Developed an efficient inference algorithm for

  • ur model and also devised algorithms to find

frequent transition patterns as well as the best representative snippets of each pattern.