Trails and Networks: Loom; Going from Trails to Networks and - - PDF document

trails and networks loom going from trails to networks
SMART_READER_LITE
LIVE PREVIEW

Trails and Networks: Loom; Going from Trails to Networks and - - PDF document

<Your Name> Trails and Networks: Loom; Going from Trails to Networks and Networks to Trails Mihovil Bartulovic mbartulovic@cmu.edu Dr. Kathleen M. Carley kathleen.carley@cs.cmu.edu Center for Computational Analysis of Social and


slide-1
SLIDE 1

<Your Name> 1

Center for Computational Analysis of Social and Organizational Systems http://www.casos.cs.cmu.edu/

Trails and Networks: Loom; Going from Trails to Networks and Networks to Trails

Mihovil Bartulovic mbartulovic@cmu.edu

  • Dr. Kathleen M. Carley

kathleen.carley@cs.cmu.edu

June 2019 2 CASOS Summer Institute 2019

Overview

  • What is a trail?
  • How do we get trail data?

– Characterize trail as network data

  • Trails and Loom

– Visualization – Networks from trails – Finding similar trails

slide-2
SLIDE 2

<Your Name> 2

June 2019 3 CASOS Summer Institute 2019

What is a Trail?

  • A trail is a trace of the movement of something
  • ver time
  • For example, the movement of an attachment

through a series of email communications creates a trail

  • What are some other examples of trails?

– People moving from place to place – geospatial trails – Twitter hashtags – …

June 2019 4 CASOS Summer Institute 2019

Geospatial Trails

  • Usually geospatial trails represent agents

travelling in continuous space and time.

  • Network data: discrete node and discrete time.

Continuous space Discrete location node Aggregate vs Continuous time Discrete time Slice

slide-3
SLIDE 3

<Your Name> 3

June 2019 5 CASOS Summer Institute 2019

Geospatial Trails

Time Location 2017, June 7, 9 am Green St. 2017, June 7, 10 am Design District 2017, June 7, 11 am Chinatown Gate 2017, June 7, 12 am 16 th st. ……. ……. Aggregate Slice

June 2019 6 CASOS Summer Institute 2019

Trails visualization

  • ORA Over-time visualizer

– Benefit: Can see changes in network structure over time – Drawback: For sparse trail data, not very effective

  • ORA GIS Visualizer

– Benefit: Can see the spatial distribution of trails – Drawback: Lose the temporal information

  • Loom

– Benefit: Can see the temporal distribution and the places travelled to – Drawback: Spatial distances, where they exist, are not preserved

slide-4
SLIDE 4

<Your Name> 4

June 2019 7 CASOS Summer Institute 2019

What we’ll do

  • Import a “DynamicMetaNetwork” with spatial

information

  • Visualization

– Understand the benefits and drawbacks of different visualizations of trail data

  • ORA Over-time visualizer
  • ORA GIS visualizer
  • Loom
  • Finding Similar trails

– Use Loom to cluster trails

  • Obtain networks from trails

June 2019 8 CASOS Summer Institute 2019

Import a dynamic meta-network

  • Same as importing a regular meta-network

– Drag-and-drop – File->Open Meta Network

  • Import TrailsDataset.xml
slide-5
SLIDE 5

<Your Name> 5

June 2019 9 CASOS Summer Institute 2019

Importing

June 2019 10 CASOS Summer Institute 2019

The Data

  • Our trail:

– Locations are our nodes – Agents are what is moving between them

  • Lets explore the data

– In ORA – Networks over time visualizer – Geospatial visualizer

slide-6
SLIDE 6

<Your Name> 6

June 2019 11 CASOS Summer Institute 2019

ORA Main Window

June 2019 12 CASOS Summer Institute 2019

Networks Over Time Visualizer

slide-7
SLIDE 7

<Your Name> 7

June 2019 13 CASOS Summer Institute 2019

Networks Over Time Visualizer

June 2019 14 CASOS Summer Institute 2019

Geospatial Visualizer

slide-8
SLIDE 8

<Your Name> 8

June 2019 15 CASOS Summer Institute 2019

Geospatial Visualizer

June 2019 16 CASOS Summer Institute 2019

Geospatial Visualizer

slide-9
SLIDE 9

<Your Name> 9

June 2019 17 CASOS Summer Institute 2019

Geospatial Visualizer

June 2019 18 CASOS Summer Institute 2019

Geospatial Visualizer

slide-10
SLIDE 10

<Your Name> 10

June 2019 19 CASOS Summer Institute 2019

Loom

June 2019 20 CASOS Summer Institute 2019

Loom

slide-11
SLIDE 11

<Your Name> 11

June 2019 21 CASOS Summer Institute 2019

Loom

June 2019 22 CASOS Summer Institute 2019

Trails and Loom

  • Visualization over time is hard

– State of the art revolves around animation – Loom allows us to visualize trails over time in a static, understandable environment

  • Trails may have similar patterns, but these are

difficult to observe

– Loom allows us to cluster similar trails together

  • We can get networks from trails, for example,

who is connected by the given attachment?

– Loom allows us to easily export such networks to ORA

slide-12
SLIDE 12

<Your Name> 12

June 2019 23 CASOS Summer Institute 2019

What we’ll do

  • Import a “DynamicMetaNetwork” with spatial

information

  • Visualization

– Understand the benefits and drawbacks of different visualizations of trail data

  • ORA Over-time visualizer
  • ORA GIS visualizer
  • Loom
  • Finding Similar trails

– Use Loom to cluster trails

  • Obtain networks from trails

June 2019 24 CASOS Summer Institute 2019

Why cluster?

  • Why are we interested in

trails and trail clustering?

– Gain information by analyzing agents across space and time together. – Interested in grouping agents that display same behavior across time. E.g. visit the same locations across time.

slide-13
SLIDE 13

<Your Name> 13

June 2019 25 CASOS Summer Institute 2019

Feature vector representation using PFSA

βαααβαββααββααββα…..

α β αα αβ βα ββ

ααα ααβ αβα αββ βαα βαβ ββα βββ

Depth = 1 Depth = 2 Depth = 3 State Probability Vector State Transition matrix

June 2019 26 CASOS Summer Institute 2019

Clustering of Trails using PFSA

  • Each trail is now represented by a numerical feature vector,

the state probability vector of the derived PFSA (the model

  • f the generative process).
  • To look at joint spatiotemporal behavior we now cluster the

agent trails based on their feature vectors.

  • This is done using a two step process.

– A coarse clustering step : Trails are initially grouped coarsely according to the locations visited, irrespective

  • f the frequency of the visits.

– A cluster refining step : The coarse clusters are each then clustered using agglomerative clustering to derive groups of trails which visit “similar” locations with “similar” frequencies.

slide-14
SLIDE 14

<Your Name> 14

June 2019 27 CASOS Summer Institute 2019

Refining the Coarse Clustering

α β αα αβ βα ββ

ααα ααβ αβα αββ βαα βαβ ββα βββ

Depth = 1 Depth = 2 Depth = 3

June 2019 28 CASOS Summer Institute 2019

Viewing time sequences

  • Each cluster contains trails with similar patterns in the sequences
  • f locations visited
  • Thus extract the longest common subsequence amongst all the

trails belonging to a cluster. B A B A N A N A N A N A A T A T A N A A N A A N A A N A A A A A N A N A Longest common string Longest common subsequence

slide-15
SLIDE 15

<Your Name> 15

June 2019 29 CASOS Summer Institute 2019

What we’ll do

  • Import a “DynamicMetaNetwork” with spatial

information

  • Understand the benefits and drawbacks of

different visualizations of trail data

– ORA Over-time visualizer – ORA GIS visualizer – Loom

  • Use Loom to cluster similar trails

– The high level concept – The details

  • Obtain networks from trails

June 2019 30 CASOS Summer Institute 2019

Generating Networks from Trails

  • We can better understand how different cities

relate via championships by getting networks out

  • f them

What we’ll do

  • Generate the networks
  • View them in ORA
  • Use ORA Network Visualizer
slide-16
SLIDE 16

<Your Name> 16

June 2019 31 CASOS Summer Institute 2019

Exporting the Matrices

June 2019 32 CASOS Summer Institute 2019

What we now have

  • ORA uses all of the trails and
  • utputs a single meta-network

– Colocation – An edge is created between the trophies if they ever existed at the same place at the same time – Visit Matrix – An edge is created between city and trophy if the city ever won that trophy – Transition – An edge is created between cities if a trophy ever traveled from one to the other in consecutive years

slide-17
SLIDE 17

<Your Name> 17

June 2019 33 CASOS Summer Institute 2019

Colocation

June 2019 34 CASOS Summer Institute 2019

Transition

slide-18
SLIDE 18

<Your Name> 18

June 2019 35 CASOS Summer Institute 2019

Visit

June 2019 36 CASOS Summer Institute 2019

Summary

  • We discussed what a trail was – a trace of the

movement of something through a network over time

  • We used an example dataset and looked at trail

data three different ways – in the Networks Over Time visualizer, the GIS visualizer and Loom

  • We talked about how to find similar trails in Loom
  • We looked at how we can get new, interested

networks out of our trail data