Week 9 Sensor and dataset processing p g Lovett et al Lovett et - - PowerPoint PPT Presentation

week 9 sensor and dataset processing p g
SMART_READER_LITE
LIVE PREVIEW

Week 9 Sensor and dataset processing p g Lovett et al Lovett et - - PowerPoint PPT Presentation

Week 9 Sensor and dataset processing p g Lovett et al Lovett et al. The Calendar as a Sensor: The Calendar as a Sensor: Analysis and Improvement Using Data Fusion with Social Networks and Locations with Social Networks and


slide-1
SLIDE 1

Week 9 – Sensor and dataset processing p g

Lovett et al “The Calendar as a Sensor: Lovett et al. – “The Calendar as a Sensor: Analysis and Improvement Using Data Fusion with Social Networks and Locations” with Social Networks and Locations

Adam Goldstein Adam Goldstein abg@wpi.edu CS 525w – Mobile Computing (03/22/2011)

slide-2
SLIDE 2

The Calendar as a Sensor The Calendar as a Sensor

  • Hypothetically calendars contain

Hypothetically, calendars contain information about when people are where

  • In reality this 1:1 relationship does not
  • In reality, this 1:1 relationship does not

exist.

N t ll t tt d t ll tt d – Not all event attenders actually attend – Not all events are even real events

2

  • Here are 4 pages about why calendars

are inaccurate.

Worcester Polytechnic Institute 2

slide-3
SLIDE 3

Preliminary Study Preliminary Study

  • Authors started with a 6 week study

Authors started with a 6 week study

– About 200 software developers and engineers engineers

  • Actually included about 20 employees

– Used MS Outlook Used MS Outlook

  • Events were mined from participants calendars
  • Actual activity was discovered by observation,

3

y y interviews, and participant diaries.

Worcester Polytechnic Institute 3

slide-4
SLIDE 4

Event Types (474 total) Event Types (474 total)

  • Genuine events (38 total, 8%)

( , )

– Shared cal event with >= 1 participant

  • Placeholder events (152, 32%)

– >= 1 participant but doesn’t actually occur, e.g. Repeated meeting that was cancelled

  • Personal reminders (232, 49%)

Personal reminders (232, 49%)

– Not a physical event, but a note by one person to themselves

Sh d i d (52 11%)

4

  • Shared reminders (52, 11%)

– Not a physical event, but a note to many people

Worcester Polytechnic Institute 4

people

slide-5
SLIDE 5

Expectation vs. Reality Expectation vs. Reality

  • Comparing real world attenders and

Comparing real world attenders and invited-on-the-calendar attenders

  • In other words:

[Invited people who actually came]

All events

5 [Invited people and attending people] [All t ] Worcester Polytechnic Institute 5 [All events]

slide-6
SLIDE 6

Expectation vs. Reality Results Expectation vs. Reality Results

Comparison Real world Start time (nearest 5 minutes) (-25, 25) End time (nearest 5 minutes) (-5, 15) Location 0.11 Total correct identification 113 Total false identification 16 Total failed identifications 9

6

Set similarity (Jaccardian index) 0.89

Worcester Polytechnic Institute 6

slide-7
SLIDE 7

So we can agree So we can agree

  • The calendar alone is not a sufficient

The calendar alone is not a sufficient sensor.

  • What if we fused it with not one not two
  • What if we fused it with not one, not two,

but three other data sources?

Y t h d th t f thi – You must have read the rest of this paper

7 Worcester Polytechnic Institute 7

slide-8
SLIDE 8

Data fusion - Enablers Data fusion Enablers

  • We can make the calendar a viable

We can make the calendar a viable sensor by adding three other information sources: sources:

– Co-presence “Social network” – Social network – Planning

8 Worcester Polytechnic Institute 8

slide-9
SLIDE 9

Co-location Co location

Gather all users Gather all users who are in the same proximity (within threshold p) (within threshold p) for a decent amount

  • f time

(within threshold t). ( ) 9 Worcester Polytechnic Institute 9

slide-10
SLIDE 10

“Social Network” Social Network

  • If given a set of users

If given a set of users

– Create graphs of all users who have social ties (in each others’ contact list) ties (in each others contact list)

  • Graphs are >= 2 people
  • If given graphs of users with events

If given graphs of users with events

– Attach ungrouped users into graphs based

  • n social ties

10

  • n social ties

Worcester Polytechnic Institute 10

slide-11
SLIDE 11

Planning Planning

  • If given a set of users

If given a set of users

– Create graphs of all users who have shared events together at a specific time events together at a specific time

  • If given graphs of users in the same

“social network” social network

– Reshape the graphs according to scheduled events that are most shared

11

events that are most shared

Worcester Polytechnic Institute 11

slide-12
SLIDE 12

Data fusion (Method 1) Data fusion (Method 1)

12 Worcester Polytechnic Institute 12

slide-13
SLIDE 13

Data fusion (Method 2) Data fusion (Method 2)

13 Worcester Polytechnic Institute 13

slide-14
SLIDE 14

Event Management Process Event Management Process

  • When triggered use data fusion to

When triggered, use data fusion to

– Create events Update events – Update events – End events

Thi i h th l i

  • This is where the real success came in

Metric Original Method 1 Method 2 Success event ID 38 37 32

14

Success event ID 38 37 32 False event ID 204 32 14 Failed event ID N/A 1 6

Worcester Polytechnic Institute 14

slide-15
SLIDE 15

Expectation vs. Reality Results Expectation vs. Reality Results

Comparison Real world Method 1 Method 2 Start time (nearest 5 minutes) (-25, 25) (-5, 20) (0, 15) End time (nearest 5 minutes) (-5, 15) (-5, 20) (-5, 20) Location 0.11 0.97 0.84 Total correct identification 113 112 94 T t l f l id tifi ti 16 36 31 Total false identification 16 36 31 Total failed identifications 9 10 29 S t i il it (J di i d ) 0 89 0 65 0 60

15

Set similarity (Jaccardian index) 0.89 0.65 0.60

Worcester Polytechnic Institute 15

slide-16
SLIDE 16

Thoughts & concerns Thoughts & concerns

  • False identifications

False identifications

– Privacy concerns, spam

  • Failed identifications

Failed identifications

– Unreliable system

  • Sensor failure

Sensor failure

  • Participant Mobility

Just passing by using conference rooms for

16

– Just passing by, using conference rooms for

  • ther reasons
  • Anything else?

Worcester Polytechnic Institute 16

Anything else?

slide-17
SLIDE 17

Conclusions Conclusions

  • With data fusion calendars can be made

With data fusion, calendars can be made a more genuine source of information

Number of false events improved from 204 – Number of false events improved from 204 using just a calendar to < 32. – Updated calendars distinguish between Updated calendars distinguish between genuine events are reminders

  • Any other useful contributions?

17

  • Any other useful contributions?
  • Other ways this fusion approach can be

used?

Worcester Polytechnic Institute 17

used?