StarTrack Next Generation A Scalable Infrastructure for Track-Based - PowerPoint PPT Presentation

StarTrack Next Generation A Scalable Infrastructure for Track-Based Applications Maya Haridasan Iqbal Mohomed Doug Terry Chandu Thekkath Li Zhang MICROSOFT RESEARCH SILICON VALLEY OSDI 2010

Location-Based Applications • Many phones already have the ability to determine their own location  GPS, cell tower triangulation, or proximity to WiFi hotspots • Many mobile applications use location information

Track Time-ordered sequence of location readings Latitude: 37.4013 Longitude: -122.0730 Time: 07/08/10 08:46:45.125

Application: Personalized Driving Directions Goal: Find directions to new gym

Application: Personalized Driving Directions Goal: Find directions to new gym ≈ Take US-101 North

A Taxonomy of Applications Personal Social Current Driving directions, Friend finder, location Nearby restaurants Crowd scenes Past Personal travel journal, Post-it notes, locations Geocoded photos Recommendations Personalized Driving Ride sharing, Discovery, Tracks Directions, Track-Based Urban sensing Search Class of applications enabled by StarTrack

StarTrack System • Insertion Insertion Application ST Server Location ST Client Manager ST Server Application ST Server • Retrieval ST Client • Manipulation • Comparison …

System Challenges 1. Handling error-prone tracks 2. Flexible programming interface 3. Efficient implementation of operations on tracks 4. Scalability and fault tolerance

Challenges of Using Raw Tracks Advantages of Canonicalization:  More efficient retrieval and comparison operations  Enables StarTrack to maintain a list of non-duplicate tracks

StarTrack API Pre-filter tracks Manipulate tracks Fetch tracks Track Collections (TC): Abstract grouping of tracks  Programming Convenience  Implementation Efficiency − Prevent unnecessary client-server message exchanges − Enable delayed evaluation − Enable caching and use of in-memory data structures

StarTrack API: Track Collections Creation  TC MakeCollection(GroupCriteria criteria, bool removeDuplicates) Manipulation  TC JoinTrackCollections (TC tCs[], bool removeDuplicates)  TC SortTracks (TC tC, SortAttribute attr)  TC TakeTracks(TC tC, int count)  TC GetSimilarTracks (TC tC, Track refTrack, float simThreshold)  TC GetPassByTracks (TC tC, Area[] areas)  TC GetCommonSegments(TC tC, float freqThreshold) Retrieval  Track[] GetTracks (TC tC, int start, int count)

API Usage: Ride-Sharing Application // get user’s most popular track in the morning TC myTC = MakeCollection (“name = Maya”, *0800 1000+, true); TC myPopTC = SortTracks(myTC, FREQ); Track track = GetTracks(myPopTC, 0, 1); // find tracks of all fellow employees TC msTC = MakeCollection (“name.Employer = MS”, *0800 1000+, true); // pick tracks from the community most similar to user’s popular track TC similarTC = GetSimilarTracks(msTC, track, 0.8); Track[] similarTracks = GetTracks(similarTC, 0, 20); // Find owners of tracks, and verify that each track is frequently traveled User[] result = FindOwnersOfFrequentTracks(similarTracks);

Efficient Implementation of Operations • StarTrack exploits redundancy in tracks for efficient retrieval from database  Set of non-duplicate tracks per user  Separate table of unique coordinates • StarTrack builds specialized in-memory data-structures to accelerate the evaluation of some operations  Quad-Trees for geographic range searches  Track Trees for similarity searches

Track Similarity Track C Track A = Track B = S1, S2, S3, S4, S5 s7 s6 Track C = S1, S2, S3, S4, S6, S7 Track A s5 Track D = S1, S2, S3, S8, S9 Track B s4 s8 s3 s9 Track D s2 s1

Track Similarity Track C Track A = Track B = S1, S2, S3, S4, S5 s7 s6 Track C = S1, S2, S3, S4, S6, S7 Track A s5 Track D = S1, S2, S3, S8, S9 Track B s4 SIM A,B = |S1−5| s8 S1−5 = 1 s3 s9 |S1−4| Track D SIM A,C = S1−4 + S5 + |S6−7| s2 s1 Limited database support for computing track similarity

Track Tree Track C s6 s7 Track A S1-5 s5 Track B s4 S1-4 s8 s3 s9 S1-3 Track D s2 s1 S1-2 S6-7 S8-9 s1 s2 s3 s4 s5 s6 s7 s8 s9

Track Tree Track C s6 s7 Track A S1-5 s5 Track B s4 S1-4 s8 s3 s9 S1-3 Track D s2 s1 S1-2 S6-7 S8-9 s1 s2 s3 s4 s5 s6 s7 s8 s9 GetSimilarTracks, GetCommonSegments

Evaluation • Performance of our Track Tree approach • Performance of 2 sample applications  Personalized Driving Directions  Ride-sharing • Configuration  Synthetically generated tracks  Up to 9 StarTrack Servers + 3 Database Servers  Server Configuration: − 2.6 GHz AMD Opteron Quad-Core Processors − 16 GB RAM

Evaluation: Track Tree • Evaluation of GetSimilarTracks • Alternative approaches:  Database filtering Pre-filter tracks that intersect ref track at database  In-memory filtering Pre-filter tracks that intersect ref track in memory  In-memory brute force Compute similarity between each track and ref track in memory

Get Similar Tracks – Query Time Database Filtering 10000 In-Memory Brute Force 1000 Query Time (ms) In-Memory Filtering 100 10 1 Track Tree 0.1 0 20 40 60 80 100 Number of tracks (thousands)

Track Tree Construction Costs 200 150 125 160 Memory 100 Seconds MBytes 120 75 80 50 Time 40 25 0 0 0 20 40 60 80 100 Number of Tracks (thousands)

Performance of Applications Personalized Driving Directions Ride Sharing - Track Collection for single user at a time - Track Collection on multiple users - Calls to GetCommonSegments - Calls to GetSimilarTracks - 30 requests/s at about 100 ms (uncached) - 30 requests/s at about 170 ms - 250 requests/s at about 55 ms (cached) 120 600 Response Time (ms) Response Time (ms) 100 500 80 400 60 300 40 200 20 100 0 0 150 175 200 225 250 0 10 20 30 40 Request Rate (per second) Request Rate (per second)

Summary • StarTrack is a scalable service designed to manage tracks and facilitate the construction of track-based applications • Important Design Features  Canonicalization of Tracks  API based on Track Collections  Use of Novel Data Structures • Availability:  We are looking for users of our infrastructure. Please contact one of the authors if you are interested.

StarTrack Next Generation A Scalable Infrastructure for Track-Based - PowerPoint PPT Presentation

StarTrack Next Generation A Scalable Infrastructure for Track-Based Applications Maya Haridasan Iqbal Mohomed Doug Terry Chandu Thekkath Li Zhang MICROSOFT RESEARCH SILICON VALLEY OSDI 2010 Location-Based Applications Many phones

StarTrack Next Generation: A Scalable Infrastructure for Track-Based Applications Maya Haridasan,

PDR Presentation StarTrack Rebecca Baturin Chris Boyle Charles Urbanowski Daniel Willmott

MDR Presentation StarTrack Rebecca Baturin Chris Boyle Charles Urbanowski Daniel Willmott

Next Generation Next Generation gTLD Dir gTLD Directory Services ectory Services Pr

Next Generation Climate Next Generation Climate Grades 6-8 Supports NGSS Lots of graphs and

Next Generation ACO Model Open Door Forum: Next Generation ACO Application Overview March 29,

Next Generation ACO Model Open Door Forum: Next Generation ACO Application Overview March 14,

Video Consoles - The Next Generation consoles and games from Next Generation 1994 - present

Next Generation Lighting Industry Alliance Keith Cook Keith Cook Chair Chair The Next

sentecacommerce.com Evolution of eCommerce 1 st Generation 2nd Generation Next Generation ERP

HIV tropism assessment HIV tropism assessment HIV tropism assessment HIV tropism assessment

Next Next Generation Sequencing: an overview of Generation Sequencing: an overview of

Next Generation Networks Next Generation Networks QoS Control Architectures and QoS Control

Next Generation Costing - Approaches to the Regulatory Costing of Next Generation Telecoms Access

Designing the Next- Designing the Next -Generation of Handheld Devices Generation of Handheld

Open Architecture Approach for the Next Generation Integrated Power System Next Generation

CS3102 Theory of Computation Warm up: Whats the language of this NFA? 0, final start

UbiComp is About Context Who are you? Are you buying this? Where are you? Where are your

Information needs of cancer policy/ planning stakeholders on just published screening trial

The principle of general local covariance and the quantization of Abelian gauge theories Claudio

Building Flexible Systems with Clojure and Datomic Stuart Sierra Cognitect We dont want

More class design with C++ Class operations are typically implemented as member functions

Integrated Technology Enhanced Learning (TEL) - Emma Scales Better Training Better Care (BTBC)

Quality Conversations Russell Harvey North W est Em ployers Associate Purpose of Programme