Querying Geo-social Data by Bridging Spatial Networks and Social Networks
Yerach Doytsher Ben Galon Yaron Kanza
1
Querying Geo-social Data by Bridging Spatial Networks and Social - - PowerPoint PPT Presentation
Querying Geo-social Data by Bridging Spatial Networks and Social Networks Yerach Ben Yaron Doytsher Galon Kanza 1 Motivation Social networks provide valuable information on social relationships among people (users) Associating
1
2
3
4
5
6
7
8
9
10
11
UK
12
UK England
Northern Ireland
Wales Scotland
13
UK England
Northern Ireland
Wales Scotland
London Bristol Liverpool Sheffield Manchester Leeds
14
UK England
Northern Ireland
Wales Scotland
London Bristol Liverpool Sheffield Manchester Leeds
15
Northern Ireland
UK England Wales Scotland
London Bristol Liverpool Sheffield Manchester Leeds
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
Geographic entities Hierarchy Friendship Life pattern Adjacency users
40
Geographic entities Hierarchy Friendship Life pattern Adjacency users
41
10000 20000 30000 40000 50000 60000 70000 2000 4000 6000 8000 10000 12000 14000 16000 Run time (milisec) Number of users
Extend 3
Neo4j W/O Cache Neo4j With Cache MySQL W/O Cache MySQL With Cache
Large effect of a cache on the running time, in the relational-based implementation Almost no effect of the cache on the running time, in the graph-based implementation
42
10000 20000 30000 40000 50000 60000 70000 200000 400000 600000 800000 Run time (milisec) Number of life patterns
Bridge 3
Neo4j W/O Cache Neo4j With Cache MySQL W/O Cache MySQL With Cache
The graph model shows better result for large datasets In both models the cache significantly improves the efficiency
43
Find where paramedics might live Queries with bridge are evaluated more efficiently over the graph DBMS than
DBMS
200 400 600 800 1000 1200 1400 1600 200000 400000 600000 800000 1000000 Run time (milisec) Network size
Query 1
Neo4j W/O Cache Neo4j With Cache MySQL W/O Cache MySQL With Cache
44
John = Select(Nsocial, name=`John') Places = Bridge(John, all, 0.5) Query_2 = MBridge(Places, all, 0.5, 20%)
500 1000 1500 2000 2500 3000 200000 400000 600000 800000 1000000 Run time (milisec) Network size
Query 2
Neo4j W/O Cache Neo4j With Cache MySQL W/O Cache MySQL With Cache
Find people that visit in 20% or more at the same places as John The evaluation of Mbridge is more efficient over RDBMS than over graph DBMS
45
46