Intuitive and Interactive Query Formulation to Improve the Usability of Query Systems for Heterogeneous Graphs
Nandish Jayaram University of Texas at Arlington PhD Advisors: Dr. Chengkai Li, Dr. Ramez Elmasri VLDB 2015 Phd W
- rkshop
August 31
st 2015
Intuitive and Interactive Query Formulation to Improve the Usability - - PowerPoint PPT Presentation
Intuitive and Interactive Query Formulation to Improve the Usability of Query Systems for Heterogeneous Graphs Nandish Jayaram University of Texas at Arlington PhD Advisors: Dr. Chengkai Li, Dr. Ramez Elmasri VLDB 2015 Phd W orkshop st 2015
st 2015
2
3
SQL QUERY: SELECT Founder.subj, Founder.obj FROM Founder, Nationality, HeadquarteredIn WHERE Founder.property = ‘founded’ AND Founder.subj = Nationality.subj AND Nationality.property = ‘nationality’ AND Founder.obj = HeadquarteredIn.subj AND HeadquarteredIn.property = ‘headquartered_in’; SPARQL QUERY: SELECT ?company ?founder WHERE { :?founder dbo:founded :?company . :?founder dbo:nationality :USA . :?company dbprop:headquartered_in :Silicon Valley . }
4
5
6
7
Possible Future Work
8
9
10 10
Query Canvas Information Panel Dynamic help indicating possible actions at every moment Useful tips for basic operations
11 11
Grey edges and nodes automatically suggested in passive mode A new node added in active mode A new edge added in active mode Suggested edges accepted by the user (with blue node) are positive edges. Grey edges ignored are negative edges. A suggested edge accepted by the user
12 12
13 13
Session Id
14 14
15 15
Final score of each candidate is its average score across all RCPs.
Session Id
Each correlation path selects a subset of the query log, with no more than ‘t’ rows in it 16 16
Target Query Graphs Edge Ranking Algorithms
Query Graph # of edges RCP RCP (no negative edges) Random Forest Classifier Random ForrestGump-directorType
3 12 11 >100 37
FilmType-directorType
5 39 >100 41 >100
DirectorType-actorType
3 >100 >100 >100 >100
FilmType-DirectorType
4 28 >100 31 >100
FilmType-DirectorType
3 14 27 25 >100
FounderType-SchoolType
5 34 >100 33 >100
FounderType-SchoolType
4 >100 >100 >100 >100
JerryYang-SchoolType
5 34 85 >100 >100
JerryYang-Yahoo-Stanford
4 14 >100 33 >100
17 17
18 18
19 19
Ranked similar answer tuples Keyword completion powered query interface Query graph automatically discovered by the system An example answer graph
Maximum Query Graph
20 20
21 21
22 22
Minimal Query Trees Maximum Query Graph (MQG) 23 23
24 24
[0.5, 1.0] : Strong positive correlation [0.3, 0.5) : Medium positive correlation [0.1, 0.3) : Small positive correlation 25 25
26 26
27 27
24 24
1 10 100 1000 F1 F2 F3 F4 F5 F6 F7 F8 F9 F10 F11 F12 F13 F14 F15 F16 F17 F18 F19 F20
Query Processing Time (secs.) Query GQBE NESS Baseline
12 13 18 10 8 10 8 12 8 8 11 9 7 11 8 9 9 7 10 7
# edges in MQG
27 27
27 27
28 28
edges and vertices!