1.Lineage 2.Consistency Relational 3.Query Mining 4 6 Lineage + - - PDF document

1 lineage 2 consistency
SMART_READER_LITE
LIVE PREVIEW

1.Lineage 2.Consistency Relational 3.Query Mining 4 6 Lineage + - - PDF document

1/24/17 3 Database Problems in Data Visualization Management Systems VIZ DATA Eugene Wu Fotis Psallidas, Zhengjie Miao, Haoci Zhang,Laura Rettig, Yifan Wu, Larry Xu, Thibault Sellam, Remco Chang, Joe Hellerstein 3 3 Database Problems


slide-1
SLIDE 1

1/24/17 1

3 Database Problems in Data Visualization Management Systems Eugene Wu

Fotis Psallidas, Zhengjie Miao, Haoci Zhang,Laura Rettig, Yifan Wu, Larry Xu, Thibault Sellam, Remco Chang, Joe Hellerstein

3

DATA VIZ

4

“Relational”

3 Database Problems in Visualization

1.Lineage 2.Consistency 3.Query Mining

6

Lineage + Interactions

Lineage: high materialization cost, queries slow

7

backward_trace() forward_trace() view_refresh() Result 1 Result 2 i1, o2 i1, o3 i3, o2 i3, o4 i3, o5

Revenue Profit Price Product

Lineage + Interactions

Lineage: high materialization cost, queries slow Interactions expressed as lineage queries

ID ID ID ID ID ID

slide-2
SLIDE 2

1/24/17 2

Revenue Profit Price Product Price Product

Lineage + Interactions

Lineage: high materialization cost, queries slow Interactions expressed as lineage queries

backward_trace() view_refresh()

IDs? IDs!

Revenue Profit Price Product Price Product

Lineage + Interactions

backward_trace( )

Revenue Profit

backward_trace() view_refresh()

backward_trace(..) ✘ People don’t want raw lineage query(backward_trace(..)) ✔ They want to query lineage

avoid materialization costs lineage query optimization

11

Revenue Profit Price Product Price Product

backward_trace() query

3 Database Problems in Visualization

1.Lineage 2.Consistency 3.Query Mining

13

Perception Push-down

14

Eyes not perfect à Render approx viz

Existing Work

Model human properties à Use models in viz system

In General

slide-3
SLIDE 3

1/24/17 3

16

No CC Serial Order

Does asynchrony affect users? When? Hypotheses

  • 1. User task matters
  • 2. Design tricks can improve completion speed

17

Multi-View Concurrency Control No CC Serial Order

21

Task matters Users self-serialize MVCC (design) has an effect

No CC Serial MVCC 10 30 50 No Delay 10 30 50 Delay T ask Completion Time

3 Database Problems in Visualization

1.Lineage 2.Consistency 3.Query Mining

23

What Interfaces to Build?

2 underserved issues Many useful UI don’t exist due to high friction or org challenges Viz systems go for coverage over task efficiency

PI Precision Interfaces

http://ngoix.github.io/cyg-x1/index.html

SELECT * FROM blackholedata WHERE 1.2 < x AND x < 1.5 AND 40 < y AND y < 60 AND timestep = -50

slide-4
SLIDE 4

1/24/17 4

http://ngoix.github.io/cyg-x1/index.html

SELECT * FROM blackholedata WHERE 1.2 < x AND x < 1.5 AND 40 < y AND y < 60 AND timestep = now

http://ngoix.github.io/cyg-x1/index.html

SELECT * FROM blackholedata WHERE 1.5 < x AND x < 1.8 AND 45 < y AND y < 65 AND timestep = now

28

SQL

29 30 31

(a) (b)

(c) (d)

slide-5
SLIDE 5

1/24/17 5

33

Cost ~ UIComplexity + #Nodes + #Edges

Sloan Digital Sky Survey

34

Completeness Simplicity

3 Database Problems in Visualization

1.Lineage 2.Consistency 3.Query Mining

35

N Database Problems in Visualization

1.Lineage 2.Consistency 3.Query Mining 4.And more…

36

eugenewu.net

ewu@cs.columbia.edu