DATA ERA A COMPARATIVE REVIEW OF STATE-OF-THE-ART COMMERCIAL - - PowerPoint PPT Presentation

data era a comparative
SMART_READER_LITE
LIVE PREVIEW

DATA ERA A COMPARATIVE REVIEW OF STATE-OF-THE-ART COMMERCIAL - - PowerPoint PPT Presentation

VISUAL ANALYTICS FOR THE BIG DATA ERA A COMPARATIVE REVIEW OF STATE-OF-THE-ART COMMERCIAL SYSTEMS Leishi Zhang , Andreas Stoffel, Michael Behrisch, Sebastian Mittelstdt, Tobias Schreck, Daniel Keim ( University of Konstanz, Germany) Outline


slide-1
SLIDE 1

VISUAL ANALYTICS FOR THE BIG DATA ERA – A COMPARATIVE REVIEW OF STATE-OF-THE-ART COMMERCIAL SYSTEMS

Leishi Zhang, Andreas Stoffel, Michael Behrisch, Sebastian Mittelstädt,

Tobias Schreck, Daniel Keim (University of Konstanz, Germany)

slide-2
SLIDE 2

Outline

 Motivation  Survey Design  Results  Summary of Key Findings  Conclusions

slide-3
SLIDE 3

Outline

 Motivation  Survey Design  Survey Result  Summary of Key Findings  Conclusions

slide-4
SLIDE 4

Motivation

 Big Data Era  A survey of VA systems 

provide overview of the state-of-the-art

stimulate innovative ideas

avoid redundant effort

 What is available 

survey of open source tools

survey of BI tools

 What is missing  comparison of commercial VA tools

slide-5
SLIDE 5

Our Goal

 Complementing existing surveys  An encompassing survey of commercial VA systems  functional comparison  benchmark system performance  Provide recommendations to potential users  Identify future directions for VA system development

slide-6
SLIDE 6

Outline

 Motivation  Survey Design  Results  Summary of Key Findings  Conclusions

slide-7
SLIDE 7

Workflow

 Identify relevant commercial systems  study current market share, select systems in different

categories, assign priority level for each system

 Design structured questionnaire for functional

comparison

 Analyze functional comparison result  Further investigation on top-priority systems  system stress test, test against benchmark data

slide-8
SLIDE 8

In this paper…

 We report our findings on the systems 

15 systems in the initial list

Tableau, Spotfire, QlikView, JMP (SAS), JasperSoft, ADVIZOR Solutions, Board, Centrifuge, Visual Analytics, Visual Mining Cognos(IBM), SQL Server BI (Microsoft), Business Objects (SAP), Teradata, PowerPivot (Microsoft)

10 answered our questionnaire (in green)

Some additional text analysis systems investigated

nSpace (Oculus), Palentir, and In-Spire (PNNL)

slide-9
SLIDE 9

Outline

 Motivation  Survey Design  Results  Summary of Key Findings  Conclusions

slide-10
SLIDE 10

Results

 Part 1: functional comparison  Part 2: test with data

 Use cases  Scalability (loading stress) test

slide-11
SLIDE 11

Functional Comparison

 Data Management  Data Modelling  Visualization  System and Architecture

slide-12
SLIDE 12

Data Management

slide-13
SLIDE 13

Data Modelling

slide-14
SLIDE 14

Visualization

slide-15
SLIDE 15

System and Architecture

slide-16
SLIDE 16

Benchmarking System Performance

 Use case study

1.

Practice Fusion Medical Research Data (Health 2.0 Data Challenge)

2.

Geospatial and Microblogging Data (VAST Challenge 2011)

 Scalability test

slide-17
SLIDE 17

Use Case 1

slide-18
SLIDE 18

Use case 2

slide-19
SLIDE 19

Use case 2

slide-20
SLIDE 20

Use case 2

slide-21
SLIDE 21

Use case 2

slide-22
SLIDE 22

Scalability –Loading Stress Test

slide-23
SLIDE 23

Outline

 Motivation  Survey Design  Survey Result  Summary of Key Findings  Conclusions

slide-24
SLIDE 24

Key Findings

 Tasks categorization

Exploration, Dashboarding, Reporting, Alerting

 System characteristics

 Interactivity Tableau  Automatic Analysis Spotfire  Data Compression & memory optimization QlikView  Analytical add-ons JMP

, Cognos

 Presentation oriented features Centrifuge, Board, Visual Mining,

JasperSoft

 Network visualization Centrifuge, Visual Analytics

 Linguistic analysis on text documents Business Objects, Cognos,

Teradata, nSpace, Palentir, and In-Spire

slide-25
SLIDE 25

Outline

 Motivation  Survey Design  Survey Result  Summary of Key Findings  Conclusions

slide-26
SLIDE 26

Concluding Remarks

 Semi- and Unstructured Data  Advanced Visualization  Customizable Visualization  Real Time Analysis  Predictive Analysis

slide-27
SLIDE 27

Thanks for your attention!

 Questions?