THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA Leishi - - PowerPoint PPT Presentation

thr throu ough ghput put biol biolog ogical ical data
SMART_READER_LITE
LIVE PREVIEW

THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA Leishi - - PowerPoint PPT Presentation

VISU VISUALIZA ALIZATION TION FOR FOR HIGH HIGH THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA Leishi ishi Zha Zhang 1 , , Jasn sna Kulj ljis is 2 and Xiao Xiaohui i Liu Liu 2 1 Univ Univer ersity sity of of Kon


slide-1
SLIDE 1

VISU VISUALIZA ALIZATION TION FOR FOR HIGH HIGH THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA

Leishi ishi Zha Zhang1, , Jasn sna Kulj ljis is2 and Xiao Xiaohui i Liu Liu2

1 Univ

Univer ersity sity of

  • f Kon
  • nsta

stanz, nz, Ger German many

2 Br

Brun unel el Univ Univer ersity sity, , UK UK

slide-2
SLIDE 2

HIGH THROUGHPUT EXPERIMENTS

Small molecule microarrays DNA microarrays High throughput sequencing High content screening

2

slide-3
SLIDE 3

HT DATA

  • large
  • high-dimensionality
  • heterogeneous

How to make sense out of the data? – visual analytics automated data analysis + interactive visualizations

3

slide-4
SLIDE 4

VISUAL ANALYTICS FOR HT DATA

Visual Analytics combines

  • automated data analysis (statistics and data mining methods)
  • interactive visualization (visual parameters, graphical

representations, and human computer interactions)

In this talk, I will discuss…

  • existing visualization techniques
  • open issues

4

slide-5
SLIDE 5

VISUALIZATION DESIGN

Information seeking paradigm “Overview First, Zoom and Filter, Details on Demand”

  • Ben Shneiderman, 1996

What is important?

  • providing overview as well as details
  • showing patterns and relations
  • supporting dynamic queries

5

slide-6
SLIDE 6

VISUAL PARAMETER DESIGN

Various visual channels

  • color
  • Shape
  • size
  • position
  • texture
  • Challenge: how to effectively

use/combine different visual parameters to show interesting part of the data?

?...

6

slide-7
SLIDE 7

VISUAL PARAMETER DESIGN

Play with the parameters

line? line+ colors?

  • ffset?

two tone colors?…

7

slide-8
SLIDE 8

GRAPHICAL REPRESENTATION DESIGN

Challenge: given the large data, how to design graphical representations which:

  • highlight patterns and relations
  • show both overview and details

8

slide-9
SLIDE 9

GR DESIGN - OVERVIEW (1)

Mapping data values – heatmap vs. parallel coordinates

no overlap easy to see value differences

9

slide-10
SLIDE 10

GR DESIGN - OVERVIEW (2)

Mapping distance/similarity between objects to a 2D/3D display as scatterplot or grids: dimension reduction

– Projection Pursuit – Principle Component Analysis – Multi Dimensional Scaling – Self Organising Map – ISOMAP – Locally Linear Embedding – Stochastic Neighbourhood Embedding – Generative Topographic Mapping – …

Focus: best approximate the structure (pairwise distance, and/or neighborhood info.) of data in the low dimensional visual space

10

slide-11
SLIDE 11

GR DESIGN - OVERVIEW (3)

Divide & display: small multiples

show details of data dimensions ordering is crucial

11

slide-12
SLIDE 12

GR DESIGN – DETAILED INFO (1)

Data summarization, detailed comparison and correlation analysis

  • density plot
  • box-and-whisker plot
  • radar/spider plot
  • correlation plot

provide good support for statistical analysis, and comparison between subsets of data

12

slide-13
SLIDE 13

GR DESIGN – DETAILED INFO (2)

Links and relations

  • force-directed
  • matrix view
  • treemap
  • hyperbolic view
  • dendrogram

none –hierarchical relations hierarchical relations

13

slide-14
SLIDE 14

HUMAN COMPUTER INTERACTIONS

Design interactive user interface

  • zooming
  • panning
  • linking and brushing

Typically a HT Data Analysis tool integrates multiple visualization

panels with linking and brushing and other mouse/keyboard functions to support dynamic query and detail-on-demand visual data analysis

14

slide-15
SLIDE 15

OPEN ISSUES

  • Scalability

15

slide-16
SLIDE 16

OPEN ISSUES

  • Scalability (hardware, software)
  • Visualizing uncertainties in data
  • Visualizing evolving changes
  • Evaluating quality of visual representations

16

slide-17
SLIDE 17

Thank you very much for your attention! 

17