Visualization In Biology
Alexander Lex
CS 171 Guest Lecture, 18.04.2013
Visualization In Biology Alexander Lex CS 171 Guest Lecture, - - PowerPoint PPT Presentation
Visualization In Biology Alexander Lex CS 171 Guest Lecture, 18.04.2013 WHA HAT T DO O I M I MEAN: N: VIS ISUALI ALIZA ZATION TION IN IN BI BIOL OLOG OGY? Y? 2 Visualizing the Flight of Bats? [Bergou 2011] 3 Visualizing Bird
Alexander Lex
CS 171 Guest Lecture, 18.04.2013
2
3
[Bergou 2011]
4
[Ferreira 2011]
5
[Boosherian 2012]
6
[Bruckner 2007]
7
100,000 200,000 300,000 400,000 500,000 600,000 700,000
Suicide Influenza and Pneumonia Kidney-Related Diabetes Alzheimer's disease Accidents Stroke Chronic lower… Cancer Heart disease
Causes of Death in the USA 2011
8
[Data from CDC Death and Mortality Repot 2011]
100,000 200,000 300,000 400,000 500,000 600,000 700,000
Suicide Influenza and Pneumonia Kidney-Related Diabetes Alzheimer's disease Accidents Stroke Chronic lower… Cancer Heart disease
Causes of Death in the USA 2011
9
[Data from CDC Death and Mortality Repot 2011]
10
Transformation from a wet-lab/experimental to computational science Challenge in MB is shifting from Data Acquisition to Data Processing & Analysis
11
12
13
14
15
Chromosomal alterations Copy-number variation Mutations SNPs
16
17
18
[Krzywinski 2009] [Meyer 2009]
miRNA Expression methylation
19
20
[Eisen 1999]
21
[Meyer 2010]
22
23
24
[Cytoscape]
25
[Kegg]
26
[Kegg]
27
[Barsky 2008]
28
tabular data
numerical & categorical e.g., mRNA, microRNA, copy number variation, methylation, mutation status, etc. clinical data
pathways
KEGG, WikiPathways
29
…relationships between multiple datasets? …relationships between tabular and graph data?
30
developed in academic setting platform for trying out radically new visualization ideas
31
Marc Streit & Alexander Lex
Johannes Kepler University Linz, AT
Harvard University, Cambridge, USA
Graz University of Technology, AT
Johannes Kepler University Linz, AT
Harvard Medical School, Boston, USA
Graz University of Technology, AT
Harvard University, Cambridge, USA
32
Case Study
33
different histology different molecular alterations
different treatment for subtypes prognosis varies between subtypes
34
35
36
20 tumor types 500 patient samples each
37
methylation levels mRNA expression copy number status mutation status microRNA expression clinical parameters pathways
38
Patients
39
Manage complex setup of multiple datasets, multiple stratifications & multiple views
Visualize complex interdependencies between multiple, heterogeneous, large datasets
40
StratomeX Data-View Integrator
41
Patients
Candidate Subtypes
Genes, Proteins, etc.
42
Cluster A1 Cluster A2 Cluster A3
43
based on an expression pattern a mutation status a copy number alteration a combination of these
44
45
T1 Evaluate whether stratifications support each other T2 Review effect of stratifications
T3 Show expression patterns in subtypes
46
47
Cluster A1 Cluster A2 Cluster A3 B1 B2
T1 Evaluate whether stratifications support each other
Tabular e.g., mRNA Categorical, e.g., mutation status
Multi-dimensional dataset
Age Name Gender Survival status Class
1st class, 2nd class, 3rd class and crew
How many male crew members survived?
http://lib.stat.cmu.edu/S/Harrell/data/descriptions/titanic.html 48
49
[Friendly 1999]
How many male crew members survived?
50
[Kosara 2006]
How many male crew members survived?
Cluster A1 Cluster A2 Cluster A3 B1 B2 Tabular e.g., mRNA Categorical, e.g., mutation status
51
T1 Evaluate whether stratifications support each other
Cluster A1 Cluster A2 Cluster A3 B1 B2 Dependent Data, e.g. clinical data
T2 Review effect of stratification
Tabular e.g., mRNA Categorical, e.g., mutation status
52
53
Band = Subset of Patients Rows = Patients Columns = Genes
54
Patients stratified by Copy Number Patients stratified by Clustering
55
Table Cate- gorical Depen- dent
T3 Show expression patterns in subtypes
56
57
http://stratomex.caleydo.org
Case Study
58
mutation, changed gene expression, modulation due to drug treatment, etc.
59
Efficient communication of information A
B 2.8 C 3.1 D
E 0.5 F 0.3
C B D F A E
60
[Lindroos2002] [KEGG]
61
62
Ideal visualization technique addresses all Talking about 3 today
63
Large datasets have more than 500 experiments
64
mRNA expression
numerical
mutation status
categorical
copy number variation
metabolite concentration
numerical
65
Explore topology of pathway Explore the attributes of the nodes (experimental data)
66
C B D F A E
Alexander Lex | Harvard University
67
68
Separate Linked Views Small Multiples Layout Adaption Linearization
[Meyer 2010] [Junker 2006]
Alexander Lex | Harvard University
Path-Extraction On-Node Mapping
[Lindroos 2002]
Alexander Lex | Harvard University
69
[Lindroos2002]
Alexander Lex | Harvard University
70
[Westenberg 2008] [Gehlenborg 2010]
Alexander Lex | Harvard University
71
[Streit 2008]
[Lindroos 2002]
On-Node Mapping
72
Small Multiples Layout Adaption Linearization
[Meyer 2010] [Junker 2006]
Alexander Lex | Harvard University
Path-Extraction Separate Linked Views
Alexander Lex | Harvard University
73
[Shannon 2008]
Alexander Lex | Harvard University
74
Alexander Lex | Harvard University
75
Separate Linked Views
[Lindroos 2002]
On-Node Mapping
76
Layout Adaption Linearization
[Meyer 2010] [Junker 2006]
Alexander Lex | Harvard University
Path-Extraction Small Multiples
Alexander Lex | Harvard University
77
Alexander Lex | Harvard University
78
[Barsky 2008] Video!
Separate Linked Views
[Lindroos 2002]
On-Node Mapping
79
Small Multiples Linearization
[Meyer 2010]
Alexander Lex | Harvard University
Path-Extraction Layout Adaption
[Junker 2006]
make space for on-node encoding
Alexander Lex | Harvard University
80
[Gehlenborg 2010] [Junker 2006]
encode information through position
Alexander Lex | Harvard University
81
[Bezerianos 2010]
Video: http://www.youtube.com/watch?v=NLiHw5B0Mco
Layout Adaption
[Junker 2006]
Separate Linked Views
[Lindroos 2002]
On-Node Mapping
82
Small Multiples
Alexander Lex | Harvard University
Path-Extraction Linearization
[Meyer 2010]
Alexander Lex | Harvard University
83
[Meyer 2010]
layout adaption separate linked views
Alexander Lex | Harvard University
84
[Meyer 2010]
85
On-Node Mapping Separate Linked Views Small Multiples Layout Adaption Linearization
[Meyer 2010] [Junker 2006] [Lindroos 2002]
Alexander Lex | Harvard University
Path-Extraction
86
Pathway View A E C B D F Pathway View C B D F A E enRoute View
Group 1 Dataset 1 Group 2 Dataset 1 Group 1 Dataset 2
B C F A D E
87
Start- and end node Iterative adding of nodes
IGF-1
low high 88
89
Group 1 Dataset 1 Group 2 Dataset 1 Group 1 Dataset 2
B C F A D E
Path Representation
90
Group 1 Dataset 1 Group 2 Dataset 1 Group 1 Dataset 2
B C F A D E
Experimental Data Representation
91
92
93
http://enroute.caleydo.org
94
95
Mark I. McCarthy et al. on our understanding of the genetic basis
96
Alexander Lex, Harvard University alex@seas.harvard.edu http://alexander-lex.com
Christian Partl Marc Streit Nils Gehlenborg Samuel Gratzl Dieter Schmalstieg Hanspeter Pfister
97