Re-inserting human interaction ! into cancer genome interpretation!
CYDNEY NIELSEN
UNIVERSITY OF BRITISH COLUMBIA BRITISH COLUMBIA CANCER AGENCY
Re-inserting human interaction ! into cancer genome interpretation ! - - PowerPoint PPT Presentation
Re-inserting human interaction ! into cancer genome interpretation ! CYDNEY NIELSEN UNIVERSITY OF BRITISH COLUMBIA BRITISH COLUMBIA CANCER AGENCY Outline 1 Visualization and its role in scientific discovery ! 2 Interactive cancer genomics
CYDNEY NIELSEN
UNIVERSITY OF BRITISH COLUMBIA BRITISH COLUMBIA CANCER AGENCY
1 Visualization and its role in scientific discovery! 2 Interactive cancer genomics visualization: why now?! 3 Building a cancer genomics visualization platform !
4 Summary! !
! !
INSIGHTS! QUESTIONS! DATA! hypothesis generation! interpretation! experiments!
...01100110...
...01100110...
INSIGHTS! QUESTIONS! DATA! experiments! communication! PUBLICATIONS! interpretation!
INSIGHTS! QUESTIONS! DATA! experiments! interpretation! computer automation + human expert! communication! PUBLICATIONS!
...01100110...
! That is, a machine and a mind can beat a mind-imitating machine working by itself.!
Visualization!
y I II III IV x 10 8 13 9 11 14 6 4 12 7 5 y 8.04 6.95 7.58 8.81 8.33 9.96 7.24 4.26 10.84 4.82 5.68 x 10 8 13 9 11 14 6 4 12 7 5 9.14 8.14 8.74 8.77 9.26 8.10 6.13 3.10 9.13 7.26 4.74 y 7.46 6.77 12.74 7.11 7.81 8.84 6.08 5.39 8.15 6.42 5.73 x 8 8 8 8 8 8 8 19 8 8 8 y 6.58 5.76 7.71 8.84 8.47 7.04 5.25 12.5 5.56 7.91 6.89 x 10 8 13 9 11 14 6 4 12 7 5
a b
Anscombe’s quartet!
...01100110...
INSIGHTS! DATA!
interpretation!
Visualization!
! !
www.apple.com
Example: ! ! Visual Information-Seeking Mantra! ! Overview first, zoom and filter, then details-on-demand. !
!
Visualization!
! !
Visualization!
posed by many data analysis workflows! ! !
To reconstruct the human genome sequence from raw sequencing data!
To finish the genome: close gaps, correct mis-assemblies, improve error probabilities of the consensus bases!
Consed | David Gordon and Phil Green!
Consed | David Gordon and Phil Green!
Consed | David Gordon and Phil Green!
automated once they are better characterized (e.g. AutoFinish)!
interactively focused by the user (e.g. local re-assembly)!
To predict diverse features that differ between tumor and matched-normal sample pairs!
A > G A > G!
Mutations! Copy number!
deletion deletion!
Gene expression!
AAAAA! AAAAA! AAAAA! AAAAA! AAAAA! AAAAA!
Rearrangements!
translocation translocation!
To integrate and interpret these features together with relevant patient metadata!
A > G A > G!
Mutations! Copy number!
deletion deletion!
Gene expression!
AAAAA! AAAAA! AAAAA! AAAAA! AAAAA! AAAAA!
Rearrangements!
translocation translocation!
Michael P Schroeder1, Abel Gonzalez-Perez1 and Nuria Lopez-Bigas*1,2
REVIEW
Schroeder et al. Genome Medicine 2013, 5:9 http://genomemedicine.com/content/5/1/9
Matrix heatmaps Genomic coordinates Networks Chromosomal coordinates Clinical data Interactions Clinical data Omics data Genes Clinical data Omics data Omics data Genes Samples
Michael P Schroeder1, Abel Gonzalez-Perez1 and Nuria Lopez-Bigas*1,2
REVIEW
Schroeder et al. Genome Medicine 2013, 5:9 http://genomemedicine.com/content/5/1/9
Ding et al., Bioinformatics, 2012!
Ha et al., Genome Research, 2014!
Example analysis: Examine a mutation in its copy number context!
dele$on' muta$on'
Ding et al., Bioinformatics, 2012!
Ha et al., Genome Research, 2014!
Example analysis: Examine a mutation in its copy number context!
mutations! copy number!
MutationSeq predictions!
MutationSeq predictions! Titan copy number predictions!
sample(s) + data type!
visual representation!
v! d! d!
mutations! copy number!
v! d! v! MutationSeq predictions!
v! v! d! d! MutationSeq predictions! Titan copy number predictions!
Select a predefined structure!
Add to an existing structure!
Sample(s)! Query by project name / tumour type / sample id! ! Single data type! e.g. mutations, copy number, etc.!
Data filters depend
selected data type!
Limit the view to genes or regions of interest!
View types depend
selected data type!
v! v! d! d! MutationSeq predictions! Titan copy number predictions!
v! v! d! d!
v! v! d! d!
Zhicheng Liu, Biye Jiang, Jeffrey Heer inMens, EuroVis 2013
total data set size!
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.91! !
sample id: SA091! chrom: 1! start: 103,062! end: 109,114! state: GAIN! !
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.91! !
sample id: SA091! chrom: 1! start: 103,062! end: 109,114! state: GAIN! !
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.91! !
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.91! !
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.91! !
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.91!
sample id: SA091! chrom: 1! start: 103,062! end: 109,114! state: GAIN! !
sample id: SA091! chrom: 1! start: 103,062! end: 109,114! state: GAIN! !
sample id: SA091! chrom: 1! start: 103,062! end: 109,114! state: GAIN! !
sample id: SA091! chrom: 1! start: 103,062! end: 109,114! state: GAIN! !
sample id: SA091! chrom: 1! position: 104,589! ref_allele: A! alt_allele: T! probability: 0.95! !
sample id: SA091! chrom: 5! start: 2,062! end: 9,199! state: GAIN! !
sample id: SA091! chrom: 2! start: 69,064! end: 89,119! state: DEL!
sample id: SA091! chrom: 2! position: 19,586! ref_allele: G! alt_allele: G!
community use!
heatmaps computed during search)!
demands of cancer genomics!
insight generation!
beginning of a rigorous scientific process to further test the idea!
Sohrab Shah! ! Samuel Aparicio! David Huntsman! Marco Marra! Janessa Laskin! ! ! Michael Smith Genome Sciences Centre!
British Columbia Cancer Agency! Vancouver, Canada!
Tom Jin! Kevin Wagner! Daniel Machev! Kelsey Hamer! Ali Bashashati! ! Shah Lab Development Team!