Unsupervised joint analysis of arrayCGH, gene expression data and supplementary features
Christine Steinhoff1, Matteo Pardo1,2, Martin Vingron1
1Max Planck Institute for Molecular Genetics,
Berlin, Germany
2Sensor Lab, INFM-CNR, Brescia, Italy
Unsupervised joint analysis of arrayCGH, gene expression data and - - PowerPoint PPT Presentation
Unsupervised joint analysis of arrayCGH, gene expression data and supplementary features Christine Steinhoff 1 , Matteo Pardo 1,2 , Martin Vingron 1 1 Max Planck Institute for Molecular Genetics, Berlin, Germany 2 Sensor Lab, INFM-CNR, Brescia,
1Max Planck Institute for Molecular Genetics,
2Sensor Lab, INFM-CNR, Brescia, Italy
Bits09, Genova - 2 - Matteo Pardo
Bits09, Genova - 3 - Matteo Pardo
Bits09, Genova - 4 - Matteo Pardo
Bits09, Genova - 5 - Matteo Pardo
Bits09, Genova - 6 - Matteo Pardo
1.
2.
3.
Bits09, Genova - 7 - Matteo Pardo
Bits09, Genova - 8 - Matteo Pardo
Bits09, Genova - 9 - Matteo Pardo
Bits09, Genova - 10 - Matteo Pardo
Bits09, Genova - 11 - Matteo Pardo
Bits09, Genova - 12 - Matteo Pardo
grade stage Died 2 1 Yes 4 3 No 2 2 yes
Data INPUT Discretization Filtering Indicator coding MCASV
C o r r V a r I n d i c a t o r M a t r i x E A P ( 1 ) ( 2 ) ( 3 ) ( 4 ) ( 5 )
pxm
[ ] [ ]
t E A E A
B I I I I
=
*
[ ]t
E A P
B I I I
=
C B S
{ 1,0,1 } E
n xp
−
{ 1,0,1 } A
n xp
−
3
{0,1 }
E
n xp E
I
=
3
{0,1 }
A
n xp A
I
=
( )
{0,1 }Cat m xp
P
I
=
F C C o r r V a r I n d i c a t o r M a t r i x E A P ( 1 ) ( 2 ) ( 3 ) ( 4 ) ( 5 )
pxm
[ ] [ ]
t E A E A
B I I I I
=
*
[ ]t
E A P
B I I I
=
C B S
{ 1,0,1 } E
n xp
−
{ 1,0,1 } A
n xp
−
3
{0,1 }
E
n xp E
I
=
3
{0,1 }
A
n xp A
I
=
( )
{0,1 }Cat m xp
P
I
=
F C
Bits09, Genova - 14 - Matteo Pardo
Two Fold Change Circular Binary Segmentation (R Package DNAcopy) Genes with highest correlation between aCGH and expression Genes with highest variance across patients
A
E
Bits09, Genova - 15 - Matteo Pardo
t p E A
Nenadic, O. and Greenacre, M. (2006) Multiple Correspondence Analysis and Related Methods. Chapman & Hall/CRC, London Burt matrix: super-table of all contingency tables (between genes couples) MCA: find plane maximizing inertia Project covariates on the plane
Bits09, Genova - 16 - Matteo Pardo
Bits09, Genova - 17 - Matteo Pardo
Bits09, Genova - 18 - Matteo Pardo
Bits09, Genova - 19 - Matteo Pardo
Bits09, Genova - 20 - Matteo Pardo
Bits09, Genova - 21 - Matteo Pardo
Tumor grade 1, 2 and 3 separate (only) along the first component the gene pattern of a patient is determined foremostly by its tumor grade.
Bits09, Genova - 22 - Matteo Pardo
display considerable variation along first component
the side of higher tumor grade.
strongest negative indicator?
Bits09, Genova - 23 - Matteo Pardo
separate clearly from each other but show no
heterogeneity of gene patterns inside each state Lack of genomic support for this classification?
Bits09, Genova - 24 - Matteo Pardo
projection on the first component independent of tumor grade progression.
remaining information in the data.
value on 2nd MCA component
Bits09, Genova - 25 - Matteo Pardo
Bits09, Genova - 26 - Matteo Pardo
GO category enrichment
Bits09, Genova - 27 - Matteo Pardo
Bits09, Genova - 28 - Matteo Pardo
5 degrees 10 15 30 Chromosome distribution of the selected genes
Bits09, Genova - 29 - Matteo Pardo
10 No chromosome localization but significant GO + BioCharta enrichment:
and TNF
Bits09, Genova - 31 - Matteo Pardo
Bits09, Genova - 32 - Matteo Pardo
Bits09, Genova - 33 - Matteo Pardo Filter F1 F1acgh F1expr F2 # selected genes F1 85 103 16 179 F1acgh 18 34 193 F1expr 4 189 F2 194 10 degree F1 F1acgh F1expr F2 # selected genes F1 4 5 26 F1acgh 1 36 F1expr 28 F2 27
Supplementary figure 1 Supplementary figure 2