Modeling cancer cells using multi-omics data
February 23, 2019 The Second Korea-Japan Machine Learning Workshop
Sun Kim
Bioinformatics Institute Computer Science and Engineering Seoul National University
BHI lab @ SNU
Bio & Health Informatics Lab, SNU
literature
Sangseon Lee, Taeheon Lee, Yung-Kyun Noh, and Sun Kim Bio & Health Informatics Lab.
literature
straightforward
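alphabet $\mathcal{A}$, $|\mathcal{A}| = \ell$ ($\ell = 4$ for DNA), a feature map from $\mathcal{X}$ to $\mathbb{R}^{\ell^k}$:
$$\Phi_k(x) = \bigl(\phi_\alpha(x)\bigr)_{\alpha \in \mathcal{A}^k}$$
$$K_k(x, y) = \langle \Phi_k(x), \Phi_k(y) \rangle$$
$$\tilde{K}_k(x, y) = \frac{K_k(x, y)}{\sqrt{K_k(x, x)\, K_k(y, y)}}$$
$$D_k(x, y) = \tilde{K}_k(x, x) + \tilde{K}_k(y, y) - 2\tilde{K}_k(x, y)$$
$\phi_\alpha(x)$: the number of times $\alpha$ occurs in $x$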
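A minimal Python sketch of the k-mer (spectrum) kernel defined above may make the definitions concrete. The function names are hypothetical, the feature map is stored sparsely as a `Counter` rather than as a dense vector, and the square root in the normalization is the usual cosine normalization, which is assumed here.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k):
    """Sparse feature map: each k-mer mapped to its number of occurrences in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(x, y, k):
    """Inner product of the two k-mer count vectors."""
    cx, cy = kmer_counts(x, k), kmer_counts(y, k)
    # A Counter returns 0 for k-mers it has never seen, so absent k-mers drop out.
    return sum(n * cy[kmer] for kmer, n in cx.items())


def normalized_kernel(x, y, k):
    """Kernel scaled so that a sequence compared with itself scores 1.

    Assumes both sequences are at least k long (otherwise the denominator is 0).
    """
    return spectrum_kernel(x, y, k) / sqrt(
        spectrum_kernel(x, x, k) * spectrum_kernel(y, y, k)
    )


def kernel_distance(x, y, k):
    """Distance induced by the normalized kernel."""
    return (
        normalized_kernel(x, x, k)
        + normalized_kernel(y, y, k)
        - 2 * normalized_kernel(x, y, k)
    )


if __name__ == "__main__":
    a, b = "ACGTACGT", "ACGTTGCA"
    print(spectrum_kernel(a, b, 2), kernel_distance(a, b, 2))
```

With this normalization the distance is 0 for identical k-mer profiles and grows as the profiles diverge.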
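$$K_k^{\mathrm{rank}}(x, y) = \tau\bigl(\Phi_k^{\mathrm{landmark}}(x),\ \Phi_k^{\mathrm{landmark}}(y)\bigr)$$
$$\tilde{K}_k^{\mathrm{rank}}(x, y) = \frac{1 + K_k^{\mathrm{rank}}(x, y)}{2}$$
$$D_k^{\mathrm{rank}}(x, y) = \tilde{K}_k^{\mathrm{rank}}(x, x) + \tilde{K}_k^{\mathrm{rank}}(y, y) - 2\tilde{K}_k^{\mathrm{rank}}(x, y) = 2\bigl(1 - \tilde{K}_k^{\mathrm{rank}}(x, y)\bigr)$$
(the last step uses $\tau(v, v) = 1$, so $\tilde{K}_k^{\mathrm{rank}}(x, x) = 1$)
$\tau$: the Kendall tau rank correlation
$\Phi_k^{\mathrm{landmark}}$: a feature map on the landmark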
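The rank-based kernel above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' implementation: the slides do not say which Kendall tau variant is used, so tau-b (which handles ties) is assumed, and the landmark feature map is not specified, so the functions simply take two already-computed feature vectors. All function names are hypothetical.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(u, v):
    """Kendall tau-b rank correlation of two equal-length numeric vectors.

    Assumes neither vector is constant (otherwise the denominator is 0).
    """
    pairs = list(combinations(zip(u, v), 2))
    concordant = discordant = tied_u = tied_v = 0
    for (u1, v1), (u2, v2) in pairs:
        du, dv = u1 - u2, v1 - v2
        if du == 0:
            tied_u += 1
        if dv == 0:
            tied_v += 1
        if du * dv > 0:
            concordant += 1
        elif du * dv < 0:
            discordant += 1
    n0 = len(pairs)
    return (concordant - discordant) / sqrt((n0 - tied_u) * (n0 - tied_v))


def rank_kernel(u, v):
    """Rank kernel: tau of the two feature vectors, in [-1, 1]."""
    return kendall_tau_b(u, v)


def normalized_rank_kernel(u, v):
    """Shift and scale tau into [0, 1]."""
    return (1 + rank_kernel(u, v)) / 2


def rank_distance(u, v):
    """Distance induced by the normalized rank kernel."""
    return (
        normalized_rank_kernel(u, u)
        + normalized_rank_kernel(v, v)
        - 2 * normalized_rank_kernel(u, v)
    )


if __name__ == "__main__":
    print(rank_distance([1, 2, 3, 4], [1, 3, 2, 4]))
```

Because only the ordering of feature values matters, this distance is unchanged by any monotone rescaling of the features.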
[Figure: CpG sites within a CpG island, shown unmethylated (CG) and methylated (mCG)]
C: cytosine
mC: methylcytosine
and CpG islands.
exon > CpG island > intron.
Sangseon Lee, Sangsoo Lim, Taeheon Lee, and Sun Kim
Dimension Reduction; Enrichment Test; Pathway Activity Inference; Subpath Mining
Pathway #1, Pathway #2, ..., Pathway #N
Layers
propagation
considering
information → Attention & Network propagation
Multi attention based ensemble (MAE)
Network propagation on patient-specific pathway network
Natural killer cell mediated cytotoxicity; Retrograde endocannabinoid signaling
Apoptosis; Proliferation; Neovascularization and angiogenesis; Metastasis formation; Autophagy
Rescued by Network propagation
computational analysis.
cells.
attention mechanism and network propagation technique.
ensemble
good performance in cancer subtype classification.
pathway interaction networks as a result of using attention mechanisms and pathway propagation.
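The summary above relies on network propagation over a pathway network but does not give the update rule. One common formulation is a random walk with restart, sketched below as an assumption rather than as the method used in this work. The function name, the restart probability, and the toy graph are hypothetical.

```python
def network_propagation(adj, seed, restart=0.5, tol=1e-9, max_iter=1000):
    """Random walk with restart on a weighted graph.

    adj  : square adjacency matrix as a list of lists (adj[i][j] = weight of edge j -> i)
    seed : initial score per node (for example, an attention weight per pathway);
           assumed to have a positive sum
    """
    n = len(adj)
    col_sums = [sum(adj[i][j] for i in range(n)) for j in range(n)]
    # Column-normalize so each node spreads its score in proportion to edge weight.
    w = [
        [adj[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
        for i in range(n)
    ]
    total = sum(seed)
    p0 = [s / total for s in seed]
    p = p0[:]
    for _ in range(max_iter):
        nxt = [
            (1 - restart) * sum(w[i][j] * p[j] for j in range(n)) + restart * p0[i]
            for i in range(n)
        ]
        # Stop once successive iterates are close in L1 norm.
        if sum(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p


if __name__ == "__main__":
    # Toy path graph 0 - 1 - 2 with all of the initial score on node 0.
    path = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
    print(network_propagation(path, [1, 0, 0]))
```

In the toy example node 2 starts with a score of zero and ends with a positive one, which is the sense in which propagation can recover a pathway that the initial scores missed.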
Dohoon Lee, Sangseon Lee, and Sun Kim Bio & Health Informatics Lab.
very high dimensions
dimension)
data.
Cancer subclone
X 30% X 20% X 50%
11111 / 11010 / 00000 / 00101 / 00111 / 00000 / …
00000 / 00001 / 11111 / 01001 / 00111 / 00000 / …
00000 / 11110 / 00000 / 01101 / 00111 / 11111 / …

11111 / 11010 / 00000 / 01101 / 00111 / 00000 / …
11111 / 10011 / 00000 / 10001 / 00111 / 00000 / …
11111 / 11010 / 00000 / 00101 / 00111 / 00000 / …
11111 / 00010 / 00000 / 01001 / 00111 / 00000 / …
11111 / 11011 / 00000 / 10101 / 00111 / 00000 / …
00000 / 00100 / 11111 / 00101 / 00111 / 00000 / …
00000 / 01100 / 11111 / 01101 / 00111 / 00000 / …
00000 / 00001 / 11111 / 01001 / 00111 / 00000 / …
00000 / 00110 / 00000 / 11111 / 00111 / 11111 / …
00000 / 10011 / 00000 / 01001 / 00111 / 11111 / …
00000 / 00010 / 00000 / 01101 / 00111 / 11111 / …
00000 / 10011 / 00000 / 00111 / 00111 / 11111 / …
00000 / 11110 / 00000 / 01101 / 00111 / 11111 / …
00000 / 11011 / 00000 / 10101 / 00111 / 11111 / …
00000 / 00010 / 00000 / 01101 / 00111 / 11111 / …
Fingerprint epilocus for red subclone:
11111 11111 11111 11111 11111 00000 00000 00000 00000 00000 00000 00000 00000 00000 00000
Non-fingerprint epilocus:
11010 10011 11010 00010 11011 00100 01100 00001 00010 10011 11110 11011 00010 00110 10011
Fingerprint epilocus for blue subclone:
00000 00000 00000 00000 00000 11111 11111 11111 00000 00000 00000 00000 00000 00000 00000
Fingerprint epilocus for green subclone:
00000 00000 00000 00000 00000 00000 00000 00000 11111 11111 11111 11111 11111 11111 11111
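The fingerprint idea illustrated above can be sketched in Python. The criterion used here is an assumption read off the example, not a definition taken from the slides: an epilocus counts as a fingerprint when every read is either fully methylated or fully unmethylated there and both states occur. The share of fully methylated reads then serves as a rough estimate of the subclone fraction. The function name is hypothetical, and the reads are the fifteen shown on the slide, truncated to the six epiloci that are visible.

```python
READS = [
    "11111 / 11010 / 00000 / 01101 / 00111 / 00000",
    "11111 / 10011 / 00000 / 10001 / 00111 / 00000",
    "11111 / 11010 / 00000 / 00101 / 00111 / 00000",
    "11111 / 00010 / 00000 / 01001 / 00111 / 00000",
    "11111 / 11011 / 00000 / 10101 / 00111 / 00000",
    "00000 / 00100 / 11111 / 00101 / 00111 / 00000",
    "00000 / 01100 / 11111 / 01101 / 00111 / 00000",
    "00000 / 00001 / 11111 / 01001 / 00111 / 00000",
    "00000 / 00110 / 00000 / 11111 / 00111 / 11111",
    "00000 / 10011 / 00000 / 01001 / 00111 / 11111",
    "00000 / 00010 / 00000 / 01101 / 00111 / 11111",
    "00000 / 10011 / 00000 / 00111 / 00111 / 11111",
    "00000 / 11110 / 00000 / 01101 / 00111 / 11111",
    "00000 / 11011 / 00000 / 10101 / 00111 / 11111",
    "00000 / 00010 / 00000 / 01101 / 00111 / 11111",
]


def fingerprint_epiloci(reads):
    """Map each fingerprint epilocus index to its fraction of fully methylated reads."""
    rows = [[pattern.strip() for pattern in read.split("/")] for read in reads]
    n_reads = len(rows)
    result = {}
    for j in range(len(rows[0])):
        column = [row[j] for row in rows]
        width = len(column[0])
        n_full = column.count("1" * width)    # fully methylated patterns
        n_empty = column.count("0" * width)   # fully unmethylated patterns
        # Fingerprint: only the two extreme patterns occur, and both are present.
        if n_full + n_empty == n_reads and n_full > 0 and n_empty > 0:
            result[j] = n_full / n_reads
    return result


if __name__ == "__main__":
    for index, fraction in fingerprint_epiloci(READS).items():
        print(f"epilocus {index}: fraction {fraction:.2f}")
```

On these reads the first, third, and sixth epiloci qualify, while the fifth does not because every read carries the same pattern there and so it cannot separate subclones.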
Fraction of fingerprint