MGT-SM: A Method for Constructing Cellular Signal Transduction - - PowerPoint PPT Presentation

mgt sm a method for constructing cellular signal
SMART_READER_LITE
LIVE PREVIEW

MGT-SM: A Method for Constructing Cellular Signal Transduction - - PowerPoint PPT Presentation

GIW 2016: International Conference on Genome Informatics MGT-SM: A Method for Constructing Cellular Signal Transduction Networks Min Li, Ruiqing Zheng, Yaohang Li, Fang-Xiang Wu, and Jianxin Wang School of Information Science and Engineering,


slide-1
SLIDE 1

MGT-SM: A Method for Constructing Cellular Signal Transduction Networks

GIW 2016: International Conference on Genome Informatics

Min Li, Ruiqing Zheng, Yaohang Li, Fang-Xiang Wu, and Jianxin Wang

School of Information Science and Engineering, Central South University Homepage: http://bioinformatics.csu.edu.cn/

slide-2
SLIDE 2

Outline

■ Background ■ Traditional Granger ■ Method ■ Result & Conclusion

slide-3
SLIDE 3

Background

■ signal transduction network refers to a directed network which composes of molecules and genes involved in signal transduction pathways

slide-4
SLIDE 4

Gene expression data is meaningful data for constructing the network ■ Bayesian Network ■ Mutual Information ■ Ensemble method ■ Regression

Regression-Granger causality is an effective method

Background

v1 v2 vn-1 vn v1 v2 vn-1 vn

Granger Test

v1 v2 v3 vn vn-2 vn-1

Gene Expression Directed Network

v3 v3 vn-2 vn-2

slide-5
SLIDE 5

Traditional Granger

■ Pairwise ■ Multivariate General form ■Step 1. linear regression for coefficient matrix pairwise 𝑧#= ∑

𝛽'𝑧#(' + ∑ 𝛾+𝑦#('

  • +./
  • './

+ 𝑓# , 𝑢 = 𝑟 + 1, … 𝑈

  • with null hypothesis

Multivariate 𝑦',# = ∑

∑ 𝑠

+,#(8𝑦+,#(8

  • 8./

9:+:;

+ 𝑓',# 𝑢 = 𝑟 + 1, … 𝑈, 𝑗 = 1, …, 𝑜

■Step 2. F-test for p-value 𝐺 =

?@@A,B(?@@A /- ?@@A/(E(;-)

F~ freedom (q, m-nq)

slide-6
SLIDE 6

Problem

l Indirect caused by pairwise granger l For real gene expression data,there is sometimes m ≪ nq (n is gene number, m=T-q), traditional granger is not applicable

Traditional Granger

v1 v2 v3 v1 v2 v3

(a)Bivariate Granger (b)Multivariate Granger direct edge indirect edge

slide-7
SLIDE 7

We propose an extended Granger test combining SVD and Monte Carlo Simulation

Step 1 calculate coefficient matrix

n Build 𝑍 = 𝑆𝑌 + 𝐹

𝑌 = 𝑦/,/ ⋯ 𝑦/,#(/ ⋮ ⋱ ⋮ 𝑦;,/ ⋯ 𝑦;,#(/ 𝑍 = 𝑦/,R ⋯ 𝑦/,# ⋮ ⋱ ⋮ 𝑦S,R ⋯ 𝑦;,# n Apply SVD for coefficient matrix R 𝑌 = U ∗ S ∗ 𝑊X 𝑆 Y = 𝑍 ∗ 𝑊 ∗ 𝑇(/∗ 𝑉X

Method

slide-8
SLIDE 8

Step 2 p-value for pairwise <i, j>

Monte Carlo Simulation instead of F-test

Monte Carlo Simulation

In Input:time-series expression data matrix and gene index i Ou Output: the p-value ST STEP 1. Calculate 𝑆𝑇𝑇' based on estimation of R ST STEP 2. Upset the order of the expression of gene j from regression and calculate 𝑆𝑇𝑇',+ ST STEP 3 Repeat step 2 for k times, get the distribution of RSS',+ ST STEP 4 Rank RSS' in RSS',+ in ascending order and calculate the p-value as follows

𝑞 = 𝑠𝑏𝑜𝑙 𝑝𝑔 RSS' /(𝑙 + 1)

Method

MG MGT

  • SM

SM ALGORI RITHM In Input:time-series gene expression data matrix Output:the directed edge with a significant level ST STEP 1. Normalize time-series gene expression data ST STEP 2. Use lmc test to analyze expression’s stationarity, if the expression is nonstationary, use the first order difference. ST STEP 3. Employ SVD to calculate the coefficient matrix ST STEP 4. For gene i, use Monte Carlo to calculate the p-value of edge(i,j) ST STEP 5. Repeat step 4 for all genes, and save the significant edge in a file.

slide-9
SLIDE 9

Results & Conclusion

Genes Samples Time points Real edges

Simulation

5 10 10 7

Yeast Synthetic Network

5 4 17 7 MDA-MB-468 20 4 8 48

The Datasets of the experiment

slide-10
SLIDE 10

n Recall and AUROC for Evaluation n Comparion of Method: CGC2SPR, PGC, DBN(Dynamic Bayesian Network)

Recall = 𝑈𝑄 𝑈𝑄 + 𝐺𝑂

slide-11
SLIDE 11
  • 8. Result

■ SIMULATION DATA 10 samples: all the TOP 7 edges of MGT-SM is real edges

slide-12
SLIDE 12
  • 8. Result

■ Recall and ROC in Yeast Synthetic Network

0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0

(b) Switch Off 2 (a) Switch Off 1

True Positive Rate False Positive Rate

MGT-SM(0.729) DBN(0.563) CGC2SPR(0.635) PGC(0.625)

(d) Switch Off 4 (c) Switch Off 3

True Positive Rate False Positive Rate

MGT-SM(0.677) DBN(0.594) CGC2SPR(0.354) PGC(0.542)

True Positive Rate False Positive Rate

MGT-SM(0.740) DBN(0.625) CGC2SPR(0.396) PGC(0.25)

True Positive Rate False Positive Rate

MGT-SM(0.563) DBN(0.625) CGC2SPR(0.396) PGC(0.25)

slide-13
SLIDE 13
  • 8. Result

■ Recall and ROC in MDA-MB-468

0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0 0.0 0.5 1.0

True Positive Rate False Positive Rate

MGT-SM(0.612) DBN(0.493) CGC2SPR(0.561) PGC(0.554)

True Positive Rate False Positive Rate

MGT-SM(0.539) DBN(0.533) CGC2SPR(0.423) PGC(0.522)

True Positive Rate False Positive Rate

MGT-SM(0.623) DBN(0.493) CGC2SPR(0.437) PGC(0.502)

(d) EGF 20ng (c) EGF 10ng (b) EGF 5ng (a) EGF 0ng

True Positive Rate False Positive Rate

MGT-SM(0.590) DBN(0.494) CGC2SPR(0.450) PGC(0.512)

slide-14
SLIDE 14

Discussion

■ MGT-SM combining SVD and Monte Carlo Simulation has a widely application, no matter scale of the gene set ■ MGT-SM has a better performance than previous methods in signal transduction construction ■ The Granger methods with prior knowledge is a meaningful point in further study.

slide-15
SLIDE 15

Acknowledgment

■ This work was supported in part by the National Natural Science Foundation of China NO.61622213, NO. 61370024, NO. 61232001 and NO. 61428209. ■ Co-Authors:

slide-16
SLIDE 16

Thanks for your attention!