Inferring Time-Delayed Gene Regulatory Networks Presented by: Mina - - PowerPoint PPT Presentation

inferring time delayed gene regulatory networks
SMART_READER_LITE
LIVE PREVIEW

Inferring Time-Delayed Gene Regulatory Networks Presented by: Mina - - PowerPoint PPT Presentation

Inferring Time-Delayed Gene Regulatory Networks Presented by: Mina Moradi Advisor: Dr. Abdollah Homaifar North Carolina A&T State University Dept. of Electrical & Computer Engineering mmoradik@aggies.ncat.edu http://acitcenter.ncat.edu


slide-1
SLIDE 1

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Inferring Time-Delayed Gene Regulatory Networks

Presented by: Mina Moradi Advisor: Dr. Abdollah Homaifar

North Carolina A&T State University

  • Dept. of Electrical & Computer Engineering

mmoradik@aggies.ncat.edu http://acitcenter.ncat.edu February, 17, 2017

1 / 22

slide-2
SLIDE 2

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Outline

1

Introduction and Motivation

2

Literature Review

3

Objective of the Work

4

Proposed Method

5

Simulations and Results

6

Conclusion

2 / 22

slide-3
SLIDE 3

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction and Motivation

Your genes are part of what makes you the person you are. You are different from everyone alive now and everyone who has ever lived. Genes are not independent. Regulate each other and act collectively. Gene regulatory network (GRN) is an abstract mapping of gene regulations in living cells. GRNs identify the specific functional roles of individual genes in cellular systems and can open up a window on the disease progression and drug development.

Therapy: This time it’s personal, Lauren Gravitz, Nature (2014) 3 / 22

slide-4
SLIDE 4

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction and Motivation

DNA microarrays and RNA sequencing technologies measure the expression levels of thousands of genes inside cell in respond to specific environmental conditions [1]. GRN is usually represented by a directed graph, with nodes representing the genes and links representing the regulatory relationships.

4 / 22

slide-5
SLIDE 5

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction and Motivation

Reverse engineering of GRNs is a challenging problem due to: The stochastic characteristics of biological phenomena, the inherent noise of measured gene expression data, and high dimensionality [2]. There is strong non-linearity on temporal patterns of regulatory genes [3]. Genetic interactions among different genes can have different time delays due to the time required for regulatory genes to express their protein products and etc. [4, 5]

5 / 22

slide-6
SLIDE 6

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Literature Review

Boolean networks: Based upon binary outcomes (on and off) for gene expression and therefore lack adequate dynamic resolution [6]. Bayesian networks: Represent probabilistic relationships among genes, the inherent noise and stochasticity of gene expression [7]. Ordinary differential equations: Deterministic models, where interactions among genes represent causal interactions rather than statistical dependencies [8]. TD-ARACNE [9]: The time-delayed dependencies between the genes in terms of mutual information by assuming a stationary Markov Random Field as its underlying probabilistic model. HCC-CLINDE [10]: Infer a time-delayed GRN in the presence of hidden common causes based on either a correlation test or mutual information test.

6 / 22

slide-7
SLIDE 7

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Objective of the Work

Inferring a time-delayed GRN which takes into account the non-linearity of gene interaction and the noise of measurements. RNNs are computational tools for temporal data processing, approximating nonlinear patterns and tolerating noise in measurements. RNNs are usually considered “black box” models. The internal structure and learned parameters are not interpretable [11].

7 / 22

slide-8
SLIDE 8

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Hierarchical Recurrent Neural Network

Proposed a hierarchical RNN (HRNN) that surmounts the interpretation difficulties of the RNNs for modeling of GRN. Time-delayed regulations can be captured through hierarchical paths between leaf nodes (regulatory genes) and a target node (regulated gene) in the HRNN. x1, . . . , xC are context nodes. xC+1, . . . , xC+P are genes.

8 / 22

slide-9
SLIDE 9

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Hierarchical Recurrent Neural Network

A population of candidate HRNNs are randomly generated. The network with c context nodes has c + 1 neurons. In a network with c ≤ C context nodes, the first c context nodes and genes (excluded the target gene) are potential inputs of the neurons. The target gene is the output of the first neuron. The context node ci is the input of neuroni and output of neuroni+1. If input of the neuron is a context node, the weight is positive. xi(t + 1) = f ( ∑

j

wk,j.xj(t)) (1)

9 / 22

slide-10
SLIDE 10

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Hierarchical Recurrent Neural Network

The corresponding hierarchical model which shows the direct regulation of x8 by gene x4, and time-delayed regulations of x7 by genes x4, x5, x7.

10 / 22

slide-11
SLIDE 11

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Representation of the Candidate HRNNs in the GA

Candidate networks in the GA are represented by their number of neurons (Nn), number of inputs to each neuron (Nin), indices of the input nodes (In), weights of the input connections (W ) and the decay rate of the target gene’s expression level (µ) if it exists.

11 / 22

slide-12
SLIDE 12

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Fitness of candidate networks

The performance of the candidate networks (fitness) is evaluated by measuring the trade-off between the goodness of fit and complexity of the model by using the Akaike information criterion (AIC) and the Akaike information criterion with correction (AICc). AIC = n.ln(1 n ∑

l

( ∑

t

(xl

i (t) − ˆ

xl

i (t))2))) + 2k

(2) AICc = AIC + 2k(k + 1)/(n − k − 1) (3) k is the number of leaf nodes in the HRNN and n is the total number of temporal samples for gene expression. If n is small or k is large, the AICc is preferred rather than AIC. As n gets larger, AICc converges to AIC.

12 / 22

slide-13
SLIDE 13

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

The Crossover Operator

Figure: Parent 1 Figure: Parent 2 Figure: Child 1 Figure: Child 2

13 / 22

slide-14
SLIDE 14

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

The Mutation operator

For a mutation site msite in the network, the mutation works as below: If msite is on the number of inputs of a neuron (Nin), it is mutated to Nin = Nin ± 1. Therefore, a new input and its corresponding weight are added or deleted. If msite is on an input connection of a neuron (In), the selected connection is rewired to another node in the network. If msite is on a connection weight of a neuron and input is a context node, the Gaussian mutation evolves the weight in the range of [0, wmax]; else, the weight is mutated in the range of [wmin, wmax]

14 / 22

slide-15
SLIDE 15

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Simulations and Results

The HRNN is evaluated on the GRN of Saccharomyces cerevisiae and nonlinear synthetic generated data for different sizes of networks and variances of noise. The results are compared with TD-ARACNE and HCC-CLINDE in terms of: Links: if and only if both the gene pair and the direction are correct Delays: if and only if both the link and the time delay are correct Effects: if and only if both the link and the sign of an effect are correct For each term, Recall =

TP TP+FN , Precision = TP TP+FP and F-score

= 2×Precision×Recall

Precision+Recall

metrics are computed.

15 / 22

slide-16
SLIDE 16

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

The Effect of Network Size

Figure: 5 genes Figure: 10 genes Figure: 20 genes Figure: 30 genes

16 / 22

slide-17
SLIDE 17

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

The Effect of Noise Level

Figure: σ2 = 0.5 Figure: σ2 = 1.0 Figure: σ2 = 1.5 Figure: σ2 = 2.0

17 / 22

slide-18
SLIDE 18

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Saccharomyces cerevisiae

IRMA is a recent significant contribution to systems biology reported in [12] where the authors built a synthetic network of the yeast organism Saccharomyces cerevisiae.

Figure: True regulations Figure: Proposed method Figure: TD-ARACNE Figure: HCC-CLINDE

18 / 22

slide-19
SLIDE 19

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Saccharomyces cerevisiae

Table: Comparison of Results for GRN Reconstructions of IRMA.

Methods TP FP FN Precision Recall F-score Proposed 6 3 2 0.667 0.75 0.706 TD-ARACNE 2 1 6 0.667 0.25 0.366 HCC-CLINDE 1 3 7 0.25 0.125 0.166

19 / 22

slide-20
SLIDE 20

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Conclusion

In this study, we proposed a hierarchical recurrent neural network approach to identify time-delayed regulatory interactions of genes. The designed HRNN facilitates capturing the paths with different lengths from the leaf nodes in the network to the target node. Hierarchy in the network and possibility of recurrent connections in HRNN provide a capability for modeling the temporal patterns of gene expression. The proposed method outperformed TD-ARACNE and HCC-CLINDE in terms of non-linearity and high level of noise in measurements.

20 / 22

slide-21
SLIDE 21

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

References

[1]

  • C. Sima, J. Hua, and S. Jung, “Inference of gene regulatory networks using time-series data: a survey,” Current Genomics,
  • vol. 10, pp. 416–29, Sept. 2009.

[2]

  • Z. Bar-Joseph, A. Gitter, and I. Simon, “Studying and modelling dynamic biological processes using time-series gene

expression data,” Nature Reviews Genetics, vol. 13, no. 8, pp. 552–564, 2012. [3]

  • J. Hasty, D. McMillen, F. Isaacs, and J. J. Collins, “Computational studies of gene regulatory networks: in numero

molecular biology,” Nat Rev Genet, vol. 2, pp. 268–279, apr 2001. [4]

  • N. Morshed, M. Chetty, and N. Xuan Vinh, “Simultaneous learning of instantaneous and time-delayed genetic interactions

using novel information theoretic scoring technique,” BMC Systems Biology, vol. 6, no. 1, p. 62, 2012. [5]

  • D. Bratsun, D. Volfson, L. S. Tsimring, and J. Hasty, “Delay-induced stochastic oscillations in gene regulation,”

Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 41, pp. 14593–14598, 2005. [6]

  • S. Bornholdt, “Boolean network models of cellular regulation: prospects and limitations,” Journal of the Royal Society

Interface, vol. 5 Suppl 1, pp. S85–S94, 2008. [7]

  • N. Friedman, M. Linial, I. Nachman, and D. Pe’er, “Using Bayesian networks to analyze expression data,” Journal of

Computational Biology, vol. 7, no. 3-4, pp. 601–620, 2000. [8]

  • E. Sakamoto and H. Iba, “Inferring a system of differential equations for a gene regulatory network by using genetic

programming,” in Proceedings of the 2001 Congress on Evolutionary Computation, vol. 1, pp. 720–726 vol. 1, 2001. [9]

  • P. Zoppoli, S. Morganella, and M. Ceccarelli, “TimeDelay-ARACNE: reverse engineering of gene networks from

time-course data by an information theoretic approach,” BMC Bioinformatics, vol. 11, no. 1, p. 154, 2010. [10] L.-Y. Lo, M.-L. Wong, K.-H. Lee, and K.-S. Leung, “Time delayed causal gene regulatory network inference with hidden common causes,” PLoS ONE, vol. 10, pp. 1–47, 09 2015. [11]

  • D. Castelvecchi, “Can we open the black box of AI?,” Nature, vol. 538, no. 7623, p. 20, 2016.

[12]

  • I. Cantone, L. Marucci, F. Iorio, M. A. Ricci, V. Belcastro, M. Bansal, S. Santini, M. di Bernardo, D. di Bernardo, and
  • M. P. Cosma, “A yeast synthetic network for in vivo assessment of reverse-engineering and modeling approaches,” Cell,
  • vol. 137, no. 1, pp. 172 – 181, 2009.

21 / 22

slide-22
SLIDE 22

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Acknowledgment

This work is partially supported by the National Science Foundation (NSF) under Cooperative Agreement No. CCF-1029731. Thank you for your attention

22 / 22