Modelling dynamic networks Regularization of non-homogeneous dynamic - - PowerPoint PPT Presentation

modelling dynamic networks
SMART_READER_LITE
LIVE PREVIEW

Modelling dynamic networks Regularization of non-homogeneous dynamic - - PowerPoint PPT Presentation

Modelling dynamic networks Regularization of non-homogeneous dynamic Bayesian network models by coupling interaction parameters Marco Grzegorczyk Johann Bernoulli Institute (JBI) Rijksuniversiteit Groningen Presentation at the Van Dantzig


slide-1
SLIDE 1

Modelling dynamic networks

Regularization of non-homogeneous dynamic Bayesian network models by coupling interaction parameters

Marco Grzegorczyk Johann Bernoulli Institute (JBI) Rijksuniversiteit Groningen Presentation at the Van Dantzig Seminar VU University Amsterdam 9-Oct-2014

slide-2
SLIDE 2

Cell Biology

Very brief introduction: Each gene is the code for the synthesis of a specific protein. Transcription: gene → mRNA. Translation: mRNA → protein. Proteins are the „functional units“ of the cell. Proteins are enzymes, transription factors, etc.

slide-3
SLIDE 3

Regulatory Network

TF TF

metabolite A metabolite B

G2 G3 G1 P1 P3 P2 G1

mRNA(G1) mRNA(G3) mRNA(G2)

protein 1 is a transcription factor for gene 2 protein 3 is a transcription factor for gene 2 protein 2 is an enzym and catalyses a metabolic reaction

protein level metabolite level gene level

slide-4
SLIDE 4

Microarray Chips

Expressions (activities)

  • f thousands of genes

in an experimental cell can be measured with Microarray Chips.

slide-5
SLIDE 5

TF TF

metabolite A metabolite B

G2 G3 G1 P1 P3 P2 G1

(Gen-)Regulatory Network

gene level protein level metabolite level

slide-6
SLIDE 6

G2 G3 G1 G1

Gen-Regulatory Network

Goal: Learn from gene expression data that gene 1 and gene 3 co-regulate gene 2

Remark: In gene regulatory networks the protein level is ignored. That is, proteins may build complexes with each other or may have to be activated (e.g. phosphorylated) before they can bind to binding sites of genes.

slide-7
SLIDE 7

Cell membran nucleus

Protein activation

phosphorylation

→cell response

P1 P1 phosporylated P3 P3 phosporylated G2

slide-8
SLIDE 8

Cell membran nucleus

Protein activation

phosphorylation

→cell response

P1 P1 phosporylated P3 P3 phosporylated G2

slide-9
SLIDE 9

G2 G3 G1 G1

Gen-Regulatory Network

Goal: Learn from gene expression data that gene 1 and gene 3 co-regulate gene 2

Remark: In gene regulatory networks the protein level is ignored. That is, proteins may build complexes with each other or may have to be activated (e.g. phosphorylated) before they can bind to binding sites of genes.

slide-10
SLIDE 10

G2 G3 G1 G1

Medical relevance e.g. for tumour development

  • - simplified example --

gene1 may be a tumour suppressor gene gene 3 may be an

  • ncogene

gene 2 may cause cell growth and cell divison

Healthy condition

  • strong

inhibition

+

weak activation

cell division is under control

slide-11
SLIDE 11

G2 G3 G1 G1 Tumour cell

  • no more

inhibition

+

strong activation

Altered pathway leads to uncontrolled cell division

gene1 may be a tumour suppressor gene gene 3 may be an

  • ncogene

gene 2 may cause cell growth and cell divison

Medical relevance e.g. for tumour development

  • - simplified example --
slide-12
SLIDE 12
slide-13
SLIDE 13

possibly completely unknown

slide-14
SLIDE 14

possibly completely unknown E.g.: Gene- Microarry experiments data

(expressions of genes)

slide-15
SLIDE 15

possibly completely unknown data data Machine Learning statistical methods E.g.: Gene- Microarry experiments

slide-16
SLIDE 16

Statistical Task

             

nm n n m m

x x x x x x x x x    

2 1 2 22 21 1 12 11

← m cells or time points →

n variables X(1),...X(n)

genes

Extract a network from an n-by-m data matrix

Either m independent (steady-state) observations

  • f the system X(1),…,X(n)

Or time series of the system of length m: (X(1),…,X(n))t=1,…,m

slide-17
SLIDE 17

X(1) X(1) X(2) X(2) X(2) X(3) X(1)

Dynamic Bayesian networks

X(3) X(3)

t t+1

No need for the acyclicity constraint!

Illustration: Simple dynamic Bayesian network (DBN) with three nodes. All interactions are subject to a time delay. recurrent network unfolded dynamic network

slide-18
SLIDE 18

Static/dynamic Bayesian networks

Static Bayesian networks Important feature: Network has to be acyclic Implied factorisation: P(A,B) = P(B|B)·P(A|A,B) Dynamic Bayesian networks Network does not have to be acyclic Implied factorisation: P(A(t),B(t)|A(t-1),B(t-1)) = P(B(t)|B(t-1))·P(A(t)|A(t-1),B(t-1)) (t=2,…,m) cycles cannot make sense

slide-19
SLIDE 19

Example: 4 genes, 10 time points

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1

X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1

X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1

X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1

X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Model assumption: Homogeneous Markov chain

slide-20
SLIDE 20

Impose changepoints to model non-homogeneous processes FIRST SEGMENT SECOND SEGMENT

X(1) X1,1

X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1

X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1

X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1

X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

changepoint

slide-21
SLIDE 21

Changepoint model

Our paradigm: Keep the network topology fixed but the interaction parameters can change with time.

Interaction parameters in the first segment

slide-22
SLIDE 22

Changepoint model

interaction parameters in the second segment

Our paradigm: Keep the network topology fixed but the interaction parameters can change with time.

slide-23
SLIDE 23

Introduce gene-specific changepoints to increase flexibility of the models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1

X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1

X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1

X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1

X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

slide-24
SLIDE 24

Non-Homogeneous Dynamic Bayesian Networks (NH-DBN)

Idea: Combine a standard DBN with a node- specific multiple changepoint process. Lèbre, Becq, Devaux, Lelandais, Stumpf (2010) Statistical inference of the time-varying structure of gene regulation networks BMC Systems Biology Robinson & Hartemink (2010) Learning non-stationary dynamic Bayesian networks Journal of Machine Learning Research

slide-25
SLIDE 25

What is the problem with these approaches?

slide-26
SLIDE 26

Practical problem: inference uncertainty in short time series segments

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1

X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1

X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1

X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1

X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

slide-27
SLIDE 27

Shortcomings

  • 1. Practical problem

Short time series inference uncertainty

  • 2. Methodological problem

Prior independence is biologically implausible Idea: Information coupling among segments

Is it plausible to assume a priori that the segment-specific interaction parameters are independent?

slide-28
SLIDE 28

Non-homogeneous DBN

(uncoupled NH-DBN) Information coupling with respect to the interaction parameters (coupled NH-DBN)

Grzegorczyk and Husmeier (2012a) A non-homogeneous dynamic Bayesian network model with sequentially coupled interaction parameters for applications in systems and synthetic biology. SAGMB Grzegorczyk and Husmeier (2012b) Bayesian regularization of non-homogeneous dynamic Bayesian networks by globally coupling interaction parameters. AISTATS Grzegorczyk and Husmeier (2013) Regularization of Non-Homogeneous Dynamic Bayesian Networks with Global Information-Coupling based on Hierarchical Bayesian models. Machine Learning

slide-29
SLIDE 29

Bayesian regression models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10 X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10 X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10 X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

complete network complete segmentation matrix

X(1) X(3) X(2) X(4)

slide-30
SLIDE 30

Bayesian regression models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10 X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10 X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10 X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

first gene g=1 segmentation of node g=1

X(1) X(3) X(2) X(4)

slide-31
SLIDE 31

Bayesian regression models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1

h=1 h=2

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10 X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10 X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

first gene g=1

X(1) X(3) X(2) X(4)

6

1 , 1   g

 ) ,..., (

6 , 1 2 , 1 1 , 1

X X y

h g

 

) ,..., (

10 , 1 7 , 1 2 , 1

X X y

h g

 

changepoint

This changepoint divides the observations of node X(1) into Kg=1=2 disjunct segments.

slide-32
SLIDE 32

Bayesian regression models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10 X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10 X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10 X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

first gene g=1

X(1) X(3) X(2) X(4)

…and its parent genes π1={2,3}

T h g

X X y ) ,..., (

6 , 1 2 , 1 1 , 1

  T h g

X X y ) ,..., (

10 , 1 7 , 1 2 , 1

 

For both segments h=1 and h=2 determine the observations which belong to the parent nodes of X(1). Note that all interactions are subject to a time lag of size 1.

slide-33
SLIDE 33

Bayesian regression models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10 X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10 X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10 X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

first gene g=1

X(1) X(3) X(2) X(4)

…and its parent genes π1={2,3}

T h g

X X y ) ,..., (

6 , 1 2 , 1 1 , 1

  T h g

X X y ) ,..., (

10 , 1 7 , 1 2 , 1

 

For both segments h=1 and h=2 determine the observations which belong to the parent nodes of X(1). Note that all interactions are subject to a time lag of size 1.

slide-34
SLIDE 34

Bayesian regression models

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10 X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10 X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10 X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10 T h g

X X y ) ,..., (

6 , 1 2 , 1 1 , 1

  T h g

X X y ) ,..., (

10 , 1 7 , 1 2 , 1

 

          

  5 , 3 2 , 3 1 , 3 5 , 2 2 , 2 1 , 2 1 }, 3 , 2 {

1 1 1

1

X X X X X X X

h

  

first gene g=1…

X(1) X(3) X(2) X(4)

…and its parent genes

          

  9 , 3 7 , 3 6 , 3 9 , 2 7 , 2 6 , 2 1 }, 3 , 2 {

1 1 1

1

X X X X X X X

h

  

slide-35
SLIDE 35

For each gene g=1,….,G and each gene-specific segment h=1,…,Kg: Likelihood model: Prior on the regression coefficients wg,h:

target

  • bservations

regressor matrix regression coefficients noise variance noise variance SNR hyperparameter Note that the explicit dependence on the noise variance leads to a fully conjugate prior.

slide-36
SLIDE 36

Graphical representation of the regression models

genes gene-specific segments

slide-37
SLIDE 37

parent sets implied by the network changepoint set (segmentation) fixed hyperparameters segmented data (observed) fixed hyperparameters In the absence of any genuine prior knowledge : mg=0 and Cg,h=I

Graphical representation of the regression models

slide-38
SLIDE 38

density of fixed hyperparameters fixed hyperparameters In the absence of any genuine prior knowledge : mg=0 and Cg,h=I segmented data (observed) parent sets implied by the network changepoint set (segmentation) SNR parameter

slide-39
SLIDE 39

Are these hyperparameters actually known?

Graphical representation of the regression models

slide-40
SLIDE 40

parent sets (networks) must be infered

Graphical representation of the regression models

slide-41
SLIDE 41

Graphical model representation

changepoint sets must be infered

slide-42
SLIDE 42

Graphical model representation

With a fixed hyerparameter mg there is no information coupling between the segment-specific regression coefficients.

slide-43
SLIDE 43

With a fixed hyerparameter mg there is no information coupling between the segment-specific regression coefficients. density of mg fixed

slide-44
SLIDE 44

Graphical model representation

Main idea from: Grzegorczyk and Husmeier (2012b) Bayesian regularization of non- homogeneous dynamic Bayesian networks by globally coupling interaction parameters. AISTATS

slide-45
SLIDE 45

Graphical model representation

Main idea from: Grzegorczyk and Husmeier (2012b) Bayesian regularization of non- homogeneous dynamic Bayesian networks by globally coupling interaction parameters. AISTATS

mg variable so that the segment-specific regression coefficients are coupled → information exchange among segments density of

slide-46
SLIDE 46

RJMCMC inference Part 1 of 3

  • 2. Regression coefficients:
  • 1. Noise variances:
  • 3. Coupling hyperparameters:

can be sampled with standard collapsed and uncollapsed Gibbs sampling steps That is, sample each variable from the conditional distribution, conditional on its Markov blanket. Conjugate prior distributions: sampling from standard distributions Collapsing: integrate some variables in the Markov blanket out analytically

slide-47
SLIDE 47
  • 4. Network inference by a Metropolis Hastings sampling scheme, which

changes the network by adding and removing individual edges:

RJMCMC inference Part 2 of 3

  • 5. Changepoint inference by a Metropolis Hastings sampling scheme, which

changes the segmentation by adding and removing gene-specific changepoints: marginal likelihoods can be computed in closed form: marginal likelihoods can be computed in closed form: network prior changepoint prior

slide-48
SLIDE 48

RJMCMC inference Part 3 of 3

  • 6. The global mean vector mg can be sampled with a collapsed Gibbs sampling steps:

with the sufficient statistics: Overall sampling scheme:

“Metropolis-Hastings-RJMCMC scheme within a partially collapsed Gibbs sampler”

slide-49
SLIDE 49

Empirical comparison: (1) globally coupled NH-DBN

slide-50
SLIDE 50

Empirical comparison: (2) uncoupled NH-DBN

We set: mg=0.

slide-51
SLIDE 51

Empirical comparison: (3) Homogeneous DBN

Standard homogeneous dynamic Bayesian network (DBN). There are no changepoints; i.e. There is only one segment for each gene(Kg=1).

slide-52
SLIDE 52

Empirical comparison: (4) Sequentially coupled NH-DBN

For h≥2: The prior expectation of the regression coefficients for segment h+1, mg,h, depends on the posterior distribution of the regression coefficients wg,h for segment h. The coupling strength depends on the hyperparameter λg.

Main idea from: Grzegorczyk and Husmeier (2012a) A non-homogeneous dynamic Bayesian network model with sequentially coupled interaction parameters for applications in systems and synthetic biology. SAGMB

slide-53
SLIDE 53

Information coupling

Sequential coupling

  • Information is shared

between neighbouring segments

  • For example:

morphogenesis Global coupling

  • Segments are treated as

interchangeable and information is shared globally

  • For example:

different experimental scenarios or environmental conditions

h=1 h=2 h=3 h=2 h=1 h=3

slide-54
SLIDE 54

Empirical evaluation

  • 1. Simulated data

Known gold standard Simulation process does not reflect real biology

  • 2. Data from synthetic biology

Known gold standard Real wet lab data Regulatory network small

  • 3. Data from a real application

Real wet lab data No gold standard

slide-55
SLIDE 55

Empirical evaluation

  • 1. Simulated data

Known gold standard Simulation process does not reflect real biology

  • 2. Data from synthetic biology

Known gold standard Real wet lab data Regulatory network small

  • 3. Data from a real application

Real wet lab data No gold standard

slide-56
SLIDE 56

true network extracted network biological knowledge (gold standard network)

Evaluation of learning performance

Reconstruction Accuracy

slide-57
SLIDE 57

Example: 2 genes  16 different (dynamic) network structures

Best network: maximum score

slide-58
SLIDE 58

Ideal scenario: Large data sets, low noise Identify the best network structure

P(graph|data)

M*

slide-59
SLIDE 59

Realistic: Limited number of experimental replications, high noise Uncertainty about the best network

P(graph|data)

slide-60
SLIDE 60

Sample of high-scoring networks

P(graph|data)

slide-61
SLIDE 61

Idea: Model Averaging Compute marginal posterior probabilities of the edges MCMC sample

  • f high-scoring

networks

slide-62
SLIDE 62

data

Probabilistic inference

true regulatory network

Thresholding

marginal edge posterior probabilities TP:1/2 FP:0/4 TP:2/2 FP:1/4

concrete network predictions

low high MCMC

slide-63
SLIDE 63

From Perry Sprawls

slide-64
SLIDE 64

From Perry Sprawls

slide-65
SLIDE 65

From Perry Sprawls

AREA UNDER THE ROC CURVE

slide-66
SLIDE 66
  • 1. Simulated data
slide-67
SLIDE 67
slide-68
SLIDE 68
slide-69
SLIDE 69

Generate data sets with 4 segments h=1,…,4 and 10 observations per segment. Use three noise levels (SNR=10, 3, and 1) Use the parameter ε to vary the similarity of the segment-specific interaction parameters. ε=0 -> homogeneous data … ε=1 -> non-homogeneous data

slide-70
SLIDE 70

More homogeneous Less homogeneous

AUC for SNR=3

slide-71
SLIDE 71

AUC for SNR=3

slide-72
SLIDE 72

AUC difference: coupled NH-DBN – homogeneous DBN Less homogeneous

slide-73
SLIDE 73

AUC difference: coupled NH-DBN – uncoupled NH-DBN More homogeneous

slide-74
SLIDE 74
slide-75
SLIDE 75
slide-76
SLIDE 76
  • 2. Data from

synthetic biology

Synthetic network in yeast, as designed in Cantone et

  • al. (2009)

Carbon-source switch from galactose to glucose in vivo gene expression levels measured with RT- PCRat 37 time points (in two mediums)

GALACTOSE GLUCOSE

slide-77
SLIDE 77

AUC score comparison sequentially coupled NH-DBN versus uncoupled NH-DBN for different changepoint prior hyperparameters (different numbers of changepoints per gene)

slide-78
SLIDE 78

AUC score comparison globally coupled NH-DBN versus uncoupled NH-DBN for different changepoint prior hyperparameters (different numbers of changepoints per gene)

slide-79
SLIDE 79

AUC score comparison of all three NH-DBNs

slide-80
SLIDE 80

Circadian regulation in Arabidopsis

  • 3. Data from a real application
slide-81
SLIDE 81

Collaboration with the Institute of Molecular Plant Sciences at

Edinburgh University 4 time series of microarray gene expression data from Arabidopsis thaliana.

  • Focus on: 9 circadian genes:

LHY, CCA1, TOC1, ELF4, ELF3, GI, PRR9, PRR5, and PRR3

  • The four time series were measured under constant light condition

at 13 time points: 0h, 2h,…, 24h, 26h

  • Seedlings entrained with light:dark cycles of different periods

Circadian rhythms in Arabidopsis thaliana

slide-82
SLIDE 82
slide-83
SLIDE 83
slide-84
SLIDE 84

Thank you for your attention!

Any questions?