Rule-based Modeling Bill Hlavacek Theoretical Division Los Alamos - - PowerPoint PPT Presentation

rule based modeling
SMART_READER_LITE
LIVE PREVIEW

Rule-based Modeling Bill Hlavacek Theoretical Division Los Alamos - - PowerPoint PPT Presentation

Rule-based Modeling Bill Hlavacek Theoretical Division Los Alamos National Laboratory Who is this? http://bionetgen.org Cellular regulatory systems are complex Akhilesh Pandey (Johns Hopkins) Value added by modeling 1. We can use models to


slide-1
SLIDE 1

Rule-based Modeling

Bill Hlavacek

Theoretical Division Los Alamos National Laboratory

slide-2
SLIDE 2

Who is this?

http://bionetgen.org

slide-3
SLIDE 3

Cellular regulatory systems are complex

Akhilesh Pandey (Johns Hopkins)

slide-4
SLIDE 4

Value added by modeling

  • 1. We can use models to organize information about a

system with precision

  • 2. We can determine the logical consequences of a model

specification

slide-5
SLIDE 5

Outline

  • 1. Combinatorial complexity
  • 2. The conventional approach to modeling
  • 3. The rule-based approach to modeling
  • 4. Tools
  • 5. New simulation methods
slide-6
SLIDE 6

Signaling proteins contain domains and motifs that mediate interactions with other proteins

Syk Lyn FcεRI Transmembrane Adaptors

slide-7
SLIDE 7

Multiplicity of sites and binding partners gives rise to combinatorial complexity

Epidermal growth factor receptor (EGFR)

slide-8
SLIDE 8

Multiplicity of sites and binding partners gives rise to combinatorial complexity

Epidermal growth factor receptor (EGFR) 9 sites ⇒ 29=512 phosphorylation states

slide-9
SLIDE 9

Multiplicity of sites and binding partners gives rise to combinatorial complexity

Epidermal growth factor receptor (EGFR) 9 sites ⇒ 29=512 phosphorylation states Each site has ≥ 1 binding partner ⇒ more than 39=19,683 total states

slide-10
SLIDE 10

Multiplicity of sites and binding partners gives rise to combinatorial complexity

Epidermal growth factor receptor (EGFR) 9 sites ⇒ 29=512 phosphorylation states Each site has ≥ 1 binding partner ⇒ more than 39=19,683 total states EGFR must form dimers to become active ⇒ more than 1.9× 108 states

slide-11
SLIDE 11

Signaling proteins typically contain multiple phosphorylation sites

> 50% are phosphorylated at 2 or more sites

Source: Phospho.ELM database v. 3.0 (http://phospho.elm.eu.org)

slide-12
SLIDE 12

Oligomerization alone can generate many complexes

A hexamer of death domains Weber and Vincenz (2001) FEBS Lett.

Complexes potentially involved in Toll-like receptor signaling

Complexes of TIR domains C.-T. Tung (Los Alamos)

slide-13
SLIDE 13

The problem of combinatorial complexity necessitates a new modeling approach

  • Inside a Chemical Plant

– Large numbers of molecules… – …of a few types – Conventional modeling works fine

  • Inside a Cell

– Small numbers of molecules… – …of many possible types – Rule-based modeling addresses this situation

slide-14
SLIDE 14

The need for predictive models of large scale with site-specific details

  • Molecular changes that affect cell signaling cause disease

(cancer)

  • Over 200 drugs that target malfunctioning signaling proteins are

currently in clinical trials

– One spectacular success (Gleevec) – But results are largely disappointing for most patients

  • 96 clinical trials are underway to test combinations of drugs

(clinicaltrials.gov)

– There are too many combinations to consider all of them in trials

slide-15
SLIDE 15

Outline

  • 1. The biochemistry of cell signaling and combinatorial

complexity

  • 2. The conventional approach to modeling
  • 3. The rule-based approach to modeling
  • 4. Tools
  • 5. New simulation methods
slide-16
SLIDE 16

Models can be specified in different ways

slide-17
SLIDE 17

Rules representing molecular interactions allow for compact model specifications

Science’s STKE re6 (2006)

slide-18
SLIDE 18

Early events in EGFR signaling - we’ll consider these events to illustrate modeling approaches

EGF = epidermal growth factor EGFR = epidermal growth factor receptor

  • 1. EGF binds EGFR

EGFR EGF ecto

slide-19
SLIDE 19

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF dimerization

  • 2. EGFR dimerizes
slide-20
SLIDE 20

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P Y1092 Y1172

slide-21
SLIDE 21

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P

  • 4. Grb2 binds phospho-EGFR

Grb2 Y1092 SH2

Grb2 pathway

slide-22
SLIDE 22

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P

  • 4. Grb2 binds phospho-EGFR

Grb2 Y1092 SH3

  • 5. Sos binds Grb2 (Activation Path 1)

Sos

Grb2 pathway

slide-23
SLIDE 23

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P

  • 4. Shc binds phospho-EGFR

Y1172 Shc PTB

Shc pathway

slide-24
SLIDE 24

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P

  • 4. Shc binds phospho-EGFR

Y1172 Shc Y317

  • 5. EGFR transphosphorylates Shc

P

Shc pathway

slide-25
SLIDE 25

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P

  • 4. Shc binds phospho-EGFR

Y1172 Shc

  • 5. EGFR transphosphorylates Shc

P

  • 6. Grb2 binds phospho-Shc

Grb2 SH2

Shc pathway

slide-26
SLIDE 26

Early events in EGFR signaling

  • 1. EGF binds EGFR

EGFR EGF

  • 2. EGFR dimerizes
  • 3. EGFR transphosphorylates itself

P P P P

  • 4. Shc binds phospho-EGFR

Y1172 Shc

  • 5. EGFR transphosphorylates Shc

P

  • 6. Grb2 binds phospho-Shc

Grb2 SH3

  • 7. Sos binds Grb2 (Activation Path 2)

Sos

Shc pathway

slide-27
SLIDE 27

Representation of molecules in a simple model

  • f early events in EGFR signaling

EGF(r) EGFR(l,d,Y1092~U~P,Y1172~U~P) Shc(PTB,Y317~U~P) Grb2(SH2,SH3) Sos(PR)

Blinov et al. (2006)

slide-28
SLIDE 28

Combinatorial complexity of early events

Monomeric species

EGFR

2 states

  • r
slide-29
SLIDE 29

Combinatorial complexity of early events

Monomeric species

EGFR

2 states 4 states

P Sos P Grb2 P

  • r
  • r
  • r
slide-30
SLIDE 30

Combinatorial complexity of early events

Monomeric species

EGFR

2 states 4 states 6 states

P P Sos P P Grb2 P P P Shc P

  • r
  • r
  • r
  • r
  • r
slide-31
SLIDE 31

Combinatorial complexity of early events

Monomeric species

EGFR

2 states 4 states 6 states

48 species

slide-32
SLIDE 32

Combinatorial complexity of early events

Monomeric species

EGFR

2 states 4 states 6 states

48 species Dimeric species

EGF

24 states

N× (N+1)/2 = 300 species

slide-33
SLIDE 33

A reaction-scheme diagram

Species: One for every possible modification state of every complex Reactions: One for every transition among species

This scheme can be translated to obtain a set of ODEs, one for each species

slide-34
SLIDE 34

A conventional model for EGFR signaling

The Kholodenko model*

5 proteins 18 species 34 reactions *J. Biol. Chem. 274, 30169 (1999)

slide-35
SLIDE 35

Assumptions made to limit combinatorial complexity

1. Phosphorylation inhibits dimer breakup No modified monomers

P P P

Bottleneck for dimers

slide-36
SLIDE 36

Assumptions made to limit combinatorial complexity

2. Adaptor binding is competitive No dimers with more than one associated adapter

P P P P P

slide-37
SLIDE 37

Outline

  • 1. The biochemistry of cell signaling and combinatorial

complexity

  • 2. The conventional approach to modeling
  • 3. The rule-based approach to modeling
  • 4. Tools
  • 5. New simulation methods
slide-38
SLIDE 38

Rules operate on structured objects (graphs)

Graphs represent molecules, their component parts, and states A (graph-rewriting) rule specifies the addition or removal of an edge to represent binding or unbinding, or the change of a state label to represent, for example, post-translational modification of a protein at a particular site A model specification is readily visualized and compositional Molecules, components, and states can be directly linked to annotation in databases

Ty Thomson (MIT) - yeastpheromonemodel.org

slide-39
SLIDE 39

Proteins in a model are introduced with molecule templates

Shc Grb2 PTB EGF EGFR

Y1172 Y1092 CR1 L1

Molecule templates

Y317 SH2 SH3 Sos

Nodes represent components of proteins Components may have attributes:

P

  • r
slide-40
SLIDE 40

Complexes are connected instances of molecule templates

P P P P P

Edges represent bonds between components

Bonds may be internal or external

An EGFR dimer

slide-41
SLIDE 41

Patterns select sets of chemical species with common features

P EGFR

Y1092

selects

P P P

suppressed components don’t affect match

P P P P P P P

, , , , … Pattern that selects EGFR phosphorylated at Y1092. twice

inverse indicates any bonding state

slide-42
SLIDE 42

BioNetGen language provides explicit representation of molecules and interactions

A

b Y1

B

A(b,Y1) B(a) Molecules are structured objects (hierarchical graphs) Rules define interactions (graph rewriting rules)

A B

+

k+1 k-1

A B

A(b) + B(a) <-> A(b!1).B(a!1) kp1,km1 a bond between two components

a Faeder et al., Proc. ACM Symp. Appl. Computing (2005)

BNGL: BNGL:

slide-43
SLIDE 43

Rules generate events

Example of reaction generation:

A B

+

k+1

A B

Rule1 Rule1 applied to

A

b Y1

B

a

{ }

A

b Y1

B

a

k+1

+ generates Reaction1

1 2 1 2 3

slide-44
SLIDE 44

Reaction rules, composed of patterns, generalize reactions

EGF binds EGFR

+

EGFR L1 EGF CR1

k+1 k-1 Patterns select reactants and specify graph transformation

  • Addition of bond between EGF and EGFR
slide-45
SLIDE 45

Rule-based version of the Kholodenko model

  • 5 molecule types
  • 23 reaction rules
  • No new rate parameters (!)

18 species 34 reactions 356 species 3749 reactions Blinov et al. Biosystems 83, 136 (2006).

slide-46
SLIDE 46

Dimerization rule eliminates previous assumption restricting breakup of receptors

+ k+2 k-2

EGFR EGF dimerization

Dimers form and break up independent of phosphorylation of cytoplasmic domains

EGFR dimerizes (600 reactions)

slide-47
SLIDE 47

Outline

  • 1. The biochemistry of cell signaling and combinatorial

complexity

  • 2. The conventional approach to modeling
  • 3. The rule-based approach to modeling
  • 4. Tools
  • 5. New simulation methods
slide-48
SLIDE 48

BioNetGen2: Software for graphical rule-based modeling

Reaction network

Differential Equations (ODE’s) Stochastic Simulation (SSA) Timecourse of Observables

Rule Evaluation BNGL file RuleBuilder (optional) Rules are applied iteratively. SBML file

Graphical interface for composing rules Text-based language Simulation engine

slide-49
SLIDE 49

BNGL: A textual language for graphical rules

L(r) + R(l,d) <-> L(r!1).R(l!1,d) kp1, km1

slide-50
SLIDE 50

BNGL: A textual language for graphical rules

L(r) + R(l,d) <-> L(r!1).R(l!1,d) kp1, km1

reactant patterns product pattern rate law(s)

molecule components (unbound) a bond

slide-51
SLIDE 51

Graphical Interface to BioNetGen

Greatly simplifies construction, visualization, and simulation

  • f complex models
slide-52
SLIDE 52

We can take advantage of collective intelligence to build large-scale models

One model currently under construction incorporates approximately 20 proteins involved in EGFR signaling

(EGF, HRG, EGFR, ErbB2, ErbB3, ErbB4, Shc, Grb2, Sos1, Gab1, PI3K, Akt, Ras, Raf, MEK, ERK)

And approximately 1,000 annotated rules capturing the site- specific details of protein-protein interactions The model is being built by a small research team, mostly students, led by Richard G. Posner (TGen)

slide-53
SLIDE 53

Outline

  • 1. The biochemistry of cell signaling and combinatorial

complexity

  • 2. The conventional approach to modeling
  • 3. The rule-based approach to modeling
  • 4. Tools
  • 5. New simulation methods
slide-54
SLIDE 54

Rule-based models can be difficult to simulate

Rule-based models may encompass a large or even an unbounded number of species Computational costs for standard simulation methods increase with number of species and reactions in a model Parameter estimation and data fitting require running model simulations for a large number of parameter sets We need a simulation method that is independent of the size of the reaction network implied by rules

slide-55
SLIDE 55

The system: interaction of a trivalent ligand with a bivalent cell-surface receptor

NH O N H O O O O O O H N O O O H N O O O O O O O N H N H O2N NO2 H N NO2 O2N N H O2N NO2 O O O O O O OH OH

R.G. Posner (TGen)

slide-56
SLIDE 56

Rule-based model specification corresponding to equilibrium model of Goldstein and Perelson (1984)

Equivalent-site model No cyclic aggregates

slide-57
SLIDE 57

Generate-first method of simulation

  • 1. Define seed species
  • 2. Determine if a pattern in a rule matches any species

If so, apply the transformation defined in the rule

  • 3. Iteratively apply rules to new product species
  • 4. Simulate using conventional methods once network has

been generated

slide-58
SLIDE 58

Seed species

Ligand Receptor

slide-59
SLIDE 59

After first round of rule application

slide-60
SLIDE 60

After the second round of rule application

slide-61
SLIDE 61

Rule-derived network is too large to simulate using conventional population-based methods

fi

Two rules generate a vast number of chemical species and reactions

slide-62
SLIDE 62

Rule-based KMC Method

  • 1. Instantiate molecules with components and states.
  • 2. Determine cumulative rate for each reaction type, am
  • 3. Select next reaction time,
  • 4. Select next reaction type using
  • 5. Select reactant molecules and components.
  • 6. Update reaction type rates. Iterate.

a j

j=1 J -1

Â

< r

2atot £

a j

j=1 J

Â

Dt = -ln r

1

( ) / atot

(Particle-based version of Gillespie’s Direct Method with rules)

slide-63
SLIDE 63

Kinetics of aggregate formation

At high ligand concentration, small aggregates form transiently

Ligand concentration fgel

Equilibrium

slide-64
SLIDE 64

Performance