Sloppiness in Multiparameter Models Bryan Daniels July 28, 2008 - - PowerPoint PPT Presentation

sloppiness in multiparameter models
SMART_READER_LITE
LIVE PREVIEW

Sloppiness in Multiparameter Models Bryan Daniels July 28, 2008 - - PowerPoint PPT Presentation

Sloppiness in Multiparameter Models Bryan Daniels July 28, 2008 Jim Sethna Cornell University Motivation Biological models have lots of parameters, and they control the output in complex ways. It's often hard to measure these


slide-1
SLIDE 1

Sloppiness in Multiparameter Models

Bryan Daniels July 28, 2008

Jim Sethna – Cornell University

slide-2
SLIDE 2

Motivation

› Biological models have lots

  • f parameters, and they

control the output in complex ways. › It's often hard to measure these parameters. › How does this affect model predictions? What predictions can we trust?

Brown et al., Phys. Biol. 1: 184-195

slide-3
SLIDE 3

Cost Landscape

› A set of parameters θ has a cost based on how well the model fits measured data. › We usually use a squared residuals cost.

C =∑

i=1 m

yiti ,−d i i

2

Measured value Model value Measurement uncertainty

slide-4
SLIDE 4

Cost Landscape

› Locally around the best-fit point we can approximate the cost as quadratic. › The matrix of 2nd derivatives (Hessian) gives us the quadratic expansion.

Gutenkunst et al., PLoS Comput Biol 3(10): e189 (2007)

slide-5
SLIDE 5

Making Sensible Error Bars

› Most thorough method: Bayesian analysis using Monte Carlo sampling

1.Sample from all parameter sets

that fit the data;

2.Find prediction output from

each;

3.Calculate mean, standard

deviation, etc.

› Example: Brown et al., Phys. Biol. 1: 184-195

Figure courtesy Ryan Gutenkunst

“Stat mech in model space”

slide-6
SLIDE 6

Making Sensible Error Bars

Figures courtesy Ryan Gutenkunst

slide-7
SLIDE 7

Making Sensible Error Bars

Figures courtesy Ryan Gutenkunst

slide-8
SLIDE 8

“Sloppiness”

› Nonlinear multiparameter models are “sloppy”: orders

  • f magnitude

more sensitive to changes in certain directions in parameter space.

Gutenkunst et al., PLoS Comput Biol 3(10): e189

slide-9
SLIDE 9

Measuring Sloppiness

› The eigenvalues of the Hessian tell you about sensitivity along eigendirections in parameter space › Produces a “sensitivity spectrum” 1) Define cost – usually squared residuals 2) Find Hessian (2nd derivative matrix

  • f cost)
slide-10
SLIDE 10

› Hallmarks of sloppiness:

1.Large range of

eigenvalues

2.Eigenvalues

roughly evenly spaced in log space

Gutenkunst et al., PLoS Comput Biol 3(10): e189

slide-11
SLIDE 11

Implications of Sloppiness

› Large range means cost contour ellipsoids are routinely stretched by a factor of 1000 (the aspect ratio of a human hair). › Even spacing means there is no well-defined cutoff between “important” and “unimportant”.

slide-12
SLIDE 12

Universality of sloppiness

› Sloppiness has been found in every biological system analyzed (17 so far), and more:

  • Interatomic potentials, particle accelerator

design, sums of exponentials...

› May be a “universal” feature of nonlinear multiparameter fitting problems.

slide-13
SLIDE 13

Parameter Uncertainty is Inevitable

› Sloppiness provides an answer for why fits can lead to large uncertainties in parameter values. › Large parameter uncertainty does not imply large uncertainty in predictions.

slide-14
SLIDE 14

Uncertainties

Slide courtesy Ryan Gutenkunst

slide-15
SLIDE 15

Uncertainties

Slide courtesy Ryan Gutenkunst

slide-16
SLIDE 16

Uncertainties

Slide courtesy Ryan Gutenkunst

slide-17
SLIDE 17

Uncertainties

Slide courtesy Ryan Gutenkunst

slide-18
SLIDE 18

Sloppiness is “Real”

› Is it due to too few data points?

  • No; even for data the model can fit perfectly,

sloppiness persists.

› Is it due to the local approximation?

  • No; principal component analysis of Monte

Carlo ensembles still displays sloppiness.

slide-19
SLIDE 19

SloppyCell

› Computing environment for simulating and analyzing biochemical networks (or any system of ODEs) › Structure for optimization and efficient calculation of ensembles › Supports Systems Biology Markup Language (SBML) › Implemented in Python

slide-20
SLIDE 20

Is robustness a delicate balancing act?

› A subset of the circadian rhythm network in cyanobacteria › The phosphorylation decay rate is measured to be robust to temperature change, even when individual (de)phosphorylation rates would double

slide-21
SLIDE 21

Is robustness a delicate balancing act?

› A subset of the circadian rhythm network in cyanobacteria › The phosphorylation decay rate is measured to be robust to temperature change, even when individual (de)phosphorylation rates would double

slide-22
SLIDE 22

Is robustness a delicate balancing act?

› A subset of the circadian rhythm network in cyanobacteria › The phosphorylation decay rate is measured to be robust to temperature change, even when individual (de)phosphorylation rates would double

slide-23
SLIDE 23

Is robustness a delicate balancing act?

slide-24
SLIDE 24

Robustness and Evolvability

slide-25
SLIDE 25

Robustness and Evolvability

› Robustness: What fraction of a given volume in parameter space keeps the output reasonably constant? › Evolvability: With a selection pressure to move in a certain direction in residual (output) space, how far can I move in that direction by varying my parameters by a fixed amount?

slide-26
SLIDE 26

Robustness and Evolvability

› Individual evolvability decreases with robustness in example biological model

slide-27
SLIDE 27

Individual and Population Evolvability

› Sloppiness may increase the variety of behaviors available to a population through mutation

slide-28
SLIDE 28

Future work?

› Use information from multiple experiments / multiple systems to create ensembles › Network structure

1.Can we vary uncertain network connections in

a similar way as parameters?

2.Can we predict which experimental data would

best constrain the network structure?

slide-29
SLIDE 29

Conclusions

› Varying parameters provides important information about model uncertainty. › Sloppiness is a common feature in large multiparameter models.

1.Precise measurements of constants are not as

important; instead optimize experiments to provide well-constrained predictions.

2.Sloppiness can have important implications for

robustness and evolvability.

slide-30
SLIDE 30

Thanks!

› Jim Sethna › Ryan Gutenkunst › Chris Myers › YJ Chen › Ben Machta › Mark Transtrum

slide-31
SLIDE 31
slide-32
SLIDE 32

Mapping parameters to residuals

Parameter space Residual (output) space

slide-33
SLIDE 33

Universality of Sloppiness

Systems Bio QMC Radioactivity Many exps GOE Product Fitting plane Monomials

Eigenvalue

NOT SLOPPY

Figure courtesy Jim Sethna

slide-34
SLIDE 34

Making Sensible Error Bars

› Quickest method: Linear covariance analysis › Two approximations:

1.Quadratic expansion around best-fit 2.Linear response of output to changes in

parameters

slide-35
SLIDE 35

Experimental Optimization

› We can ask what new measurement will reduce the uncertainty

  • f a specific output.

› Example: Adding a single measurement of a different protein concentration. › Must use linear approx.

Before After

Fergel Casey et al., IET Sys. Biol. 1 (3), 190 (2007)

slide-36
SLIDE 36

Sloppiness

› What does understanding sloppiness buy you? How can you use these ideas?

1.More efficient Monte Carlo sampling of

parameter space;

2.Hints at the most important reactions in a

network;

3.An appreciation of the futility of thinking in

terms of individual parameters;

4.Model simplifications?

slide-37
SLIDE 37

More Efficient Sampling

› Using a Metropolis Monte Carlo algorithm, we can make use of our knowledge about the local shape of the cost function. › Big steps in sloppy directions, small steps in stiff directions.

bare eigen

Figure courtesy Jim Sethna

slide-38
SLIDE 38

Stiffest Directions

› The parameters with large components in the stiffest eigenvectors are in some sense more important.

* * * * *

stiffest

* * * *

2nd stiffest Ras Raf1

slide-39
SLIDE 39

Current Projects

› SloppyCell › Origin of sloppiness › Curved manifolds: connections to GR? › Model simplification? › New systems to implement

slide-40
SLIDE 40

Where does sloppiness come from?

› Related to the interchangeability of different sets of parameters › Using the wrong parameterization › Example: fit polynomial function on [0,1]

monomials: sloppy Legendre polynomials: not sloppy

slide-41
SLIDE 41

Origin of Sloppiness

› With two idealizations, get “perfect” sloppiness:

1.Parameters are exactly interchangable. 2.Parameters are nearly degenerate.

            =

d N d d N

V ε ε ε ε ε ε       

2 1 2 1

1 1 1

AV A V H

T T

=

2 / ) 1 (

) ( ) det(

− <

∝ − =∏

N N j i j i

V ε ε ε

slide-42
SLIDE 42

Curved Manifolds

› Anharmonic effects seem to be important. › To efficiently explore parameter space, we may need curved coordinates.

Sloppy direction Stiff direction 10-6

  • 10-6
  • 10-3

10-3 fits of decaying exponentials

Figure courtesy Jim Sethna

slide-43
SLIDE 43

Curved Manifolds

› Example: sum of exponentials problem

slide-44
SLIDE 44

Model Simplification

› Figure: Correlated parameter clusters › When sets of parameters have the same effect on

  • utput:

1.We see sloppiness; 2.It suggests we could

simplify the model...

Data point i Parameter j

Figure courtesy Josh Waterfall/Jim Sethna

slide-45
SLIDE 45

New Systems to Implement

› To avoid getting too abstract, we are on the lookout for real-world problems to implement...

1.Climate modeling 2.Economic models 3.Physics models (CMB, accelerator design) 4.Other systems biology problems

slide-46
SLIDE 46

Conclusions

› Varying parameters provides important information about model uncertainty. › Sloppiness is a common feature in large multiparameter models.

1.Precise measurements of constants are not as

important; instead optimize experiments to provide well-constrained predictions.

2.Simplification schemes may be fruitful.