USING GENOMIC-BASED
INFORMATION FOR THE MODELING OF BACTERIAL ENVIRONMENTS AND LIFESTYLE
Omer Eilam Eytan Ruppin School of Computer Sciences Tel Aviv University
January 2010
W HY STUDY METABOLISM ? 1. Its the essence of life 2. Tremendous - - PowerPoint PPT Presentation
U SING GENOMIC - BASED INFORMATION FOR THE MODELING OF BACTERIAL ENVIRONMENTS AND LIFESTYLE Omer Eilam Eytan Ruppin School of Computer Sciences Tel Aviv University January 2010 W HY STUDY METABOLISM ? 1. Its the essence of life 2.
Omer Eilam Eytan Ruppin School of Computer Sciences Tel Aviv University
January 2010
a.
b.
Species evolve to adapt to their environment.
Environment/lifestyle Phenotype Genotype Can we use the genotype to predict the lifestyle of a species?
Metabolic information Genomic information
Gene Enzyme Metabolite
Hundreds of fully sequenced bacterial species Genomic information Metabolic information
From Borenstein et al, PNAS 2008 Based on the network topology, identifying the set of compounds that are exogenously acquired Internal metabolite External metabolite
Metabolic information Predicted metabolic environments Internal Metabolite External Metabolite
Metabolic information Internal Metabolite External Metabolite Essential Biomass Component
Viable Not viable Environmental viability matrix
Env 1 Env 2Env N Spc N Spc 1 Spc 2
Information on species Information on environment
Metabolic networks Environments Species Genomes
lifestyle of a species based on Genomic attributes?
(Freilich et al, Genome Biology 2009)
strategies based on genomic attributes?
(Freilich et al, Genome Biology 2009)
metabolic network reflect adaptation to species’ lifestyle?
(Freilich et al, PLoS Comp Biol 2010)
communities based on genomic attributes?
(Freilich et al, Under revision)
Metabolic networks Environments Species Genomes
lifestyle of the species based on Genomic attributes? Can we predict, based on genomic knowledge, whether a species is a specialist or generalist? Can we estimate the range of environments it can inhabit?
Specific examples : Pseudomonas aeruginosa Desulfotalea psychrophila
Genomic- based predicted environments
NCBI annotations Available systematic estimates/information for environmental variability Fraction of reg. genes
Multiple Specialized High Low
Beyond specific examples: strong correlations (>0.3) between the metabolic-environment variability and established measures of environmental variability
Metabolic networks Environments Species Genomes
ecological attributes based
Can we predict the level of competition a species encounters in its natural environments and its rate of growth?
Viable Not viable
Environmental Viability Matrix
Env 1 Env 2 Env 3 Spc 3 Spc 1 Spc 2 Spc 4 Env 4 Co-Habitation vector Spc 3 Spc 1 Spc 2 {1,3,2} {3} {3,2} Max-CHS Spc 4 {1} 3 3 3 1
Lifestyle annotation Max-CHS
From Freilich et al, Genome Biology, 2009
Environmental diversity Maximal co-habitation
Ecological diversity with intense co-inhabitation, associated with a typical fast rate of growth. A specialized niche with little co-inhabitation, associated with a typical slow rate of growth
Freilich et al, Genome Biology, 2009
Metabolic networks Environments Species Genomes
The patterns observed suggests a universal principle where metabolic flexibility is associated with a need to grow fast, possibly in the face of competition. The interplay between the environmental diversity – and maximal co habitation allows training a predictor for growth rate (ROC score of 0.75 ).
The application of computational methods to predict
A ‘metabolic network model’ is a collection of such
Reaction stoichiometry Reaction directionality Cellular localization Transport and exchange reactions Gene-protein-reaction association
Stoichiometric matrix – network topology with stoichiometry
Predict changes in metabolite concentrations m – metabolite concentrations vector
S – stoichiometric matrix v – reaction rates vector
– No changes in metabolite concentrations (within the system) – Metabolite production and consumption rates are equal
Solution space
Correct solutions
An optimization method for finding a feasible flux distribution
Based on the assumption that evolution optimizes microbes
To enable maximal growth rate the essential biomass
Add to the model a pseudo ‘growth reaction’
These precursors are removed from the
X axis – Succinate uptake rate Y axis – Oxygen uptake rate Z axis - Growth rate (maximal value of the
succinate and oxygen uptake)
Succinate/oxygen PPP Ibarra et al., Nature 2002
A gene knockout is simulated by setting the flux through the
The corresponding reactions are identified by evaluating the
In total, 819 out of the 896 mutants (91%) showed growth behaviors in
Producing and analyzing the first multispecies
Prediction of several ecologically relevant characteristics Sulfate
(Stolyar et a l, 2007)
A three compartments model:
D. vulgaris metabolic model M. maripaludis metabolic model Culture medium
The reconstruction process is continuously
This enables us to go large-scale and address