Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic - PowerPoint PPT Presentation

Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic Control Analysis COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1

Flux Balance Analysis (FBA) ✤ Flux balance analysis (FBA), an optimality-base method for flux prediction, is one of the most popular modeling approaches for metabolic systems. ✤ Flux optimization methods do not describe how a certain flux distribution is realized (by kinetics or enzyme regulation), but which flux distribution is optimal for the cell; e.g., providing the highest rate of biomass production at a limited inflow of external nutrients. ✤ This allows us to predict flux distributions without the need for a kinetic description. 2

Flux Balance Analysis (FBA) ✤ FBA investigates the theoretical capabilities and modes of metabolism by imposing a number of constraints on the metabolic flux distributions: ✤ The assumption of a steady state: S × v = 0 . ✤ Thermodynamics constraints: a i ≤ v i ≤ b i . ✤ An optimality assumption: the flux distribution has to maximize (or, minimize) an objective function f(v) r X f ( v ) = c i v i i =1 3

� Geometric Interpretation of FBA v 3 v 3 v 3 Constraints Optimization 1) Sv = 0 maximize Z 2) a i < v i < b i v 1 v 1 v 1 Unconstrained Allowable Optimal solution solution space solution space v 2 v 2 v 2 Figure 1 The conceptual basis of constraint-based modeling. With no constraints, the flux distribution of a biological network may lie at any point in a solution space. When mass balance constraints imposed by the stoichiometric matrix S (labeled 1) and capacity constraints imposed by the lower and upper bounds ( a i and b i ) (labeled 2) are applied to a network, it defines an allowable solution space. The network may acquire any flux distribution within this space, but points outside this space are denied by the constraints. Through optimization of an objective function, FBA can identify a single optimal flux distribution that lies on the edge of the allowable solution space. [Source: “What is flux balance analysis?”, Nat Biotech.] 4

Formulation of an FBA Problem Reaction 1 A B + C a B + 2C D Reaction 2 Genome-scale ... metabolic reconstruction Reaction n Biomass Glucose Oxygen Reactions ... 1 2 n b v 1 A Mathematically represent –1 v 2 s B 1 –1 e metabolic reactions t i C l 1 –2 o ... = 0 * b and constraints D a 1 t v e ... M n –1 v biomass v –1 glucose m v oxygen Stoichiometric matrix, S Fluxes, v – v + ... = 0 1 c Mass balance defines a v – v 2 + ... = 0 1 system of linear equations v – 2 v + ... = 0 1 2 v + ... = 0 2 etc. d Define objective function To predict growth, Z = v biomass ( Z = c 1 * v 1 + c 2 * v 2 ... ) v 2 Z Point of optimal v e Calculate fluxes that maximize Z Solution space defined by constraints v 1 5

What to Optimize? ✤ Minimize ATP production: the most energy-efficient state ✤ Minimize nutrient intake: the fittest state under nutrient shortage ✤ Maximize metabolite production: the biochemical production capabilities of certain desirable metabolites such as lysine, phenylalanine, etc. ✤ Maximize biomass formation: maximal growth rate ✤ ... 6

Producing Biomass ✤ Growth can be defined in terms of the biosynthetic requirements to make a cell. ✤ These requirements are based on literature values of experimentally determined biomass composition. ✤ Thus, biomass generation is defined as a linked set of reaction fluxes draining intermediate metabolites in the appropriate ratios and represented as an objective function Z. 7

Biomass Formation in E. coli ✤ The requirements for making 1g of E. coli biomass from key cofactors and biosynthetic precursors have been documented. ✤ This means that for E. coli to grow, all these components must be provided in the appropriate relative amounts. ✤ Key biosynthetic precursors are used to make all the constituents of E. coli biomass. Their relative requirements to make 1g of E. coli biomass are: Z precursors = +0.205V g6P +0.071V F6P +0.898V R5P +0.361V E4P +0.129V T3P +1.496V 3PG +0.519V PEP +2.833V PYR +3.748V AcCoA +1.787V OAA +1.079V α KG 8

Biomass Formation in E. coli ✤ In addition to precursors, cofactors are needed to drive the process. ✤ The cofactors requirement to synthesize the monomers from the precursors (amino acids, fatty acids, nucleic acids) and to polymerize them into macromolecules is Z cofactors = 42.703V ATP -3.547V NADH +18.22V NADPH 9

Biomass Formation in E. coli ✤ The mass and cofactor requirements to generate E. coli biomass are: Z biomass = Z precursors + Z cofactors 10

Resources for FBA ✤ The BIGG database: http://bigg.ucsd.edu/ ✤ The COBRA toolbox: http://opencobra.sourceforge.net/ ✤ FASIMU: http://www.bioinformatics.org/fasimu/ 11

Applications of FBA 12

Modular Epistasis in Yeast Metabolism ✤ Genes can be classified by categories related to functions of the cell (e.g., translation, energy metabolism, mitosis, etc.) based on textbook knowledge. ✤ Can we infer functional associations directly from deletion experiments? ✤ If two gene products can compensate for each other’s loss, then deleting both of them will have a much stronger impact on cell fitness than one would expect from their single deletions. 13

Modular Epistasis in Yeast Metabolism ✤ On the other hand, if two gene products are essential parts of the same pathway, a single deletion would already shut down the pathway and a double deletion would not have any further effect. ✤ Accordingly, we may try to infer functional relationships among the gene products by comparing the fitness losses caused by combined gene deletions. 14

Modular Epistasis in Yeast Metabolism ✤ Epistasis describes how the fitness loss due to a gene mutation depends on the presence of other genes. ✤ It can be quantified by comparing the fitness of a wild type organism, e.g., the growth rate of a bacteria culture, to the fitness of single and double deletion mutants. ✤ A single gene deletion (for gene i) will decrease the fitness (e.g., the growth rate) from a value f wt to a value f i , leading to a growth defect w i =f i /f wt ( ≤ 1). 15

Modular Epistasis in Yeast Metabolism ✤ For a double deletion of unrelated genes i and j, we may expect a multiplicative effect w ij =w i w j (no epistasis). ✤ If the double deletion is more severe ( w ij <w i w j ), we call the epistasis aggravating . ✤ If the double deletion is less severe ( w ij >w i w j ), we call the epistasis buffering . ✤ Both cases of aggravating and buffering epistasis indicate functional associations between the genes in question. 16

Modular Epistasis in Yeast Metabolism ✤ Segre et al. (Nature Genetics, 37(1):77-83, 2005) recently used FBA to predict growth rates of the yeast S. cerevisiae and to calculate the epistatic effects between all metabolic genes. ✤ The model predicted relative growth defects of all single and double deletion mutants, from which they computed an epistasis measure for each pair of genes: ✏ ij = w ij − w i w j ˆ | ˆ w ij − w i w j | extreme buffering extreme aggravation w ij = min { w i , w j } w ij = 0 ˆ ˆ 17

Modular Epistasis in Yeast Metabolism b 600 400 Gene pairs 200 0 –1 0 1 strong ~ � complete aggravation no epistasis buffering (lethal phenotypes) 18

Modular Epistasis in Yeast Metabolism a A Glycolysis / gluconeogenesis B Pentose phosphate cycle C Tricarboxylic acid cycle Electron transport complex IV D Figure 2 Epistatic interactions between genes D ! Oxidative phosphorylation classified by functional annotation groups tend to E Pyruvate metabolism F Mitochondrial membrane transport be of a single sign ( i.e. , monochromatic). G ATP synthetase ( a ) Representation of the number of buffering and H Anaplerotic reactions I Transport, metabolic byproducts aggravating interactions within and between Transport, other compounds I ! J Aromatic amino acids metabolism groups of genes defined by common preassigned K Cysteine biosynthesis annotation from the FBA model. The radii of the K ! Sulfur metabolism Proline metabolism L pies represent the total number of interactions Pyrimidine metabolism M (ranging logarithmically from 1 in the smallest N Purine metabolism O Salvage pathways pies to 35 in the largest). The red and green pie P Sterol biosynthesis slices reflect the numbers of aggravating and Q Alanine and aspartate metabolism Q ! Arginine metabolism buffering interactions, respectively. Coenzyme A biosynthesis R Monochromatic interactions, represented by R ! Pantothenate and CoA biosynthesis S Glycine, serine and threonine metabolism whole green or red pies, are much more common Sucrose and sugar metabolism T than would be expected by chance. ( b ) Sensitivity U Lysine metabolism V Methionine metabolism analysis of the prevalence of monochromaticity W Phospholipid biosynthesis X Plasma membrane transport - amino acids A B C D D ! E F G H I I ! J K K ! L M N O P Q Q ! R R ! S T U V W X 19

The Interplay Between Metabolism and Gene Regulation ✤ Recently, Shlomi et al. (MSB 3:101) conducted a computational analysis of the interplay between metabolism and transcriptional regulation in E. coli . ✤ To enable such an analysis, the authors proposed a new method, steady-state regulatory flux balance analysis (SR-FBA), for predicting gene expression and metabolic fluxes in a large-scale integrated metabolic-regulatory model. 20

Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic - PowerPoint PPT Presentation

Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic Control Analysis COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 Flux Balance Analysis (FBA) Flux balance analysis (FBA), an optimality-base

Data Mining in Bioinformatics Day 7: Clustering in Bioinformatics Karsten Borgwardt February 25

Outline Administravia What is bioinformatics CS 5263 Bioinformatics Why

Data Mining in Bioinformatics Day 6: Classification in Bioinformatics Karsten Borgwardt February

Data Mining in Bioinformatics Day 9: String & Text Mining in Bioinformatics Karsten Borgwardt

Bioinformatics Outline What is bioinformatics? Who are bioinformaticians? Hardware

Bioinformatics Panel Presentation Peter D. Karp, Ph.D. Director, Bioinformatics Research Group

SciLifeLab Bioinformatics Platform National Bioinformatics Infrastructure Sweden (NBIS) Nina

Data Mining in Bioinformatics Day 8: Feature Selection in Bioinformatics Karsten Borgwardt

Practical Bioinformatics Mark Voorhies 5/21/2019 Mark Voorhies Practical Bioinformatics Change

Biological Network Analysis: Graph Mining in Bioinformatics Karsten Borgwardt Interdepartmental

Bioinformatics Bioinformatics Tools for RNA Tools for RNA Data Analysis Data Analysis Joseph

Bioinformatics: Network Analysis Comparative Network Analysis COMP 572 (BIOS 572 / BIOE 564) -

Thailand Bioinformatics: Research and Applications Sissades T ongsima Bioinformatics

CAMDA: An Overview Michael Ochs Bioinformatics Fox Chase Cancer Center Bioinformatics Fox

Introduction to Cancer Bioinformatics and cancer biology Anthony Gitter Cancer Bioinformatics

Text Mining and Information Extraction Applications for Bioinformatics and Systems Biology Plant

Leveraging State Pilot Experience in Health Home Programs August 2, 2012; 3:00 4:00PM (ET)

NICHD NICHD Research Mission & Research Mission & Research Mission & Research

Increasing Physical Activity through Health-Enabling Technologies: the Project Being Strong

Overview: Kernels for Sequences and Graphs String Kernels 8 Example Sequence Classification

Measuring aquatic ecosystem Measuring aquatic ecosystem metabolism in the southern Everglades

Jonathan Karr karr@mssm.edu July 17, 2019 Join us at Mount Sinai! karr@mssm.edu KarrLab.org

Scalinga Plenitude of Power Laws Scaling-at-large Principles of Complex Systems Allometry

Framework for Establishing an Internal Threshold of Toxicological Concern Presentation for the

Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic - PowerPoint PPT Presentation

Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic Control Analysis COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 Flux Balance Analysis (FBA) Flux balance analysis (FBA), an optimality-base

Data Mining in Bioinformatics Day 7: Clustering in Bioinformatics Karsten Borgwardt February 25

Outline Administravia What is bioinformatics CS 5263 Bioinformatics Why

Data Mining in Bioinformatics Day 6: Classification in Bioinformatics Karsten Borgwardt February

Data Mining in Bioinformatics Day 9: String &amp; Text Mining in Bioinformatics Karsten Borgwardt

Bioinformatics Outline What is bioinformatics? Who are bioinformaticians? Hardware

Bioinformatics Panel Presentation Peter D. Karp, Ph.D. Director, Bioinformatics Research Group

SciLifeLab Bioinformatics Platform National Bioinformatics Infrastructure Sweden (NBIS) Nina

Data Mining in Bioinformatics Day 8: Feature Selection in Bioinformatics Karsten Borgwardt

Practical Bioinformatics Mark Voorhies 5/21/2019 Mark Voorhies Practical Bioinformatics Change

Biological Network Analysis: Graph Mining in Bioinformatics Karsten Borgwardt Interdepartmental

Bioinformatics Bioinformatics Tools for RNA Tools for RNA Data Analysis Data Analysis Joseph

Bioinformatics: Network Analysis Comparative Network Analysis COMP 572 (BIOS 572 / BIOE 564) -

Thailand Bioinformatics: Research and Applications Sissades T ongsima Bioinformatics

CAMDA: An Overview Michael Ochs Bioinformatics Fox Chase Cancer Center Bioinformatics Fox

Introduction to Cancer Bioinformatics and cancer biology Anthony Gitter Cancer Bioinformatics

Text Mining and Information Extraction Applications for Bioinformatics and Systems Biology Plant

Leveraging State Pilot Experience in Health Home Programs August 2, 2012; 3:00 4:00PM (ET)

NICHD NICHD Research Mission &amp; Research Mission &amp; Research Mission &amp; Research

Increasing Physical Activity through Health-Enabling Technologies: the Project Being Strong

Overview: Kernels for Sequences and Graphs String Kernels 8 Example Sequence Classification

Measuring aquatic ecosystem Measuring aquatic ecosystem metabolism in the southern Everglades

Jonathan Karr karr@mssm.edu July 17, 2019 Join us at Mount Sinai! karr@mssm.edu KarrLab.org

Scalinga Plenitude of Power Laws Scaling-at-large Principles of Complex Systems Allometry

Framework for Establishing an Internal Threshold of Toxicological Concern Presentation for the

Data Mining in Bioinformatics Day 9: String & Text Mining in Bioinformatics Karsten Borgwardt

NICHD NICHD Research Mission & Research Mission & Research Mission & Research