FINAL WORKSHOP OF GRID PROJECTS, “PON RICERCA 2000-2006, AVVISO 1575” 1
Parameters estimate in metabolic networks reconstruction
- G. Aprea1, G. Licciardello2, and V. Rosato3
1ENEA, Via del Vecchio Macello, 80055 Portici (Naples), Italy,
giuseppe.aprea@gmail.com
2Science and Technology Park of Sicily, stradale V. Lancia 57, z.i. Blocco Palma I,
95121 Catania, Italy, gralicci@unict.it
3ENEA, Computinig and Modelling Unit, Via Anguillarese 301, 00123 S.Maria di
Galeria (Rome), Italy, rosato@casaccia.enea.it
Abstract—In this paper we describe the im- plementation of two different parallel codes for parameters estimate in metabolic networks based
- n the genetic algorithm to fully take advantage of
modern computational facilities such as the Enea GRID. Index Terms—COMETA, journal, L
A
T EX, paper, template.
- I. INTRODUCTION
Advances in omics sciences produce large amounts of data that need analysis and interpre-
- tation. Reliable explanations of how processes
are regulated require an accurate modeling ap- proach at the systems level. In this paper we focus on metabolic networks models which re- produce the time evolution of all the metabo-
- lites. Quite often these models rely on several
unknown parameters which have to be estimated from experimental data. This task consists in the solution of an inverse problem which requires the use of an efficient optimization algorithm. GA [1] is a widely known optimum search method which yields reliable values for model parameters with a large computational demand. Our aim is to develop a parallel implementation for parameter estimate based on GA to fully take advantage of the modern different compu- tational facilities. Our implementation relies on:
- Ecell
software from Keio Univer- sity(Japan) [2], [4] for simulations of biochemical networks;
- LSF - load sharing facility - a job scheduler
for (multi)cluster [3].
- II. THE GENETIC ALGORITHM: BASICS
According to GA analogy, in a biochemical network, a set K = ki, i = 1, m of unknown parameters is defined genome. Each of these missing array of constants gives rise to a dif- ferent behavior of the network, that is different time evolutions for the metabolites’ concentra-
- tions. The network with the genome end its
behavior together constitute an individual and a group of individuals is a population. As in the case of populations of organisms in nature, GA populations undergo a selection where good experimental data fitting represent the selection
- criterion. After every selection stage, a new
population is created; the current generation is
- ver and a new one is ready. After a large
number of generations, GA is expected to yield the individuals which best fit experimental data.
- III. THE GENETIC ALGORITHM:
COMPUTATIONAL SCHEME GA is implemented following this steps: 1) a random number I of genomes is chosen, each of them differing from the others by the value of the unknown m constants. This is the starting population. 2) for each individual, the time evolution of the metabolites’ concentration is calcu- lated with the E-Cell tool.