data modeling the key to biological data integration
play

Data modeling: the key to biological data integration Franois - PowerPoint PPT Presentation

Data modeling: the key to biological data integration Franois Rechenmann NETTAB 2012 Biological data: not so big, but highly heterogeneous and evolving Big data Satellite images, particle physics, Banks, insurance, telecom


  1. Data modeling: the key to biological data integration François Rechenmann NETTAB 2012

  2. Biological data: not so big, but highly heterogeneous and evolving Big data  Satellite images, particle physics,…  Banks, insurance, telecom companies,… Heterogeneous biological data  Genomic, transcriptomic, proteic, metabolic data  Spectra, structures… Evolving biological data  New technologies  New problematics  Genostar 2012

  3. Data modeling via UML inheritance class Protein Regulator “is - A” MW Length class Sequence slots roles regulated-prot regulator Regulates N-ary associations association Km association Compound slots effector  Genostar 2012

  4. Data modeling via UML  Genostar 2012

  5. Advantages  Intuitive (and graphical) UML-like representation of biological entities and of their relationships  Formal modeling ( vs. natural language): no ambiguity over the definition of entities and relationships  An integrated data space as a large network where nodes are entities and edges are relationships  Efficient support for data consistency checking  Navigation and query facilities over the whole data space

  6. Data modeling in software  Entities described as classes: types and subtypes  Distinction between « sequence » and « replicon »  Relationships  « Feature » is-located-on « sequence »  Methods described as classes  Typed input and output  Typed input and ouput of methods  Type checking: testing method adequacy for input data  Type assignment to output data

  7. Data modeling in database  MicroB: a relationnal database  Interconnected genomic, proteic and metabolic reference data on more than 1500 microbial organisms  Overlapping schema with software schema  More than 300 relations/tables  Easy data import and export from and back to the software

  8. An integrated bioinformatics platform MicroB database Metabolic Pathway Builder Connected genomic, proteic & Perform comparative genomics metabolic data on 1500+ reference & metabolic analyses from microorganisms annotation to analysis of relevant metabolic reactions & Integration of new annotated pathways genomes

  9. An integrated bioinformatics platform  Dedicated visualizers and editors  Exploration and query mechanism

  10. Contacts www.genostar.com Francois.Rechenmann@genostar.com  Genostar 2012

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend