University of Florence Microbial genetics lab Florence computational biology group
Marco Galardini
(@mgalactus)
DuctApe
a tool for the analysis and correlation of genomic and high throughput phenotypic Biolog data
04/03/2013
DuctApe a tool for the analysis and correlation of genomic and high - - PowerPoint PPT Presentation
Marco Galardini (@mgalactus) DuctApe a tool for the analysis and correlation of genomic and high throughput phenotypic Biolog data University of Florence Microbial genetics lab Florence computational biology group 04/03/2013 Who we are 2
University of Florence Microbial genetics lab Florence computational biology group
Marco Galardini
(@mgalactus)
a tool for the analysis and correlation of genomic and high throughput phenotypic Biolog data
04/03/2013
Who we are
2
@combogenomics combo.unifi@gmail.com http://www.unifi.it/dbefcb
Who we are
3
Other collaborations
Dipartimento di Scienze delle Produzioni Agroalimentari e dell'Ambiente
Introduction
4
The genomics and phenomics era
The wishing well
5
MacLean et al., 2009
The genomics era
genomesonline.com
The wishing well
6
http://www.genome.jp/kegg/
The genomics era
The wishing well
7
The phenomics era
www.biolog.com
Introduction
8
Genome data analysis
Phenome data analysis
Introduction
9
How to combine genomic and phenomic data?
The missing link
10
The missing link between genomics and phenomics
The missing link
11
Three different experimental setups
Single strain(s) Mutant(s)
PanGenome
The missing link
12
Three different modules
dgenome
dphenome
dape
The missing link
13
Genomics made easy
dgenome
14
Genome map to KEGG (1)
Blast BBH on a local KEGG database* Blast BBH using KASS web-server**
*Since July 1th 2011, the access to KEGG FTP needs a $2000/$5000 licence **Available for free, fast and reliable
dgenome
15
Genome map to KEGG (2)
KEGG public API Detailed info on:
Fast, multi-threaded access
dgenome
16
Pangenome prediction
The missing link
17
Painless high-throughput phenomics
dphenome
18
From raw data to phenotypic variability
dphenome
19
From raw data to phenotypic variability
dphenome
20
From raw data to phenotypic variability
dphenome
21
From raw data to phenotypic variability
dphenome
22
From raw data to phenotypic variability
dphenome
23
From raw data to phenotypic variability
Lag Min Max Slope Plateau + Area + Average height
dphenome
24
Phenotypic variability at a glance
dphenome
25
Phenotypic variability at a glance
AV = Activity Index
dphenome
26
Activity index (AV)
K-means clustering on 5 parameters, with 10 clusters Fast: from raw .csv files to AV in less than 5 minutes
dphenome
27
Activity index (AV)
No activity Max activity
dphenome
28
Activity index (AV)
Plates heatmaps: phenotypic variability at a glance
dphenome
29
Activity index (AV)
AV boxplots: overall strains comparison (also on single compounds categories)
dphenome
30
Activity index (AV)
AV rings: overall strains comparison
Δ AV
+
dphenome
31
Activity index (AV)
Replica management: discard inconsistent replica using the Δ AV
3 replica 2 replica Keep-min
The missing link
32
The missing link
dape
33
Whole metabolic network reconstruction
dape
34
Single genome metabolic network
Interactive metabolic maps (as web pages)
dape
35
Single genome metabolic network
No activity Max activity
Interactive metabolic maps (as graph files)
dape
36
Single genome metabolic network
No activity Max activity
Interactive metabolic maps (as graph files)
dape
37
Metabolic network comparisons
The missing link
38
Technical features
Under the hood
39
Technical features
DuctApe comes as a UNIX command line program
DuctApe project file Inputs Outputs
Under the hood
40
Technical features
Language Standing on the shoulders of giants
Under the hood
41
http://combogenomics.github.com/DuctApe “combogenomics ductape” ductape-users@googlegroups.com @combogenomics
Acknowledgements
University of Florence
Alessio Mengoni Marco Bazzicalupo Emanuela Marchi Giulia Spini Francesca Decorosi Carlo Viti Luciana Giovannetti
Biolog Inc.
Barry Bochner
CRA
Stefano Mocali Alessandro Florio Anna Benedetti
Emanuele Biondi