The study of microbial communities: Bioinformatics applications within the UL HPC environment
UL HPC school 2017 13 June 2017 Sh Shaman Na Narayanasamy
Eco-Systems Biology group of LCSB
The study of microbial communities: Bioinformatics applications - - PowerPoint PPT Presentation
The study of microbial communities: Bioinformatics applications within the UL HPC environment UL HPC school 2017 13 June 2017 Sh Shaman Na Narayanasamy Eco-Systems Biology group of LCSB The subject: microbial communities 2 The samples:
Eco-Systems Biology group of LCSB
2
3
Roume et al. ISME J. (2013) 7:110-21 Roume et al. Methods Enzymol. (2013) 531:219-36
4
Metatranscriptomics Metagenomics
Data integration
Metaproteomic
Roume et al. ISME J. (2013) 7:110-21 Roume et al. Methods Enzymol. (2013) 531:219-36
5
DNA / cDNA
WGS
Biological
NGS
WGS library
Biological sample
NGS reads
In silico data
cDNA: complementary DNA WGS: whole genome shotgun NGS: next-generation sequencing
6
Uncompressed Size: 14-82 GB
7
NGS reads
Preprocessing In silico data
Preprocessed NGS reads
8
Preprocessing Assembly Post-assembly Automation Containerization Trimmomatic CutAdapt SortMeRNA *BWA *Bowtie2 IDBA-UD MEGAHIT SPAdes AbySS Newbler Cap3 BWA Bowtie2 MaxBin dRep HMMer BLASTn AMPHORA2 PhyloPhlan Bash Make Python Perl Galaxy Snakemake CWL Ruffus Docker LXD Vigrant *BioConda
9
NGS reads
Preprocessing In silico data
Contig 1 Contig 2
Preprocessed NGS reads
De novo assembly
Assembled contigs
10
Preprocessing Assembly Post-assembly Automation Containerization Trimmomatic CutAdapt SortMeRNA *BWA *Bowtie2 IDBA-UD MEGAHIT SPAdes AbySS Newbler Cap3 BWA Bowtie2 MaxBin dRep HMMer BLASTn AMPHORA2 PhyloPhlan Bash Make Python Perl Galaxy Snakemake CWL Ruffus Docker LXD Vigrant *BioConda
11
Structure information Contig 1 Contig 2
Assembled contigs Annotation
Binning
Contig 2 Gene B Contig 1 Gene A
Predicted genes
Contig 1
Bin X
Contig 2
Bin Y
Bins
Function information Gene B Gene A
12
Preprocessing Assembly Post-assembly Automation Containerization Trimmomatic CutAdapt SortMeRNA *BWA *Bowtie2 IDBA-UD MEGAHIT SPAdes AbySS Newbler Cap3 BWA Bowtie2 MaxBin dRep HMMer BLASTn AMPHORA2 PhyloPhlan Bash Make Python Perl Galaxy Snakemake CWL Ruffus Docker LXD Vigrant *BioConda
13
Preprocessing Assembly Post-assembly Automation Containerization Trimmomatic CutAdapt SortMeRNA *BWA *Bowtie2 IDBA-UD MEGAHIT SPAdes AbySS Newbler Cap3 BWA Bowtie2 MaxBin dRep HMMer BLASTn AMPHORA2 PhyloPhlan Bash Make Python Perl Galaxy Snakemake CWL Ruffus Docker LXD Vigrant *BioConda
14
Preprocessing Assembly Post-assembly Automation Containerization Trimmomatic CutAdapt SortMeRNA *BWA *Bowtie2 IDBA-UD MEGAHIT SPAdes AbySS Newbler Cap3 BWA Bowtie2 MaxBin dRep HMMer BLASTn AMPHORA2 PhyloPhlan Bash Make Python Perl Galaxy Snakemake CWL Ruffus Docker LXD Vigrant *BioConda
15
Narayanasamy, Jarosz et al. BioarXiv (2016) Narayanasamy, Jarosz et al. Genome Biology (2016)
Original logo by Linda Wampach
16
snakemake
RAM
Narayanasamy, Jarosz et al. BioarXiv (2016) Narayanasamy, Jarosz et al. Genome Biology (2016)
17
Muller, Pinel et al. Nature Communications (2014) Roume, Heintz-Buschart et al. NPJ Microbiome and Biofilms (2015) Laczny et al. Frontiers in Microbiology (2016) Heintz-Buschart et al. Nature Microbiology (2016) Narayanasamy, Jarosz et al. Genome Biology (2016) Wampach et al. Frontiers in Microbiology (2017) Kaysen et al. Translational Research (accepted) Muller, Narayanasamy et al. Standards in Genomic Sciences (in review) Wampach, Heintz-Buschart et al. (in preparation) Herold et al. (in preparation) Narayanasamy, Martinez-Arbas et al. (in preparation)
18
19
And in all presentations/posters in international conferences and PhD theses!
20
21
Former ESBers: Emilie Muller Cedric Laczny Abdul Sheik Hugo Roume Myriam Zeimes
22