Machine Learning and Metagenome Analysis
Chris Fields’s slides presented by Amel Ghouila
Machine Learning and Metagenome Analysis Chris Fieldss slides - - PowerPoint PPT Presentation
Machine Learning and Metagenome Analysis Chris Fieldss slides presented by Amel Ghouila Overview of Overview of analysis analysis workflow workflow ASSEMBLY ( DE NOVO ) FASTQC RECONSTRUCTION OF QUALITY CONTROL F ASTQ A GENOME OF
Chris Fields’s slides presented by Amel Ghouila
1
FASTQ
FILES
FASTQC QUALITY CONTROL OF READS TRIMMING FILTERING BAD QUALITY READS
2
MAPPING OF READS TO A REFERENCE GENOME ASSEMBLY (DE NOVO) RECONSTRUCTION OF A GENOME
3
SAM FILES BAM FILES
4
READ DEPTH VARIANT CALLING STRUTURAL VARIATIONS GENE / CHR CNV
5
VCF FILES SNPS INDELS ANNOTATION VISUALIZATION FASTA FILE GFF FILE
2
Morgan XC, Huttenhower C (2012) Chapter 12: Human Microbiome
Biology 8(12): e1002808.
OTU: Operational Taxonomic Unit (cluster
variants) used to categorize bacteria
Morgan XC, Huttenhower C (2012) Chapter 12: Human Microbiome
Biology 8(12): e1002808.
k-NN Hierarchical clustering Bayesian clustering Greedy heuristic clustering
Tools Mothur USEARCH/UCLUST/UPARSE CD-HIT
Morgan XC, Huttenhower C (2012) Chapter 12: Human Microbiome
Biology 8(12): e1002808.
Linear model Random forest
Tools RDP Classifier 16s Classifier PhyloSift PhyloPithia
Quince, C et al. Shotgun metagenomics, from sampling to analysis, (2017) Nature Biotechnology (35):833–844
Sedlar, K et al, Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun
ML Model Linear regression
PCA SVD Lots of Clustering! k-means k-medioids Gaussian mixture model Greedy heuristic Bayesian clustering Spectral clustering
Tools CONCOCT MetaBAT MaxBin
http://armbrustlab.ocean.washington.edu/seastar
Tools MetaProdigal MetaGeneMark FragGeneScan
ML Model HMM Neural network
Sharpton, T. An introduction to the analysis of shotgun metagenomic data. Front. Plant Sci., 16 June 2014
Figure 5 : Gut MLGs classify colorectal carcinoma and adenoma samples from healthy controls.
https://arxiv.org/pdf/1510.06621.pdf