Rapid Identification of AMR Determinants from Metagenomic Samples - PowerPoint PPT Presentation

Rapid Identification of AMR Determinants from Metagenomic Samples AMRtime Progress Report Finlay Maguire June 22, 2018 Faculty of Computer Science, Dalhousie University

Table of contents 1. Overview 2. Training Data 3. Read filtering 4. Sensitive Homology Search 5. Variant Models 6. Summary 7. Acknowledgements 1

Overview

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase • Protein Variant e.g. GyrA fluoroquinolone mutation, FolP sulfonamide mutation 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase • Protein Variant e.g. GyrA fluoroquinolone mutation, FolP sulfonamide mutation • rRNA gene variants e.g. Mycobacterium aminoglycoside resistance 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase • Protein Variant e.g. GyrA fluoroquinolone mutation, FolP sulfonamide mutation • rRNA gene variants e.g. Mycobacterium aminoglycoside resistance • Efflux pump e.g. AcrAB-TolC, MexAB-OprM mutations 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase • Protein Variant e.g. GyrA fluoroquinolone mutation, FolP sulfonamide mutation • rRNA gene variants e.g. Mycobacterium aminoglycoside resistance • Efflux pump e.g. AcrAB-TolC, MexAB-OprM mutations • Gene cluster e.g. Van glycopeptide resistance clusters 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase • Protein Variant e.g. GyrA fluoroquinolone mutation, FolP sulfonamide mutation • rRNA gene variants e.g. Mycobacterium aminoglycoside resistance • Efflux pump e.g. AcrAB-TolC, MexAB-OprM mutations • Gene cluster e.g. Van glycopeptide resistance clusters • Resistance Gene Identifier (RGI): contigs, predicted genes and merged metagenomic reads 2

Comprehensive Antibiotic Resistance Database • https://card.mcmaster.ca/ (Jia et al., 2016) as of June 2018: • Built around Antibiotic Resistance Ontology (ARO): 3996 terms • 2536 AMR Detection Models with manually curated criteria: • Homology e.g. NDM beta-lactamases, aminoglycoside acetyltransferase • Protein Variant e.g. GyrA fluoroquinolone mutation, FolP sulfonamide mutation • rRNA gene variants e.g. Mycobacterium aminoglycoside resistance • Efflux pump e.g. AcrAB-TolC, MexAB-OprM mutations • Gene cluster e.g. Van glycopeptide resistance clusters • Resistance Gene Identifier (RGI): contigs, predicted genes and merged metagenomic reads • CARDPredicted prevalence dataset 2

Metagenomic Analysis modified from https://www.gatc-biotech.com/en/expertise/genomics/metagenome-analysis.html Key difficulties: • Variation in abundance and diversity 3

Metagenomic Analysis modified from https://www.gatc-biotech.com/en/expertise/genomics/metagenome-analysis.html Key difficulties: • Variation in abundance and diversity • Short fragmentary data 3

Metagenomic Analysis modified from https://www.gatc-biotech.com/en/expertise/genomics/metagenome-analysis.html Key difficulties: • Variation in abundance and diversity • Short fragmentary data • Large amounts of data 3

Metagenomic Analysis modified from https://www.gatc-biotech.com/en/expertise/genomics/metagenome-analysis.html Key difficulties: • Variation in abundance and diversity • Short fragmentary data • Large amounts of data • Compositionality 3

Metagenomic Analysis modified from https://www.gatc-biotech.com/en/expertise/genomics/metagenome-analysis.html Key difficulties: • Variation in abundance and diversity • Short fragmentary data • Large amounts of data • Compositionality • Spare and imbalanced labels 3

AMRtime Structure Input files Metagenomic Reads Processes AMR Filtering Intermediate files Output files Filtered reads CARD Sensitive Homology Search Homology predictions Variant Identification Metamodels Variant predictions Metamodel predictions 4

Training Data

Dataset Generator Assembled Genomes (*.fna) Resistance Gene Identifier (RGI) Abundance/Diversity Resampling CARD AMR Annotations (*.gff) ’Assembled’ metagenome (.fna) Illumina Simulator (ART) Labelling Synthetic metagenome (.fq) Read labels (.txt) 5

Determinants are scarce 6

Determinants are imbalanced 7

AMR sequence space is biased 8

Read filtering

Homology Filter Approaches • BLASTX (Gish et al., 1993) • DIAMOND (Buchfink et al., 2015) • PALADIN (Westbrook et al., 2017) • MMSeqs2 (Steinegger and S¨ oding, 2017) 9

Performance at defaults? 10

How computationally efficient are they? 11

What about in terms of memory? 12

Is there a cap on overall performance? 13

What about to hit any ARO? 14

Performance for best setting per tool 15

But what about individual ARO performance? 16

Systematically missing AROs 17

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 • OXA-2 (M95287.4): 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 • OXA-2 (M95287.4): • Protein 2456-3280 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 • OXA-2 (M95287.4): • Protein 2456-3280 • DNA 1-828 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 • OXA-2 (M95287.4): • Protein 2456-3280 • DNA 1-828 • Acinetobacter OprD conferring resistance to imipenem (CP006768.1): 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 • OXA-2 (M95287.4): • Protein 2456-3280 • DNA 1-828 • Acinetobacter OprD conferring resistance to imipenem (CP006768.1): • Protein 3513470-3514777 18

Why are these 10 always missed? • Enterococcus faecalis liaS mutant conferring daptomycin resistance (AE016830.1): • Protein 2790824-2789724 • DNA 1-732 • OXA-2 (M95287.4): • Protein 2456-3280 • DNA 1-828 • Acinetobacter OprD conferring resistance to imipenem (CP006768.1): • Protein 3513470-3514777 • DNA 3514887-3515414 18

CARD Full Length Alignment QC • 11 AROs protein not detected from DNA 19

CARD Full Length Alignment QC • 11 AROs protein not detected from DNA • 2 AROs different top protein hit from DNA 19

CARD Full Length Alignment QC • 11 AROs protein not detected from DNA • 2 AROs different top protein hit from DNA • Warnings: 119 AROs with different top protein but ID % > 99 19

Rapid Identification of AMR Determinants from Metagenomic Samples - PowerPoint PPT Presentation

Rapid Identification of AMR Determinants from Metagenomic Samples AMRtime Progress Report Finlay Maguire June 22, 2018 Faculty of Computer Science, Dalhousie University Table of contents 1. Overview 2. Training Data 3. Read filtering 4.

AMRtime Precise identification of antimicrobial resistance determinants from metagenomic data

The AMR Group An n In Intr trod oduc uction tion 2013 2013 The Group The he AMR AMR

AMR and EPSRC AMR Networks Meeting, Sheffield, Sept 16 Christina Turner and Stephanie Newland

CS225: Spatial Computing Course Outline Instructor: Amr Magdy Computer Science and Engineering

Updating the RTP payload format for AMR and AMR-WB draft-ietf-avt-rtp-amr-bis-00.txt Magnus

WITH C++ Prof. Amr Goneid AUC Part 5. Functions Prof. amr Goneid, AUC 1 Functions Prof. amr

WITH C++ Prof. Amr Goneid AUC Part 12. Recursion Prof. amr Goneid, AUC 1 Recursion Prof. amr

Rapid Response Jobs are Alaskas Future Rapid Response Rapid Response Rapid Response is a

SOCIAL DETERMINANTS OF HEALTH Yes, We Have a Role in Our Patients Social Determinants of Health

Neural AMR : Sequence-to-Sequence Models for Parsing and Generation annis Konstas joint work

CS260-002: Spatial Data Modeling and Analysis Course Outline Instructor: Amr Magdy Computer

WITH C++ Prof. Amr Goneid AUC Part 6. Simple and User Defined Data Types Prof. amr Goneid, AUC

WITH C++ Prof. Amr Goneid AUC Part 13. Abstract Data Types (ADTs) Prof. amr Goneid, AUC 1

WITH C++ Prof. Amr Goneid AUC Part 9. Streams & Files Prof. amr Goneid, AUC 1 Streams

WITH C++ Prof. Amr Goneid AUC Part 11a. The Vector Class Prof. amr Goneid, AUC 1 The Vector

WITH C++ Prof. Amr Goneid AUC Part 8. Characters & Strings Prof. amr Goneid, AUC 1

ORAL PRESENTATIONS Invited seminars 34. Italian Institute of Technology, Naples, Italy, 5 June

March 2019 Important Information Cautionary Statement Regarding Forward-Looking Statements

THE WEBINAR WILL BEGIN IN A FEW MINUTES Please call +1.646.307.1990 and enter conference ID

EMA Workshop on Hemophilia Registries Industry Perspectives July 01-02, 2015 - London Mary

Second ISV COVID-19 Vaccines Virtual Congress 21 July 2020 Novavax in brief We are a late-stage

OGI INVESTOR PRESENTATION NASDAQ (OGI) TSX (OGI) OGI INVESTORS PRESENTATION | Cautionary

General Motors Lightweighting Strategy Dr. Paul E. Krajewski Global Mass Integration and Strategy

Investor Presentation Fullerton India Home Finance Co. Ltd. No part of it may be circulated,

Sambuz

Useful Links

Newsletter

Mail Us

Rapid Identification of AMR Determinants from Metagenomic Samples - PowerPoint PPT Presentation

Rapid Identification of AMR Determinants from Metagenomic Samples AMRtime Progress Report Finlay Maguire June 22, 2018 Faculty of Computer Science, Dalhousie University Table of contents 1. Overview 2. Training Data 3. Read filtering 4.

AMRtime Precise identification of antimicrobial resistance determinants from metagenomic data

The AMR Group An n In Intr trod oduc uction tion 2013 2013 The Group The he AMR AMR

AMR and EPSRC AMR Networks Meeting, Sheffield, Sept 16 Christina Turner and Stephanie Newland

CS225: Spatial Computing Course Outline Instructor: Amr Magdy Computer Science and Engineering

Updating the RTP payload format for AMR and AMR-WB draft-ietf-avt-rtp-amr-bis-00.txt Magnus

WITH C++ Prof. Amr Goneid AUC Part 5. Functions Prof. amr Goneid, AUC 1 Functions Prof. amr

WITH C++ Prof. Amr Goneid AUC Part 12. Recursion Prof. amr Goneid, AUC 1 Recursion Prof. amr

Rapid Response Jobs are Alaskas Future Rapid Response Rapid Response Rapid Response is a

SOCIAL DETERMINANTS OF HEALTH Yes, We Have a Role in Our Patients Social Determinants of Health

Neural AMR : Sequence-to-Sequence Models for Parsing and Generation annis Konstas joint work

CS260-002: Spatial Data Modeling and Analysis Course Outline Instructor: Amr Magdy Computer

WITH C++ Prof. Amr Goneid AUC Part 6. Simple and User Defined Data Types Prof. amr Goneid, AUC

WITH C++ Prof. Amr Goneid AUC Part 13. Abstract Data Types (ADTs) Prof. amr Goneid, AUC 1

WITH C++ Prof. Amr Goneid AUC Part 9. Streams &amp; Files Prof. amr Goneid, AUC 1 Streams

WITH C++ Prof. Amr Goneid AUC Part 11a. The Vector Class Prof. amr Goneid, AUC 1 The Vector

WITH C++ Prof. Amr Goneid AUC Part 8. Characters &amp; Strings Prof. amr Goneid, AUC 1

ORAL PRESENTATIONS Invited seminars 34. Italian Institute of Technology, Naples, Italy, 5 June

March 2019 Important Information Cautionary Statement Regarding Forward-Looking Statements

THE WEBINAR WILL BEGIN IN A FEW MINUTES Please call +1.646.307.1990 and enter conference ID

EMA Workshop on Hemophilia Registries Industry Perspectives July 01-02, 2015 - London Mary

Second ISV COVID-19 Vaccines Virtual Congress 21 July 2020 Novavax in brief We are a late-stage

OGI INVESTOR PRESENTATION NASDAQ (OGI) TSX (OGI) OGI INVESTORS PRESENTATION | Cautionary

General Motors Lightweighting Strategy Dr. Paul E. Krajewski Global Mass Integration and Strategy

Investor Presentation Fullerton India Home Finance Co. Ltd. No part of it may be circulated,

Sambuz

Useful Links

Newsletter

Mail Us

WITH C++ Prof. Amr Goneid AUC Part 9. Streams & Files Prof. amr Goneid, AUC 1 Streams

WITH C++ Prof. Amr Goneid AUC Part 8. Characters & Strings Prof. amr Goneid, AUC 1