Global surveillance of infectious diseases Open science, open data, - - PowerPoint PPT Presentation

global surveillance of infectious diseases open science
SMART_READER_LITE
LIVE PREVIEW

Global surveillance of infectious diseases Open science, open data, - - PowerPoint PPT Presentation

Global surveillance of infectious diseases Open science, open data, open for all Frank M. Aarestrup www.compare-europe.eu www.genomicepidemiology.org The charge of the light brigade Strategy to win a war Putting up the right defense Knowing:


slide-1
SLIDE 1

Global surveillance of infectious diseases Open science, open data, open for all

Frank M. Aarestrup www.compare-europe.eu www.genomicepidemiology.org

slide-2
SLIDE 2
slide-3
SLIDE 3

The charge of the light brigade

slide-4
SLIDE 4

Strategy to win a war

Putting up the right defense

Knowing: Where What When How Communicate & Prioritise

slide-5
SLIDE 5

IMPORTANCE OF INFECTIOUS DISEASES

  • Direct cause of 22% of all global deaths (15 million)
  • Huge burden on expected life length and disability (DALYs)
  • Devastating effects on economy:
  • Absence from work
  • Trade
  • Travel
  • Health system
  • Expected to increase; e.g. AMR alone causing 10 million deaths in

2050 (cancer 8.2 million in mainly older people)

slide-6
SLIDE 6

Current infectious disease situation

  • Dynamics of common infectious diseases are changing

– Demographic change, population density, anti vaccine, AMR, etc.

  • New diseases / variants emerge frequently

– Population growth, travel, trade, climate change

  • Effects are difficult to predict due to complexity

– Rapid flexible response

  • Public health, diagnostic, vaccine development and clinical

response depend on global capacity for disease surveillance

– Rapid sharing, comparison and analysis of data from multiple sources and using multiple methodologies

slide-7
SLIDE 7

2 main aspects

§ Most EID come from animals,

  • pportunity for contact increases

§ Once introduced in people, many

  • pportunities for transmission

Wolfe et al., 2007; http://rambaut.github.io/EBOV_Visualization/Makona_1561_D3/; Gytis et al., 2017

§ Complexity increased by food trade

slide-8
SLIDE 8

Ambition: Reduce impact of EID by improving prediction, prevention, detection, control and treatment

slide-9
SLIDE 9

Response to ID outbreaks usually fragmented and late

9

Infected patients

Public Health response Preclinical research response

time

clinical research response Adapted from:

Courtesy Frank Deege

slide-10
SLIDE 10

Response to ID outbreaks with improved detection and sharing of data

10

Infected patients

Public Health response Preclinical research response

time

clinical research response

Courtesy Frank Deege

slide-11
SLIDE 11

NGS advantages

  • Laboratory diagnostics increasingly rely on (pathogen) genomic

information

  • RNA / DNA are common across pathogens, therefore, methods to

analyse pathogen genomes are potentially universal

  • Next generation sequencing capacity is developing fast, and costs are

becoming competitive

  • Data are easy to share electronically and are in a standardized format

Ø Capturing NGS developments may provide a universal language that can be harnessed for early detection of outbreaks across disciplines and domains Ø If the technology keeps developing, less equipped labs may leapfrog

slide-12
SLIDE 12

Preparing for Global Surveillance - Center for Genomic Epidemiology (CGE)

  • Provide a proof of concept of combining bioinformatics with global

epidemiology in real-time (2010-2016)

  • Provide foundation for web based solutions (plug and play tool)

– Rapid sequence assembly and annotation

  • What is it
  • How dangerous is it
  • Have we seen it before
  • Can it be treated
  • www.genomicepidemiology.org
slide-13
SLIDE 13

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 643476.

http://cge.cbs.dtu.dk/services/all.php

CGE tool box

slide-14
SLIDE 14

CGE Batch upload – simple upload and ”all in one go”

1 4 12 June 2017

slide-15
SLIDE 15

Uploaded data and analysis easy to search and download

slide-16
SLIDE 16

User Statistics

Until now: >1,000,000 submissions from 15,000 IP- adresses in +100 countries

Moving tools to the data – integration into ENA repository

slide-17
SLIDE 17

Why a central public repository?

Besides the language and altruistic issues

  • The data comparison problem
  • Allowing easy transfer between levels of access

including public

  • Allow access to bioinformatics for the frontline
  • Allowing for constantly improving the analytic

pipelines

slide-18
SLIDE 18

Data comparison problem

Global repositories > 0.1 - 10 Pb data Client ~1-100 Gb data Internet ~1-10Gb/hour Bring the tools to the data

slide-19
SLIDE 19

mcr-1

  • Mcr-1 gene added to ResFinder database on Nov. 24
  • Nov. 25 – screening of online available human and food

isolates initiated using an iphone on a plane to Paris – 5 hits

  • Nov. 25-26 the authorities notified and the screening

extended to +3,000 available genomes

  • Nov. 29 meeting with the authorities – please wait
  • Dec. 2 press release and submission of manuscript
  • Dec. 9 publication
slide-20
SLIDE 20

Establishing and improving real-time surveillance

  • Rapid sharing

– Online bioinformatic tools

  • >1,000 jobs per day

– Facilities for rapid sharing

  • Natural reservoirs

– Major issues with individual samples

20

The Rio Convention The Nagoya Protocol National ethical committees

slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23

Global sewage surveillance - 2016

12 June 2017

India Brazil

slide-24
SLIDE 24

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 643476.

slide-25
SLIDE 25
slide-26
SLIDE 26

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 643476.

R² = 0,30873

10 12 14 16 18 20 22 24 26 28 30

50 100 150 200 250 300 350 400 450

AMR

slide-27
SLIDE 27

Copenhagen according to sewage

Samples collected since Nov. 2015 Sequenced routinely since June 2016 Released publicly real-time Very preliminary data Drug use the previous year

slide-28
SLIDE 28

Global sewage 2017

slide-29
SLIDE 29

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 643476.

Global surveillance of:

  • Healthy humans
  • Healthy animals
  • Diseased humans & animals

Explanatory varaibles:

  • Trade (food)
  • Travel (flight)
  • Demographic (World bank)

1000 cities 1000 slaugtherhouses 50,000 laboratories

slide-30
SLIDE 30

Building a global genomic epidemiological infrastructure – making data assessable Data:

  • World Bank

(demographic/economic)

  • WHO (health)
  • FAO (food production and

trade)

  • Flight connections
  • Antimicrobial use

Own data Online epidemiological tools (epidemiology for dummies) Search and download Own analysis Raw NGS data Sharing site

  • Own data
  • Shared data
  • Public data

Online bioinformatic tools

(Bioinformartics for dummies)

  • Assembly
  • Mapping
  • Phylogeny
slide-31
SLIDE 31

Research is not enough

  • E-learning

– Massive Open Online Courses

  • Ring-trials
  • Advisory service
  • No free ride

– If you will not share – pay

46 laboratories in 22 countries have provided data for at least one of the PT components

slide-32
SLIDE 32

Challenges

  • Scientific

– Make sense (or even analyse) +50 million genes among exabytes data explained by +2,000 variables and global connectivities

  • False discovery rate – min. 5 billion using clasical statistics
  • Sustainability/analytic pipeline

– 500 million € to establish, 50 million €/y to run

  • Sharing of data
  • Scientific credit
  • Political and other consequences

– When we report diseases where they should not exist!

slide-33
SLIDE 33

How to get involved

  • Build a database and work with us
  • Come with some interesting questions and engage us
  • Make a pipeline and challenge us
  • Support us financially

– 500 million € to establish, 50 million €/y to run – I will be standing outside accepting collections

slide-34
SLIDE 34

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 643476.

Guiding principles:

  • Cross sector, cross domain, open source (not commercial)
  • Interaction with the rest of the world (all inclusive)
  • Data for action (actionable outputs)
  • Central repository (tools to the data) (ENA, DDJ,NCBI)

Our vision: one system serves all

There can be no real-time disease surveillance without real-time data sharing

slide-35
SLIDE 35

Kingdom Phylum Class Family Genus Species IKEA-BigMac Lyngby-Whopper Eukaryota Chordata Mammalia Bovidae Bos Bos mutus 47,728767 55,138006 Eukaryota Chordata Mammalia Bovidae Bubalus Bubalus bubalis 16,571103 16,052984 Eukaryota Chordata Mammalia Bovidae Capra Capra hircus 0,779368 0,858108 Eukaryota Chordata Mammalia Bovidae Ovis Ovis aries 0,362244 0,592654 Eukaryota Chordata Mammalia Vespertilionid ae Myotis Myotis davidii 0,286175 0,062029 Eukaryota Chordata Mammalia Muridae Rattus Rattus norvegicus 0,165231 0,151688 Eukaryota Chordata Mammalia Macropodidae Macropus Macropus eugenii 0,153902 0,189751 Eukaryota Chordata Mammalia Bovidae Pantholops Pantholops hodgsonii 0,129919 0,102347 Eukaryota Chordata Mammalia Bovidae Bos Bos taurus 0,088133 0,203566 Eukaryota Chordata Mammalia Balaenopterid ae Balaenoptera Balaenoptera acutorostrata 0,077098 0,045253 Eukaryota Chordata Mammalia Ursidae Ailuropoda Ailuropoda melanoleuca 0,012359 0,008599

Complete sequencing of food products