Rob Niche Dimensions Edwards What would you do with hundreds of - - PowerPoint PPT Presentation

rob
SMART_READER_LITE
LIVE PREVIEW

Rob Niche Dimensions Edwards What would you do with hundreds of - - PowerPoint PPT Presentation

UNAM The Global Evolution and 2012 Adaptation of Vibrio cholerae Across Multiple Rob Niche Dimensions Edwards What would you do with hundreds of genome sequences? Cholerae Haiti Global evolution Niche dimensions What's


slide-1
SLIDE 1

The Global Evolution and Adaptation of Vibrio cholerae Across Multiple Niche Dimensions

Rob Edwards

UNAM 2012

slide-2
SLIDE 2

What would you do with hundreds

  • f genome sequences?
  • Cholerae
  • Haiti
  • Global evolution
  • Niche dimensions
  • What's important
slide-3
SLIDE 3

Cholera is caused by Vibrio cholerae

slide-4
SLIDE 4

A world wide pandemic

About 3-5 million cases per year About 100 - 200,000 deaths world wide per year Notable deaths:

  • Tchaikovsky
  • Polk (11th President USA)
slide-5
SLIDE 5

Symptoms

About 75% of patients have no symptoms 25-50 PINTS of diarrhea per DAY Severe symptoms are by dehydration Treatment Clean water Electrolytes Vaccine Not antibiotics

slide-6
SLIDE 6

Multiple Pandemics

1st – 1817 to 1823 Started at the Ganges, spread by colonialists 2nd – 1829 to 1849 Worldwide spread via immigrants 3rd – 1852 to 1859 John Snow first epidemiologist

slide-7
SLIDE 7

First epidemiological study

John Snow Portrait painted in 1847 when he was 34 years

  • ld.
slide-8
SLIDE 8

First epidemiological study

John Snow Cholera outbreak in Soho, London 1854 Plotted all cases on a map Found big cluster around water well

slide-9
SLIDE 9

John Snow’s Map

First epidemiological study

slide-10
SLIDE 10

On the mode of communication of Cholera 1854

slide-11
SLIDE 11

Cholera caused by bacteria Outbreaks of cholera

slide-12
SLIDE 12

Multiple Pandemics

1st – 1817 to 1823 Started at the Ganges, spread by colonialists 2nd – 1829 to 1849 Worldwide spread via immigrants 3rd – 1852 to 1859 John Snow first epidemiologist 4th – 1863 to 1879 Originated in mecca 5th – 1881 to 1896 First cholerae vaccine (1892) 6th – 1899 to 1923 Killed 800,000 people 7th – 1961 to present

  • 1991: South America killed > 100,000 people
slide-13
SLIDE 13

Haitian Outbreak

Earthquake Jan 12th, 2010 No cholera in Haiti for > 50 years First case, October 22nd, 2010 By February, 2011 250,000 cases and ~5,000 deaths What was the original source?

slide-14
SLIDE 14

Haitian cholera outbreaks

http://www.ph.ucla.edu/

slide-15
SLIDE 15

Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti

slide-16
SLIDE 16

Cases by day – Mirebalais Hospital

slide-17
SLIDE 17

Cases by Age – St Marc Hospital

On October 20th, 2010

slide-18
SLIDE 18

Haitian Outbreak

Two hypotheses: Endemic, waterborne strain that has been in Haiti but not caused disease for 50 years Imported from another country

slide-19
SLIDE 19

The environmental hypothesis

"They have been fortunate in Haiti that for 50 years the conditions have been such that they haven’t had an intense increase in cholera bacterial populations. ... But they’ve had an earthquake, they’ve had destruction, they’ve had a hurricane ... I think it’s very unfortunate to look for a scapegoat. It is an environmental phenomenon that is involved”

Rita Colwell Johns Hopkins School of Public Health

slide-20
SLIDE 20

The human hypothesis

“The organism that is causing the disease is very uncharacteristic of (Haiti and the Caribbean), and is quite characteristic of the region from where the soldiers in the base came. ... I don't see there is any way to avoid the conclusion that an unfortunate and presumably accidental introduction of the organism occurred."

John Mekalanos Harvard Medical School

slide-21
SLIDE 21

Conditions favor human hypothesis

Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti

slide-22
SLIDE 22

Conditions favor human hypothesis

Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti

slide-23
SLIDE 23

Conditions favor human hypothesis

Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti

slide-24
SLIDE 24

Cholerae Mimicus Parahemolyticus Harveyi Vibrio cholerae from Bangladesh in 1994 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Bangladesh in 2002 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Haiti in 2010

On the source of Haitian cholera

slide-25
SLIDE 25

Nepalese soldiers?

Outbreak in Khatmandu, Nepal before the soldiers left Outbreaks downstream (not upstream) along the river from the nepalese UN camp But that could have come from river trade. Ships used to fly the yellow flag when they were quarantined by cholera

slide-26
SLIDE 26

Haitian cholera outbreaks

http://www.ph.ucla.edu/

slide-27
SLIDE 27

Global evolution of Vibrio

Can we use genomics to identify the global evolution of Vibrio? Which gene(s) are important for temporal/spatial variation?

slide-28
SLIDE 28

Prototype Vibrio cholerae sequence

TIGR Nature 406, 477-483(3 August 2000)

slide-29
SLIDE 29

Sequenced genomes

2011 – 32 Vibrio strains sequenced

slide-30
SLIDE 30

Fabiano Thompson's Lab @ UFRJ

slide-31
SLIDE 31

Fundação Oswaldo Cruz

slide-32
SLIDE 32
slide-33
SLIDE 33

Ion quality scores

slide-34
SLIDE 34

Sequenced genomes

2011 – 32 Vibrio strains sequenced 2011 – 171 Vibrio strains sequenced

slide-35
SLIDE 35

Reads per chromosome (Chr. I)

slide-36
SLIDE 36

Reads per chromosome (Chr. I)

slide-37
SLIDE 37

Cholera Toxin Phage

slide-38
SLIDE 38

Annotated using RAST

slide-39
SLIDE 39

Merge frameshifts

  • Merge adjacent genes if:
  • Have the same function
  • Have similarity to the same 3rd party protein
slide-40
SLIDE 40

Frameshift frequency

slide-41
SLIDE 41

Metabolic reconstruction considers the pathways present in the cell Based on comparison to Sanger sequenced V. cholerae El Tor strain (a virulent pathogen)

Is it any good?

slide-42
SLIDE 42
slide-43
SLIDE 43

R-18304 (Ion) El T

  • r

(Sanger)

slide-44
SLIDE 44

R-18304 (Ion) El T

  • r

(Sanger)

slide-45
SLIDE 45

Single nucleotide polymorphisms

ATCATCGATCAGCATGCATCAGCATCGATCAGC ATCATCGATCAGCATGCATCAGCATCGATCAGC ATCATCGATCAGCATGCATCAGCCTCGATCAGC ATCATCGATCAGCATGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGAGCAGC ATCATCGATCAGCAAGCATCAGCCTCGAGCAGC

slide-46
SLIDE 46

Global evolution

Mutreja et al 2011

slide-47
SLIDE 47

Waves of spread of cholera

Mutreja et al 2011

slide-48
SLIDE 48

Different evolution for each wave

Mutreja et al 2011

slide-49
SLIDE 49

Cholerae Mimicus Parahemolyticus Harveyi Vibrio cholerae from Bangladesh in 1994 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Bangladesh in 2002 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Haiti in 2010

On the source of Haitian cholera

slide-50
SLIDE 50

Evolution not only by SNPs

Conservation of the ~120 kb superintegron region across 210 strains

slide-51
SLIDE 51

Horizontal gene transfer versus Vertical evolution

Mother Daughter Daughter

S N P s HGT

slide-52
SLIDE 52

Niche dimensions

  • 210 Vibrio genomes
  • Reassembled
  • Reannotated
  • Find interesting

genes!

  • Year
  • Continent
  • Country
  • Lat/Lon Coordinates
  • Clinical or Environmental
  • Source
  • Serogroup
  • Serotype
  • Biotype
  • Mutreja wave
slide-53
SLIDE 53

Serogroup

  • V. cholerae classification

Vibrio cholerae Non-cholera toxin No disease Cholera toxin O1 O139 Classical El Tor Ogawa Inaba Epidemics

Biotype Serotype

slide-54
SLIDE 54

Niche dimensions

  • 210 Vibrio genomes
  • Reassembled
  • Reannotated
  • Find interesting

genes!

  • Year
  • Continent
  • Country
  • Lat/Lon Coordinates
  • Clinical or Environmental
  • Source
  • Serogroup
  • Serotype
  • Biotype
  • Mutreja wave
slide-55
SLIDE 55

Response variables

  • 15,000 genes in the pangenome
  • 933 subsystems (pathways) present in at least
  • ne genome
  • SNPs (after Mutreja)
slide-56
SLIDE 56

Analysis

  • Recreate evolution of the Vibrios
  • What are the important genes for each niche

dimension

Who, what, when, where! Use random forests to identify important variables

slide-57
SLIDE 57

Random Forest

O-antigen Exopoly- saccharide Capsule Sialic Acid DNA recomb. 01 10 20 5 10 01 10 20 5 10 01 10 20 5 10 0139 100 1 8 10 0139 100 1 8 10 0139 100 1 8 10 0139 100 1 8 10

slide-58
SLIDE 58

O139 O1 Exopoly- saccharide

<50

Random Forest

O1 O139 DNA- recombination

10

O1 O139 Capsule

<10

slide-59
SLIDE 59

Random Forest

Each tree votes on the importance

  • f each

variable. Typically, run 10,000 trees

slide-60
SLIDE 60

Genes important for who ? (serogroup)

slide-61
SLIDE 61

Genes important for what? (clinical, environmental, ...)

slide-62
SLIDE 62

Genes important for where? (continent)

slide-63
SLIDE 63

Separation of functions by continent

slide-64
SLIDE 64

Genes important for when? (year)

DNA Repair

slide-65
SLIDE 65

DNA repair & phages

Normal DNA repair (134 strains) umuC umuD Additional DNA repair (4 strains; not O1) umuC umuD Phage borne DNA repair (72 strains) umuC prophage

slide-66
SLIDE 66

Different evolution for each wave

Mutreja et al 2011 Waves 2 & 3 have phage Interrupted repair

slide-67
SLIDE 67

Conclusions

  • Unraveling evolution and spread of new

pathogens

  • Mining genomes and niche dimensions
  • Don't get scooped!
slide-68
SLIDE 68

Discussion points

  • What is the value of multi genome projects?
slide-69
SLIDE 69

Current multigenome projects

Organism Number Organism Number

  • S. pyogenes

3,615 Mycobacterium tuberculosis 390

  • S. pneumoniae

3,085 Salmonella in cattle and humans 373 Rice (Oryza sativa) 3,000 Vibrio 274

  • C. elegans

2,007 Shigella sonnei 263 Clostridium difficile 1,250 Mycobacterium tuberculosis 259 The thousand (human) genome project 1,092 Streptococcus pneumoniae 240 Mycobacterium tuberculosis 1,000 Methicillin-resistant Staphylococcus aereus 193 Plasmodium falciparum 825 Campylobacter jejuni 192 Streptococcus pneumoniae 616 Mycobacterium abscessus in CF 170 Nick Loman: http://lab.loman.net/