SLIDE 1 The Global Evolution and Adaptation of Vibrio cholerae Across Multiple Niche Dimensions
Rob Edwards
UNAM 2012
SLIDE 2 What would you do with hundreds
- f genome sequences?
- Cholerae
- Haiti
- Global evolution
- Niche dimensions
- What's important
SLIDE 3
Cholera is caused by Vibrio cholerae
SLIDE 4 A world wide pandemic
About 3-5 million cases per year About 100 - 200,000 deaths world wide per year Notable deaths:
- Tchaikovsky
- Polk (11th President USA)
SLIDE 5
Symptoms
About 75% of patients have no symptoms 25-50 PINTS of diarrhea per DAY Severe symptoms are by dehydration Treatment Clean water Electrolytes Vaccine Not antibiotics
SLIDE 6
Multiple Pandemics
1st – 1817 to 1823 Started at the Ganges, spread by colonialists 2nd – 1829 to 1849 Worldwide spread via immigrants 3rd – 1852 to 1859 John Snow first epidemiologist
SLIDE 7 First epidemiological study
John Snow Portrait painted in 1847 when he was 34 years
SLIDE 8
First epidemiological study
John Snow Cholera outbreak in Soho, London 1854 Plotted all cases on a map Found big cluster around water well
SLIDE 9
John Snow’s Map
First epidemiological study
SLIDE 10
On the mode of communication of Cholera 1854
SLIDE 11 Cholera caused by bacteria Outbreaks of cholera
SLIDE 12 Multiple Pandemics
1st – 1817 to 1823 Started at the Ganges, spread by colonialists 2nd – 1829 to 1849 Worldwide spread via immigrants 3rd – 1852 to 1859 John Snow first epidemiologist 4th – 1863 to 1879 Originated in mecca 5th – 1881 to 1896 First cholerae vaccine (1892) 6th – 1899 to 1923 Killed 800,000 people 7th – 1961 to present
- 1991: South America killed > 100,000 people
SLIDE 13
Haitian Outbreak
Earthquake Jan 12th, 2010 No cholera in Haiti for > 50 years First case, October 22nd, 2010 By February, 2011 250,000 cases and ~5,000 deaths What was the original source?
SLIDE 14 Haitian cholera outbreaks
http://www.ph.ucla.edu/
SLIDE 15 Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti
SLIDE 16
Cases by day – Mirebalais Hospital
SLIDE 17
Cases by Age – St Marc Hospital
On October 20th, 2010
SLIDE 18
Haitian Outbreak
Two hypotheses: Endemic, waterborne strain that has been in Haiti but not caused disease for 50 years Imported from another country
SLIDE 19
The environmental hypothesis
"They have been fortunate in Haiti that for 50 years the conditions have been such that they haven’t had an intense increase in cholera bacterial populations. ... But they’ve had an earthquake, they’ve had destruction, they’ve had a hurricane ... I think it’s very unfortunate to look for a scapegoat. It is an environmental phenomenon that is involved”
Rita Colwell Johns Hopkins School of Public Health
SLIDE 20
The human hypothesis
“The organism that is causing the disease is very uncharacteristic of (Haiti and the Caribbean), and is quite characteristic of the region from where the soldiers in the base came. ... I don't see there is any way to avoid the conclusion that an unfortunate and presumably accidental introduction of the organism occurred."
John Mekalanos Harvard Medical School
SLIDE 21 Conditions favor human hypothesis
Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti
SLIDE 22 Conditions favor human hypothesis
Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti
SLIDE 23 Conditions favor human hypothesis
Source: Final Report of the Independent Panel of Experts on the Cholera Outbreak in Haiti
SLIDE 24 Cholerae Mimicus Parahemolyticus Harveyi Vibrio cholerae from Bangladesh in 1994 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Bangladesh in 2002 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Haiti in 2010
On the source of Haitian cholera
SLIDE 25
Nepalese soldiers?
Outbreak in Khatmandu, Nepal before the soldiers left Outbreaks downstream (not upstream) along the river from the nepalese UN camp But that could have come from river trade. Ships used to fly the yellow flag when they were quarantined by cholera
SLIDE 26 Haitian cholera outbreaks
http://www.ph.ucla.edu/
SLIDE 27
Global evolution of Vibrio
Can we use genomics to identify the global evolution of Vibrio? Which gene(s) are important for temporal/spatial variation?
SLIDE 28 Prototype Vibrio cholerae sequence
TIGR Nature 406, 477-483(3 August 2000)
SLIDE 29
Sequenced genomes
2011 – 32 Vibrio strains sequenced
SLIDE 30
Fabiano Thompson's Lab @ UFRJ
SLIDE 31
Fundação Oswaldo Cruz
SLIDE 32
SLIDE 33
Ion quality scores
SLIDE 34
Sequenced genomes
2011 – 32 Vibrio strains sequenced 2011 – 171 Vibrio strains sequenced
SLIDE 35
Reads per chromosome (Chr. I)
SLIDE 36
Reads per chromosome (Chr. I)
SLIDE 37
Cholera Toxin Phage
SLIDE 38
Annotated using RAST
SLIDE 39 Merge frameshifts
- Merge adjacent genes if:
- Have the same function
- Have similarity to the same 3rd party protein
SLIDE 40
Frameshift frequency
SLIDE 41
Metabolic reconstruction considers the pathways present in the cell Based on comparison to Sanger sequenced V. cholerae El Tor strain (a virulent pathogen)
Is it any good?
SLIDE 42
SLIDE 43 R-18304 (Ion) El T
(Sanger)
SLIDE 44 R-18304 (Ion) El T
(Sanger)
SLIDE 45
Single nucleotide polymorphisms
ATCATCGATCAGCATGCATCAGCATCGATCAGC ATCATCGATCAGCATGCATCAGCATCGATCAGC ATCATCGATCAGCATGCATCAGCCTCGATCAGC ATCATCGATCAGCATGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGATCAGC ATCATCGATCAGCAAGCATCAGCCTCGAGCAGC ATCATCGATCAGCAAGCATCAGCCTCGAGCAGC
SLIDE 46 Global evolution
Mutreja et al 2011
SLIDE 47 Waves of spread of cholera
Mutreja et al 2011
SLIDE 48 Different evolution for each wave
Mutreja et al 2011
SLIDE 49 Cholerae Mimicus Parahemolyticus Harveyi Vibrio cholerae from Bangladesh in 1994 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Bangladesh in 2002 Vibrio cholerae from Haiti in 2010 Vibrio cholerae from Haiti in 2010
On the source of Haitian cholera
SLIDE 50
Evolution not only by SNPs
Conservation of the ~120 kb superintegron region across 210 strains
SLIDE 51 Horizontal gene transfer versus Vertical evolution
Mother Daughter Daughter
S N P s HGT
SLIDE 52 Niche dimensions
- 210 Vibrio genomes
- Reassembled
- Reannotated
- Find interesting
genes!
- Year
- Continent
- Country
- Lat/Lon Coordinates
- Clinical or Environmental
- Source
- Serogroup
- Serotype
- Biotype
- Mutreja wave
SLIDE 53 Serogroup
- V. cholerae classification
Vibrio cholerae Non-cholera toxin No disease Cholera toxin O1 O139 Classical El Tor Ogawa Inaba Epidemics
Biotype Serotype
SLIDE 54 Niche dimensions
- 210 Vibrio genomes
- Reassembled
- Reannotated
- Find interesting
genes!
- Year
- Continent
- Country
- Lat/Lon Coordinates
- Clinical or Environmental
- Source
- Serogroup
- Serotype
- Biotype
- Mutreja wave
SLIDE 55 Response variables
- 15,000 genes in the pangenome
- 933 subsystems (pathways) present in at least
- ne genome
- SNPs (after Mutreja)
SLIDE 56 Analysis
- Recreate evolution of the Vibrios
- What are the important genes for each niche
dimension
Who, what, when, where! Use random forests to identify important variables
SLIDE 57
Random Forest
O-antigen Exopoly- saccharide Capsule Sialic Acid DNA recomb. 01 10 20 5 10 01 10 20 5 10 01 10 20 5 10 0139 100 1 8 10 0139 100 1 8 10 0139 100 1 8 10 0139 100 1 8 10
SLIDE 58
O139 O1 Exopoly- saccharide
<50
Random Forest
O1 O139 DNA- recombination
10
O1 O139 Capsule
<10
SLIDE 59 Random Forest
Each tree votes on the importance
variable. Typically, run 10,000 trees
SLIDE 60
Genes important for who ? (serogroup)
SLIDE 61
Genes important for what? (clinical, environmental, ...)
SLIDE 62
Genes important for where? (continent)
SLIDE 63
Separation of functions by continent
SLIDE 64
Genes important for when? (year)
DNA Repair
SLIDE 65 DNA repair & phages
Normal DNA repair (134 strains) umuC umuD Additional DNA repair (4 strains; not O1) umuC umuD Phage borne DNA repair (72 strains) umuC prophage
SLIDE 66 Different evolution for each wave
Mutreja et al 2011 Waves 2 & 3 have phage Interrupted repair
SLIDE 67 Conclusions
- Unraveling evolution and spread of new
pathogens
- Mining genomes and niche dimensions
- Don't get scooped!
SLIDE 68 Discussion points
- What is the value of multi genome projects?
SLIDE 69 Current multigenome projects
Organism Number Organism Number
3,615 Mycobacterium tuberculosis 390
3,085 Salmonella in cattle and humans 373 Rice (Oryza sativa) 3,000 Vibrio 274
2,007 Shigella sonnei 263 Clostridium difficile 1,250 Mycobacterium tuberculosis 259 The thousand (human) genome project 1,092 Streptococcus pneumoniae 240 Mycobacterium tuberculosis 1,000 Methicillin-resistant Staphylococcus aereus 193 Plasmodium falciparum 825 Campylobacter jejuni 192 Streptococcus pneumoniae 616 Mycobacterium abscessus in CF 170 Nick Loman: http://lab.loman.net/