data approach to integrative
play

data approach to integrative evolutionary histories Fabia U. - PowerPoint PPT Presentation

CENTER FOR DATA SCIENCE SCIENCE Strength in Numbers AND BIG D BIG DATA ANALYTICS Earth, genomes, and time: a big data approach to integrative evolutionary histories Fabia U. Battistuzzi Biological Sciences battistu@oakland.edu Dec.


  1. CENTER FOR DATA SCIENCE SCIENCE Strength in Numbers AND BIG D BIG DATA ANALYTICS Earth, genomes, and time: a big data approach to integrative evolutionary histories Fabia U. Battistuzzi Biological Sciences battistu@oakland.edu Dec. 1, 2016

  2. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Nothing in biology makes sense except in the light of evolution T. Dobzhansky • Past history of life is a predictor of current and Past future changes • Medical field • Climate science • Astrobiology Evolution • Conservation biology • Sustainable energy • … Present Future

  3. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Nothing in biology makes sense except in the light of evolution T. Dobzhansky • Genomes are repositories of billions of data points (DNA bases) • Human genome: 3 billion DNA base pairs • 7 billion individuals on Earth • 2.1 e+19 base pairs • Species estimates: 10 million to 1 trillion • Many will be much smaller than us (< 1 million base pairs) • Many are much larger than us (up to ~150 billion base pairs)

  4. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Nothing in biology makes sense except in the light of evolution T. Dobzhansky

  5. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Nothing in biology makes sense except in the light of evolution T. Dobzhansky • Comparative genomics • How and where did life originate • And where should we look for other life (Astrobiology)

  6. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Nothing in biology makes sense except in the light of evolution T. Dobzhansky • Comparative genomics • How did life survive on Earth through major environmental changes • Microbes are the longest living lineages on Earth (~4 billion years) • They survived and thrived during planetary-scale climate changes (conservation and sustainability)

  7. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Nothing in biology makes sense except in the light of evolution T. Dobzhansky • Comparative genomics • How do pathogens escape our immune system and drugs • What changes at the genomic level allow them to adapt? • How do pathogens arise and spread?

  8. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Evolution in the Blab • Early life evolution • How to accurately reconstruct the evolutionary histories of microbes on Earth • Conditions for life to thrive • Adaptations that sustained microbial life through climate changes • Rate of species diversification

  9. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Evolution in the Blab • Evolution of malaria • How does it adapt to humans and other hosts? • Are genomes evolving differently depending on the host? AT-balance • Are genes involved in antimalarial drug resistance evolving faster? AT-rich

  10. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Evolution in the Blab • Students involvement • Undergrads & grads (current size: 11+1 students) • Skills • Scripting/programming • Phylogenetics • Comparative genomics • Data mining • Statistics

  11. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS New opportunities with CDaS • Explore new statistical applications for Big Data • Systematic bias • False discovery rates Kumar et al 2012

  12. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS New opportunities with CDaS • Connect comparative and functional genomics • Text mining of functional databases • Integration of multiple databases

  13. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS New opportunities with CDaS • Explore new strategies to gain computational support • On-site high- performance computational cluster (working on it…) • Cloud-based and off- site clusters (exploring options…) • Programming, database architecture

  14. CENTER FOR DATA SC SCIENC IENCE Strength in AND BIG BIG DATA A Numbers ANALYTICS Contact info battistu@oakland.edu 340 Dodge Hall Biological Sciences Oakland University

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend