Big Data A Discovery Dinner College of Natural Sciences October - - PowerPoint PPT Presentation

big data
SMART_READER_LITE
LIVE PREVIEW

Big Data A Discovery Dinner College of Natural Sciences October - - PowerPoint PPT Presentation

Big Data A Discovery Dinner College of Natural Sciences October 13, 2015 Presenters: Karl Gebhardt, Astronomy Lauren Ancel Meyers, Integrative Biology Stefano Tiziani, Nutritional Sciences Moderated by Hans Hofmann, Integrative Biology


slide-1
SLIDE 1

Big Data

A Discovery Dinner College of Natural Sciences October 13, 2015

Presenters: Karl Gebhardt, Astronomy Lauren Ancel Meyers, Integrative Biology Stefano Tiziani, Nutritional Sciences Moderated by Hans Hofmann, Integrative Biology Mike Daniels, Statistics and Data Sciences

slide-2
SLIDE 2

Hobby-Eberly Telescope Dark Energy Experiment The upgraded HET

(Karl Gebhardt, Project Scientist)

slide-3
SLIDE 3

HETDEX is:

→ Blind spectrographic survey on 10m Hobby-Eberly Telescope → About 1 million galaxies around 10 billion years ago → About 2 million galaxies within 4 billion years ago → New instrument VIRUS with 33,500 fibers over focal plane → Generate about 0.3 Pb/yr for 5-10 years → Goal is to measure the expansion rate of Universe.

Data Issues:

→ Transfer, real-time analysis, storage on TACC → Access to a broad community → Develop techniques for understood processes and new exploratory studies

slide-4
SLIDE 4

ESSENCE of HETDEX

380,000 years after the Big Bang Today: 13.7 billion years after the Big Bang

  • There is a characteristic scale

imprinted in the distribution of matter.

  • It acts like a fingerprint that we

use to trace over cosmic time.

  • Requires about one million

galaxies in order to see it.

slide-5
SLIDE 5

HETDEX Analysis Issues

  • How to handle many Pb of data. We are working with TACC
  • n this. All data will be public.
  • Base programs are all in c++, with python wrappers for

metadata and access. Need developers who understand statistics, astrophysics, coding.

  • As is always the issue, large-scale systematics will be the

limiting factor for the uncertainties.

  • Example case: We will have 1e6 objects detected. We will

have 1e9 spectra of blank sky. Correlate detections against blank sky to pull out spatial signal from fainter sources.

  • Example case: Detection of cosmic web, which is

correlation over full region (3 Pb) of unknown pattern.

  • Example case: Real-time analysis to look for supernovae,

requires fast transfer, analysis, and feedback.

slide-6
SLIDE 6

Big data and the race to stop global pandemics

Lauren Ancel Meyers Department of Integrative Biology Department of Statistics & Data Sciences The University of Texas at Austin

slide-7
SLIDE 7

Where are outbreaks spreading today?

slide-8
SLIDE 8

Where will they be tomorrow?

slide-9
SLIDE 9

How can we stop them?

slide-10
SLIDE 10

 To obtain a metabolic signature of clinical samples

  • Early detection
  • Disease stratification
  • Disease progression
  • Therapy outcome

The Tiziani Research Group

 To identify novel top-hit most effective compound combinations for targeting the tumor microenvironment  To better understand cellular metabolism and its link to human diseases

slide-11
SLIDE 11

The Tiziani Research Group

This urine wheel was published in 1506 by Ulrich Pinder, in his book Epiphanie Medicorum used to diagnose disease

The Metabolome

slide-12
SLIDE 12

Thousands of features Metabolites/lipids validated Not detectable yet… Metabolites/lipids potentially identified 103 104 105 106 102

This image cannot currently be displayed.

Targeting the Metabolome is Challenging

slide-13
SLIDE 13

Stable Isotope Assisted-Metabolomics

13C 1H

slide-14
SLIDE 14
  • NMR- and MS-based metabolic

studies

(intra- and extra-cellular)

  • Metabolic fluxes analysis
  • Constraining simulations
  • 13C, 15N labeled nutrient
  • High-Throughput Screening

(e.g. ATP and ROS assays 384 well plate)

  • High-Content NMR- and MS-based

metabolic screening

(96 well plate, flasks)