Computational, Statistical, and Mathematical Challenges in - - PowerPoint PPT Presentation

computational statistical and mathematical challenges in
SMART_READER_LITE
LIVE PREVIEW

Computational, Statistical, and Mathematical Challenges in - - PowerPoint PPT Presentation

Christopher J. Miller Asst. Professor, Astronomy Computational, Statistical, and Mathematical Challenges in Astronomy The Challenges The Demand for Higher Precision Science The Hubble Constant The Challenges The Demand for Higher


slide-1
SLIDE 1
slide-2
SLIDE 2

Computational, Statistical, and Mathematical Challenges in Astronomy

Christopher J. Miller

  • Asst. Professor,

Astronomy

slide-3
SLIDE 3

The Challenges

The Demand for Higher Precision Science

The Hubble Constant

slide-4
SLIDE 4

The Challenges

The Demand for Higher Precision Science

The Hubble Constant

slide-5
SLIDE 5

The Challenges

The Demand for Higher Precision Science

The Hubble Constant

slide-6
SLIDE 6

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameter

slide-7
SLIDE 7

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameters There is more to astronomy than cosmology

Galaxies have mass and stellar populations with ages, metallicities, star formation histories

slide-8
SLIDE 8

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameters There is more to astronomy than cosmology

Galaxies have mass and stellar populations with ages, metallicities, star formation histories Stars (which make up galaxies) have mass, temperatures, ages, abundances Planets (around stars) have masses, compositions, atmospheres, orbital parameters

slide-9
SLIDE 9

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameters There is more to astronomy than cosmology

Galaxies have mass and stellar populations with ages, metallicities, star formation histories Stars (which make up galaxies) have mass, temperatures, ages, abundances

slide-10
SLIDE 10

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameters There is more to astronomy than cosmology

Galaxies have mass and stellar populations with ages, metallicities, star formation histories Stars (which make up galaxies) have mass, temperatures, ages, abundances Planets (around stars) have masses, compositions, atmospheres, orbital parameters

slide-11
SLIDE 11

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameters There is more to astronomy than cosmology

Galaxies have mass and stellar populations with ages, metallicities, star formation histories Stars (which make up galaxies) have mass, temperatures, ages, abundances Planets (around stars) have masses, compositions, atmospheres, orbital parameters

slide-12
SLIDE 12

The Challenges

The Demand for Higher Precision Science

The Hubble Constant One of dozens of Cosmological Parameters There is more to astronomy than cosmology

Galaxies have mass and stellar populations with ages, metallicities, star formation histories Stars (which make up galaxies) have mass, temperatures, ages, abundances Planets (around stars) have masses, compositions, atmospheres, orbital parameters

slide-13
SLIDE 13

The Challenges

The “Data Flood”

Astronomical catalogs today contain about 1.5 billion objects (SDSS is ~300 million, IRSA is ~1 billion).

A factor of 20 smaller than the largest commercial DBs

LSST (~2020) will have ~50 billion objects

Large by today's standards. But average (or even small) by 2015

Astronomy has “Real World” DB challenges

UPS Verizon Caixa 10000 20000 30000 40000 50000 60000 70000 80000 90000 2003 2005

Data volumes grow as well.....20 times increase from 2003-2005

slide-14
SLIDE 14

The Challenges

Astronomy Data is Distributed

slide-15
SLIDE 15

The Challenges

Astronomy Data is Distributed

slide-16
SLIDE 16

The Solutions

Astronomical data creates opportunities for Computer Science, Information Technologies, and Statistics Astronomy provides the (interesting) datasets, the distributed network, and the scientific questions IT connects the network CS handles the datasets and algorithms MATHEMATICS and STATISTICS quantifies the answers

slide-17
SLIDE 17

The Solutions

 Possible Detection of Baryonic Fluctuations in the Large-Scale Structure Power Spectrum: Miller, Nichol, Batuski

2001, ApJ

 Acoustic Oscillations in the Early Universe and Today: Miller, Nichol, Batuski 2001, Science  Controlling the False Discovery Rate in Astrophysical Data Analysis: Miller, Genovese, Nichol, Wasserman,

Connolly Reichart, Hopkins, Schneider, Moore, 2001 AJ

 A new source detection algorithm using FDR Hopkins, Miller, Connolly, Genovese, Nichol, Wasserman, 2002 AJ  A non-parametric analyss of the CMB Power Spectrum Miller, Genovese, Nichol Wasserman, ApJ  Non–parametric Inference in Astrophysics, Wasserman, Miller, Nichol, Genovese, Jang, Connolly, Moore,

Schneider, 2002

 Detecting the Baryons in Matter Power Spectra Miler, Nichol, Chen 2002 ApJ  Galaxy ecology: groups and low-density environments in the SDSS and 2dFGRS Balogh, Eke, Miller, Gray et al.

2002, MNRAS

 The Clustering of AGN in the SDSS Wake, Miller, Di Matteo, Nichol, Pope, Szalay, Gray, Schnieder, York 2004

ApJ

 Nonparametric Inference for the Cosmic Microwave Background Genovese, Miller, Nichol, Arjunwadkar,

  • Wasserman. 2004 Annals of Statistics

 The C4 Clustering Algorithm: Clusters of Galaxies in the SDSS Miller et al. 2005 AJ  The Effect of Large-Scale Structure on the SDSS Galaxy Three–Point Correlation Function Nichol et al. 2006,

MNRAS

 Mapping the Cosmological Confidence Ball Surface Bryan, Schneider, Miller, Nichol, Genovese, Wasserman, 2007

ApJ

 Inference for the Dark Energy Equation of State Using Type Ia SN data Genovese, Freeman, Wasserman, Nichol,

Miller 2008, Annals of Statistics

slide-18
SLIDE 18

Example: Non-parametric fits of the CMB

COBE WMAP

slide-19
SLIDE 19

Example: Non-parametric fits of the CMB

COBE WMAP Fully non-parametric Fully parameterized

slide-20
SLIDE 20

Confidence Balls: Pictorially

r

  • 1. obtain experimental data
  • 2. compute non-parametric

fit

  • 3. compute confidence ball
  • 4. Iterate through

parameters to determine confidence.

slide-21
SLIDE 21

Deriving Confidence Intervals

θ1 θ2

θ1 θ2

slide-22
SLIDE 22

Cosmological Confidence Intervals

½σ 38% 1σ 68% 1½σ 86% 2σ 95%

Color Key

slide-23
SLIDE 23

Including Assumptions

(Bryan et al., ApJ 2007)

slide-24
SLIDE 24

Including Assumptions

(Bryan et al., ApJ 2007)

Assumes 60 ≤ H0 ≤ 75 (km/s)/Mpc

slide-25
SLIDE 25

95% χ2 confidence regions from based on Davis et al. (2007) data

What Does “Convergence” Mean?

θ = {H0, ΩM, ΩΛ} p1 = {65, 0.23, ?} p2 = {0.02, 42, ?}

To prove p1 is within the confidence region: ∃ ΩΛ such that m(θ) is accepted To prove p is not within the region: ∀ ΩΛ, m(θ) is rejected can’t check all ΩΛ

slide-26
SLIDE 26

A Summary of the INCA Group Activities in Astronomy

Originated at Carnegie Mellon University and the University of Pittsburgh (PiCA Group). Membership expanded and members moved so that we changed the name to the INternational Computational Astrostatistics Group). A loose group of committed researchers (no formal structure) Astronomy provides the data and drives the science Real work is done in developing, proving, and applying novel statistical methods and computational algorithms to astronomical datasets Success is shared equally amongst the domains What are we interested in?

slide-27
SLIDE 27

A Summary of the INCA Group Activities in Astronomy

Originated at Carnegie Mellon University and the University of Pittsburgh (PiCA Group). Membership expanded and members moved so that we changed the name to the INternational Computational Astrostatistics Group). A loose group of committed researchers (no formal structure) Astronomy provides the data and drives the science Real work is done in developing, proving, and applying novel statistical methods and computational algorithms to astronomical datasets Success is shared equally amongst the domains What are we interested in?

Parametrics Dis-entangling multiple -components via Expectation Maximization Nonparametrics Reducing the size of the error ellipse Non-linear SVM-like spaces Focusing the available model space High-dimensional searches and surface fitting Constraining (as opposed to finding) the truth