Comments on Measurement Error Models in Astronomy by Brandon C. - - PowerPoint PPT Presentation



SLIDE 1

Comments on “Measurement Error Models in Astronomy” by Brandon C. Kelly

David Ruppert

Operations Research & Information Engineering and Department of Statistical Science, Cornell University

Jun 15, 2011

SLIDE 2

Introduction

  • These comments are personal views about measurement error modeling from someone who has:
    • worked in statistics for over 35 years
    • worked on measurement error for over 25 years
    • worked in astrostatistics for roughly one year
SLIDE 3

Theme

  • There are many special-purpose methods for handling measurement errors for particular models with certain assumptions
  • What is proposed here is a general approach that can be used in most, if not all, situations

SLIDE 4

My Perspective on Measurement Error Modeling

  • Bayesian approaches have much to offer
  • I prefer structural models
    • a Bayesian approach is inherently structural
  • Flexible parametric structural models can avoid the problems of low-dimensional parametric structural models
  • Careful modeling is essential
    • Example: pitfalls of orthogonal regression
SLIDE 5

Advantages of Bayesian Models

There are several advantages to taking a Bayesian approach to measurement error modeling:

1 Focuses attention on careful modeling

2 Makes efficient use of the information in the data

  • asymptotically efficient
  • optimal according to decision theory
  • all admissible estimators are Bayes for some prior

3 Inference is straightforward using credible intervals

  • these are similar in practice to confidence intervals
SLIDE 6

Advantages of Bayesian Models, cont.

4 Allows the use of prior information

  • but can use diffuse priors when there is little prior information

5 The true values of mismeasured data can be treated in the same way as unknown parameters

  • To a Bayesian, anything unknown is random
  • and one conditions on everything known
  • MCMC multiply imputes the unknown true values of the mismeasured variables

6 Bayesian analysis works for virtually any problem

SLIDE 7

Example: measurement error in a nonlinear model

Example: quadratic regression simulation

Y_i = α + β X_i + γ X_i² + ε_i (regression model)

  • ε_i iid ∼ N(0, σ²_ε)

W_ij = X_i + U_ij, j = 1, 2 (measurement model)

  • U_ij iid ∼ N(0, σ²_U)
  • σ²_U unknown
  • W̄_i = (W_i1 + W_i2)/2

X_i iid ∼ N(μ_x, σ²_x) (structural model)
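A quick way to see why measurement error matters here: under this setup, the naive fit that regresses Y on the averaged measurement attenuates the curvature. A minimal Python simulation, with hypothetical parameter values (the slides do not state the ones actually used), illustrates this:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 3000
# Hypothetical true parameters, chosen only for illustration
alpha, beta, gamma = 0.0, -2.0, 0.3
mu_x, sig_x, sig_u, sig_e = 5.0, 2.0, 1.0, 0.5

x = rng.normal(mu_x, sig_x, n)              # structural model: X_i ~ N(mu_x, sig_x^2)
w = x + rng.normal(0.0, sig_u, (2, n))      # measurement model: two replicates W_i1, W_i2
wbar = w.mean(axis=0)                       # averaged measurement W-bar_i
y = alpha + beta * x + gamma * x**2 + rng.normal(0.0, sig_e, n)

# Leading (quadratic) coefficients of least-squares fits
g_true = np.polyfit(x, y, 2)[0]             # oracle fit on the true covariate
g_naive = np.polyfit(wbar, y, 2)[0]         # naive fit ignoring measurement error

print(g_true, g_naive)  # the naive quadratic coefficient is attenuated toward 0
```

For a normal structural model, the attenuation of the quadratic coefficient is roughly the square of the reliability ratio σ²_x / (σ²_x + σ²_U/2), which is why averaging the two replicates reduces, but does not remove, the bias.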

SLIDE 8

Example: measurement error in a nonlinear model

[Figure: scatter plot of y against the true covariate (x) and the averaged measurement (w), with fitted quadratic curves labeled "no error", "naive", and "Bayes"]

SLIDE 9

Example: measurement error in a nonlinear model

R2WinBUGS output: 5 chains each of length 35,000

[Figure: MCMC trace plots and posterior density estimates for beta, the deviance, gamma, and x[4]]

SLIDE 10

BUGS program

model{
  for(i in 1:N){
    w1[i] ~ dnorm(x[i],tauw)
    w2[i] ~ dnorm(x[i],tauw)
    x[i] ~ dnorm(mux,taux)
    y[i] ~ dnorm(muy[i],taue)
    muy[i] <- alpha + beta*x[i] + gamma*x[i]*x[i]
  }
  mux ~ dnorm(0.0,1.0E-6)
  alpha ~ dnorm(0.0,1.0E-6)
  beta ~ dnorm(0.0,1.0E-6)
  gamma ~ dnorm(0.0,1.0E-6)
  tauw ~ dgamma(0.1,0.01)
  taux ~ dgamma(0.1,0.01)
  taue ~ dgamma(0.1,0.01)
}

SLIDE 11

Structural models

It is often reasonable to assume that the true covariates ξ1, . . . , ξn come from some probability distribution

  • This assumption justifies using a structural model
  • Any Bayesian model must be structural because, to a Bayesian, any unknown is random
SLIDE 12

Parsimonious structural models

  • Structural models often assume that the distribution of the true covariates is Gaussian or in some other low-dimensional parametric family
  • This is done for simplicity and parsimony
  • Often conclusions are robust (= insensitive) to this assumption
  • But not always
SLIDE 13

Flexible structural models

Sometimes we are worried about the nonrobustness of a structural model

  • An alternative is to use a high-dimensional parametric family to model the true covariate distribution
  • Flexible parametric families are, in effect, nonparametric
  • Two examples:
    • splines
    • mixture distributions
  • One needs to guard against overfitting (= undersmoothing)
    • Bayesian methods can do this automatically
SLIDE 14

Splines

  • B-splines are nonnegative and have minimal support
  • They can be normalized to be densities
  • A convex combination of normalized B-splines is a density
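The construction above can be sketched in a few lines of Python (SciPy's `BSpline`; the knot sequence and uniform weights here are illustrative choices, not from the talk):

```python
import numpy as np
from scipy.interpolate import BSpline

degree = 2                                    # quadratic B-splines
inner = np.linspace(0.0, 1.0, 8)              # illustrative knot sequence on [0, 1]
knots = np.r_[[0.0] * degree, inner, [1.0] * degree]
nbasis = len(knots) - degree - 1

grid = np.linspace(0.0, 1.0, 2001)
dx = grid[1] - grid[0]
basis = np.empty((nbasis, grid.size))
for j in range(nbasis):
    coef = np.zeros(nbasis)
    coef[j] = 1.0
    # Evaluate the j-th B-spline; it is zero (NaN -> 0) outside its small support
    b = np.nan_to_num(BSpline(knots, coef, degree, extrapolate=False)(grid))
    basis[j] = b / (b.sum() * dx)             # normalize so each basis integrates to 1

weights = np.full(nbasis, 1.0 / nbasis)       # any nonnegative weights summing to 1
density = weights @ basis                     # convex combination is again a density

print(round(density.sum() * dx, 6))
```

Because each normalized B-spline is nonnegative and integrates to one, any convex combination of them is automatically a valid density; estimating the weights then amounts to a flexible parametric density model.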
SLIDE 15

B-splines

[Figure: 0-degree, linear, and quadratic B-spline basis functions on [0, 1]]

SLIDE 16

Density Estimation by Splines

Staudenmayer, J., Ruppert, D., and Buonaccorsi, J. (2008) Density estimation in the presence of heteroskedastic measurement error, JASA, 103, 726–736.

  • Estimates the variance function
    • this is the conditional variance of the measurement error given the true covariate value
  • Uses splines
  • Could be part of a structural model
  • Bayesian
    • so can easily be modified to handle similar problems
SLIDE 17

Focus on Modeling, not Algorithms

  • Careful statistical modeling is always important
  • Focusing on algorithms/estimators is potentially a distraction
  • The distinction between measurement error and equation error was an important conceptual advance

SLIDE 18

Heteroskedastic Error

  • In simple cases, one replaces a constant error variance by an average variance
    • The Akritas and Bershady (1996) method is a nice example
  • For nonparametric modeling (local estimation) this approach fails
  • Staudenmayer and Ruppert found that, for nonparametric density estimation, using the average variance
    • overcorrects where the actual variance is smaller than average
    • undercorrects where the actual variance is larger than average
  • Their Bayesian estimator does not have these flaws
SLIDE 19

Orthogonal Regression Setup

The orthogonal regression (OR) model is

Y_true = β0 + β1 X (no equation error)
Y = Y_true + ε
W = X + U

It is assumed that we "know"

η = var(Y|X) / var(W|X) = σ²_ε / σ²_U

SLIDE 20

Orthogonal Regression Estimator

The OR estimator can be viewed as a functional estimator that treats X1, . . . , Xn as unknown parameters. β0, β1, X1, . . . , Xn are estimated by minimizing

∑_{i=1}^{n} [ η⁻¹ (Y_i − β0 − β1 X_i)² + (W_i − X_i)² ]

or, equivalently (the two criteria differ only by the constant factor σ²_U),

∑_{i=1}^{n} [ σ⁻²_ε (Y_i − β0 − β1 X_i)² + σ⁻²_U (W_i − X_i)² ]

over (β0, β1, X1, . . . , Xn)
SLIDE 21

The Pitfall of OR

  • The danger is that it is easy to misapply OR in the presence of equation error
  • This leads to overcorrection if one uses

    η = σ²_ε / σ²_U

  • Instead one should use

    η_EE := var(Y|X) / var(W|X) = (σ²_Q + σ²_ε) / σ²_U

  • σ²_Q is the equation error variance
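A small Python simulation (with a hypothetical slope and variances, not values from the talk) illustrates the pitfall: when equation error is present, the OR slope computed with η = σ²_ε/σ²_U overshoots the true slope, while η_EE recovers it. The closed-form Deming-regression slope is used for the OR minimizer:

```python
import numpy as np

def deming_slope(w, y, eta):
    """OR (Deming) slope for an assumed ratio eta = var(Y|X) / var(W|X)."""
    sww = np.var(w)
    syy = np.var(y)
    swy = np.cov(w, y, bias=True)[0, 1]
    d = syy - eta * sww
    return (d + np.sqrt(d * d + 4.0 * eta * swy**2)) / (2.0 * swy)

rng = np.random.default_rng(1)
n = 20_000
# Hypothetical variances: equation error, measurement error in Y, error in W
sig_q2, sig_e2, sig_u2 = 1.0, 0.25, 1.0
x = rng.normal(0.0, 1.0, n)
# True line has intercept 0 and slope 1, plus equation error Q and error eps
y = x + rng.normal(0.0, np.sqrt(sig_q2), n) + rng.normal(0.0, np.sqrt(sig_e2), n)
w = x + rng.normal(0.0, np.sqrt(sig_u2), n)

b_naive = deming_slope(w, y, sig_e2 / sig_u2)           # ignores equation error
b_ee = deming_slope(w, y, (sig_q2 + sig_e2) / sig_u2)   # uses eta_EE

print(b_naive, b_ee)  # b_naive overshoots the true slope 1; b_ee is close to 1
```

With these population values, η = 0.25 gives an OR slope near 1.88 (severe overcorrection), while η_EE = 1.25 gives a slope near the true value 1.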

SLIDE 22

Bayesian Inference for Very Large Data Sets

  • Simple problems (measurement error in linear models)
    • look for low-dimensional sufficient statistics
  • Bliznyuk, Ruppert, Shoemaker, et al. (2008, 2011, 2012), JCGS
    • Bayesian inference with computationally expensive posterior densities
    • radial basis function emulator of the log-posterior
    • adaptive design
      • focuses on the high posterior density region
      • which might be 0.1% of the volume of the parameter space
  • Emulators for high-dimensional parameter spaces will be challenging
    • and for measurement error models the true covariate values are parameters
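The emulator idea can be sketched as follows (Python/SciPy; the cheap quadratic log-posterior is a toy stand-in for a computationally expensive one, and the one-shot uniform design omits the adaptive step):

```python
import numpy as np
from scipy.interpolate import RBFInterpolator

# Toy stand-in for an expensive log-posterior (2-D Gaussian, illustrative only)
def log_post(theta):
    return -0.5 * np.sum(theta**2, axis=-1)

rng = np.random.default_rng(2)
design = rng.uniform(-3.0, 3.0, (200, 2))            # design points in parameter space
emulator = RBFInterpolator(design, log_post(design))  # radial basis function surrogate

# The surrogate is cheap to evaluate and close to the truth on the design region
test = rng.uniform(-2.0, 2.0, (500, 2))
max_err = np.max(np.abs(emulator(test) - log_post(test)))
print(max_err)
```

In the actual method, design points are added adaptively so that most of the evaluation budget is spent in the high posterior density region rather than spread uniformly as here.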

SLIDE 23

Summary

A Bayesian framework focuses on the three components of the model

  • structural model for the true covariates
  • measurement model for the measurement errors
  • regression model for the conditional distribution of the response given the true covariate values

After modeling, inference is relatively automatic (and efficient)