Checking Assumptions Normal distributions: use probability plot (or - PowerPoint PPT Presentation

ST 516 Experimental Statistics for Engineers II Checking Assumptions Normal distributions: use probability plot (or quantile-quantile plot); straight line implies normal distribution: Normal Q−Q Plot Normal Q−Q Plot 1.5 1.5 ● Theoretical Quantiles Theoretical Quantiles ● ● 0.5 0.5 ● ● ● −0.5 −0.5 ● ● ● −1.5 −1.5 ● 16.4 16.4 16.6 16.6 16.8 16.8 17.0 17.0 17.2 17.2 17.4 17.4 Sample Quantiles Sample Quantiles 1 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II In R, use qqnorm to make qq-plots: qqnorm(cement[ , "Unmodified"], datax = TRUE); qqnorm(cement[ , "Modified"], datax = TRUE); Note: by default, qqnorm plots the empirical quantiles on the y -axis and the theoretical quantiles on the x -axis; the datax option reverses this, to match the “normal probability plot” as used in the book. Overlaying qq-plots is a little more work: qqnorm(cement[ , "Unmodified"], pch = 22, ylim = c(16.3, 17.4), datax = TRUE); par(new = TRUE); qqnorm(cement[ , "Modified"], pch = 21, ylim = c(16.3, 17.4), datax = TRUE); 2 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II Sample Size Recall the hypotheses: Null hypothesis H 0 : µ 1 = µ 2 Alternate hypothesis H 1 : µ 1 � = µ 2 . Two types of error: Type I error: reject H 0 when it is true; Type II error: fail to reject H 0 when it is false. P(Type I error) = α , P(Type II error) = β . 3 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II We usually choose α at say . 05 (or . 01, or . 10, or ...). For a given sample size n , β depends on the difference between the true means, δ = | µ 1 − µ 2 | . The operating characteristic curve , or O.C. curve, is a graph of β against δ . A graph usually shows the O.C. curves for various sample sizes n , which gives guidance about sample size. 4 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II Checking Assumptions Equal variances: informally, compare standard deviations; also compare slopes in q-q plots; formally, use F -test for ratio of variances. 5 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II If Variances are Unequal Estimated standard error of ¯ y 1 − ¯ y 2 is now � S 2 + S 2 1 2 , n 1 n 2 so the test statistic is y 1 − ¯ ¯ y 2 t 0 = ; � S 2 n 1 + S 2 1 2 n 2 6 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II Under H 0 : µ 1 = µ 2 , t 0 is approximately t -distributed (Welch’s approximation) with degrees of freedom � 2 � S 2 + S 2 1 2 n 1 n 2 ν = . � 2 � 2 � S 2 � S 2 1 2 n 1 n 2 n 1 − 1 + n 2 − 1 7 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II R command for equal variances t.test(cement$Modified, cement$Unmodified, var.equal = TRUE) Output Two Sample t-test data: cement$Modified and cement$Unmodified t = -2.1869, df = 18, p-value = 0.0422 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -0.54507339 -0.01092661 sample estimates: mean of x mean of y 16.764 17.042 8 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II R command for unequal variances t.test(cement$Modified, cement$Unmodified, var.equal = FALSE) # var.equal = FALSE is the default, so it could be omitted Output Welch Two Sample t-test data: cement$Modified and cement$Unmodified t = -2.1869, df = 17.025, p-value = 0.043 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -0.54617414 -0.00982586 sample estimates: mean of x mean of y 16.764 17.042 Note: fewer (non-integer) degrees of freedom; slightly higher p -value; slightly wider confidence interval. 9 / 15 Simple Comparative Experiments Inference About Differences in Means

ST 516 Experimental Statistics for Engineers II Paired Comparisons In some situations, we can manage at least part of the variability in the measurements. Example: measuring hardness, comparing two instruments (tips). Making 10 measurements with each tip, we could: Choose 20 specimens, divide them into two groups of 10, and test each group with one tip; Choose 10 specimens, and test each with both tips: paired measurements. 10 / 15 Simple Comparative Experiments Paired Comparisons

ST 516 Experimental Statistics for Engineers II Results for a paired experiment Specimen Tip 1 Tip 2 Difference 1 7 6 1 2 3 3 0 3 3 5 -2 4 4 3 1 5 8 8 0 6 3 2 1 7 2 4 -2 8 9 9 0 9 5 4 1 10 4 5 -1 Mean 4.8 4.9 -0.1 S.D. 2.39 2.23 1.20 11 / 15 Simple Comparative Experiments Paired Comparisons

ST 516 Experimental Statistics for Engineers II Differences have less variability than individual measurements, because specimen-to-specimen variations cancel out. Statistical model y i , j = µ i + β j + ǫ i , j , i = 1 , 2 , j = 1 , 2 , . . . , n . Here: y i , j = measurement for tip i in pair j ; µ i = mean strength for tip i , averaged across conditions; β j = deviation from overall mean strength for pair j ; ǫ i , j ∼ N (0 , σ 2 i ). 12 / 15 Simple Comparative Experiments Paired Comparisons

ST 516 Experimental Statistics for Engineers II Analysis form within-pair differences d j = y 1 , j − y 2 , j = µ 1 − µ 2 + ǫ 1 , j − ǫ 2 , j ; pair deviation β j cancels out, so d j ∼ N ( µ 1 − µ 2 , σ 2 1 + σ 2 2 = σ 2 d ); standard error of ¯ � σ 2 d is d / n ; Sample variance S 2 d estimates σ 2 d . The test statistic: ¯ d S d / √ n ; t 0 = Under H 0 : µ 1 = µ 2 , t 0 is t -distributed with n − 1 degrees of freedom. 13 / 15 Simple Comparative Experiments Paired Comparisons

ST 516 Experimental Statistics for Engineers II R command for paired data tips <- read.table("data/tips.txt") t.test(tips$Tip1, tips$Tip2, paired = TRUE) Output Paired t-test data: tips$Tip1 and tips$Tip2 t = -0.2641, df = 9, p-value = 0.7976 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -0.9564389 0.7564389 sample estimates: mean of the differences -0.1 14 / 15 Simple Comparative Experiments Paired Comparisons

ST 516 Experimental Statistics for Engineers II Comparison with Unpaired Design If each run was made on randomly selected materials with no pairing: y i , j = µ i + β i , j + ǫ i , j , i = 1 , 2 , j = 1 , 2 , . . . , n : now Var( y i , j ) = σ 2 β + σ 2 i ; denominator of t 0 is larger, making test less sensitive; degrees of freedom are 2( n − 1), making test more sensitive; on balance, test is usually more sensitive, so pairing is good. Pairing helps when within-pair variation is much less than among-pair variation. 15 / 15 Simple Comparative Experiments Paired Comparisons

Checking Assumptions Normal distributions: use probability plot (or - PowerPoint PPT Presentation

ST 516 Experimental Statistics for Engineers II Checking Assumptions Normal distributions: use probability plot (or quantile-quantile plot); straight line implies normal distribution: Normal QQ Plot Normal QQ Plot 1.5 1.5

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

Checking & Spot-Checking the Correctness of Priority Queues Matthew Chu & Sampath Kannan

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

3. Satisfiability Checking 3.1 SAT-Checking Procedures Verification Technology

Hoare Logic and Model Checking Model Checking Lecture 11: Model checking for Computation Tree

Model checking perhaps the most important part of applied statistical modelling Simon Wood

2011 SERTP Input 2011 SERTP Input Assumptions Assumptions 1 2011 Load Forecast 2011 Load

CTL Chapter 6 Part 2 Overview Review CTL Model Checking CTL model Checking algorithms

Model-checking in systems biology - From Micro to Macro 1 / 62 00001 - 00:00:01 Model-checking

Scalable Multi-Core Model Checking Alfons Laarman ( alfons@laarman.com ), Theory joint work with

Type checking COMP 520 Fall 2010 Type checking (2) The type checker has severals tasks:

Mechanized Metatheory Model-Checking WMM 2006 James Cheney 9/21/06 Mechanized Metatheory

Estimating Mean and Need to Consider . . . Variance under Interval Simplest Case: . . .

Outlines Weighted Graph Cuts without Eigenvectors: A Multilevel Approach (PAMI 2007) User-Guided

Understanding flash reconstruction Bruce Howard and Denver Whittington DUNE PD Sim Meeting 22

Stat 5101 Lecture Slides Deck 1 Charles J. Geyer School of Statistics University of Minnesota

Fast leakage assessment Oscar Reparaz COSIC / KU Leuven Benedikt Gierlichs CHES 2017 Taipei

Why use R? Introduction to R: To perform inferential statistics (e.g., use a statistical

University of the Basque Country (EHU) Systems for the NIST 2011 LRE Mikel Penagarikano, Amparo

Data privacy: introduction Vicen c Torra January 15, 2018 Privacy, Information and

Checking Assumptions Normal distributions: use probability plot (or - PowerPoint PPT Presentation

ST 516 Experimental Statistics for Engineers II Checking Assumptions Normal distributions: use probability plot (or quantile-quantile plot); straight line implies normal distribution: Normal QQ Plot Normal QQ Plot 1.5 1.5

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

Checking &amp; Spot-Checking the Correctness of Priority Queues Matthew Chu &amp; Sampath Kannan

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

3. Satisfiability Checking 3.1 SAT-Checking Procedures Verification Technology

Hoare Logic and Model Checking Model Checking Lecture 11: Model checking for Computation Tree

Model checking perhaps the most important part of applied statistical modelling Simon Wood

2011 SERTP Input 2011 SERTP Input Assumptions Assumptions 1 2011 Load Forecast 2011 Load

CTL Chapter 6 Part 2 Overview Review CTL Model Checking CTL model Checking algorithms

Model-checking in systems biology - From Micro to Macro 1 / 62 00001 - 00:00:01 Model-checking

Scalable Multi-Core Model Checking Alfons Laarman ( alfons@laarman.com ), Theory joint work with

Type checking COMP 520 Fall 2010 Type checking (2) The type checker has severals tasks:

Mechanized Metatheory Model-Checking WMM 2006 James Cheney 9/21/06 Mechanized Metatheory

Estimating Mean and Need to Consider . . . Variance under Interval Simplest Case: . . .

Outlines Weighted Graph Cuts without Eigenvectors: A Multilevel Approach (PAMI 2007) User-Guided

Understanding flash reconstruction Bruce Howard and Denver Whittington DUNE PD Sim Meeting 22

Stat 5101 Lecture Slides Deck 1 Charles J. Geyer School of Statistics University of Minnesota

Fast leakage assessment Oscar Reparaz COSIC / KU Leuven Benedikt Gierlichs CHES 2017 Taipei

Why use R? Introduction to R: To perform inferential statistics (e.g., use a statistical

University of the Basque Country (EHU) Systems for the NIST 2011 LRE Mikel Penagarikano, Amparo

Data privacy: introduction Vicen c Torra January 15, 2018 Privacy, Information and

Checking & Spot-Checking the Correctness of Priority Queues Matthew Chu & Sampath Kannan