CS 147: Computer Systems Performance Analysis Replicated Binary - PowerPoint PPT Presentation

CS147 2015-06-15 CS 147: Computer Systems Performance Analysis Replicated Binary Designs CS 147: Computer Systems Performance Analysis Replicated Binary Designs 1 / 44

Overview CS147 Overview 2015-06-15 2 k r Designs 2 2 r Designs Effects Analysis of Variance Confidence Intervals Predictions Verification Overview Multiplicative Models Example General 2 k r Designs 2 k r Designs 2 2 r Designs Effects Analysis of Variance Confidence Intervals Predictions Verification Multiplicative Models Example General 2 k r Designs 2 / 44

2 k r Designs 2 k Factorial Designs With Replications 2 k Factorial Designs With Replications CS147 2015-06-15 2 k r Designs ◮ 2 k factorial designs do not allow for estimation of experimental error ◮ No experiment is ever repeated ◮ Error is usually present 2 k Factorial Designs With Replications ◮ And usually important ◮ Handle issue by replicating experiments ◮ But which to replicate, and how often? ◮ 2 k factorial designs do not allow for estimation of experimental error ◮ No experiment is ever repeated ◮ Error is usually present ◮ And usually important ◮ Handle issue by replicating experiments ◮ But which to replicate, and how often? 3 / 44

2 k r Designs 2 k r Factorial Designs CS147 2 k r Factorial Designs 2015-06-15 2 k r Designs ◮ Replicate each experiment r times ◮ Allows quantifying experimental error ◮ Again, easiest to first look at case of only 2 factors 2 k r Factorial Designs ◮ Replicate each experiment r times ◮ Allows quantifying experimental error ◮ Again, easiest to first look at case of only 2 factors 4 / 44

2 k r Designs 2 2 r Designs 2 2 r Factorial Designs CS147 2 2 r Factorial Designs 2015-06-15 2 k r Designs ◮ 2 factors, 2 levels each, with r replications at each of the four combinations 2 2 r Designs ◮ y = q 0 + q A x A + q B x B + q AB x A x B + e ◮ Now we need to compute effects, estimate errors, and allocate variation 2 2 r Factorial Designs ◮ Can also produce confidence intervals for effects and predicted responses ◮ 2 factors, 2 levels each, with r replications at each of the four combinations ◮ y = q 0 + q A x A + q B x B + q AB x A x B + e ◮ Now we need to compute effects, estimate errors, and allocate variation ◮ Can also produce confidence intervals for effects and predicted responses 5 / 44

2 k r Designs Effects Computing Effects for 2 2 r Factorial Experiments CS147 Computing Effects for 2 2 r Factorial Experiments 2015-06-15 2 k r Designs ◮ We can use sign table, as before ◮ But instead of single observations, regress off mean of the r observations Effects ◮ Compute errors for each replication using similar tabular method ◮ Sum of errors must be zero Computing Effects for 2 2 r Factorial ◮ e ij = y ij − ˆ y i ◮ Similar methods used for allocation of variance and calculating confidence intervals Experiments ◮ We can use sign table, as before The tabular method for errors is as follows: after computing the effects, multiply the effects by the sign table to get the estimated ◮ But instead of single observations, regress off mean of the r response. Enter that into the table and then subtractfrom each observations measured response to get errors. ◮ Compute errors for each replication using similar tabular method ◮ Sum of errors must be zero ◮ e ij = y ij − ˆ y i ◮ Similar methods used for allocation of variance and calculating confidence intervals 6 / 44

2 k r Designs Effects Example of 2 2 r Factorial Design With Replications CS147 Example of 2 2 r Factorial Design With Replications 2015-06-15 2 k r Designs ◮ Same parallel system as before, but with 4 replications at each point ( r = 4) Effects ◮ No DLM, 8 nodes: 820, 822, 813, 809 ◮ DLM, 8 nodes: 776, 798, 750, 755 ◮ No DLM, 64 nodes: 217, 228, 215, 221 Example of 2 2 r Factorial Design With ◮ DLM, 64 nodes: 197, 180, 220, 185 Replications ◮ Same parallel system as before, but with 4 replications at each point ( r = 4) ◮ No DLM, 8 nodes: 820, 822, 813, 809 ◮ DLM, 8 nodes: 776, 798, 750, 755 ◮ No DLM, 64 nodes: 217, 228, 215, 221 ◮ DLM, 64 nodes: 197, 180, 220, 185 7 / 44

2 k r Designs Effects 2 2 r Factorial Example Analysis Matrix CS147 2 2 r Factorial Example Analysis Matrix 2015-06-15 2 k r Designs I A B AB y Mean 1 -1 -1 1 816.00 (820,822,813,809) 1 1 -1 -1 220.25 (217,228,215,221) 1 -1 1 -1 769.75 (776,798,750,755) Effects 1 1 1 1 195.50 (197,180,220,185) 2001.5 -1170.0 -71.00 21.5 Total 500.4 -292.5 -17.75 5.4 Total/4 2 2 r Factorial Example Analysis Matrix q 0 = 500.40 q A = -292.5 q B = -17.75 q AB = 5.4 I A B AB y Mean 1 -1 -1 1 816.00 (820,822,813,809) 1 1 -1 -1 220.25 (217,228,215,221) 1 -1 1 -1 769.75 (776,798,750,755) 1 1 1 1 195.50 (197,180,220,185) 2001.5 -1170.0 -71.00 21.5 Total 500.4 -292.5 -17.75 5.4 Total/4 q 0 = 500.40 q A = -292.5 q B = -17.75 q AB = 5.4 8 / 44

2 k r Designs Effects Estimation of Errors for 2 2 r Factorial Example CS147 Estimation of Errors for 2 2 r Factorial Example 2015-06-15 2 k r Designs ◮ Figure differences between predicted and observed values for each replication: e ij = y ij − ˆ y i Effects = y ij − q 0 − q A x Ai − q B x Bi − q AB x Ai x Bi Estimation of Errors for 2 2 r Factorial ◮ Now calculate SSE: 2 2 r SSE = � � e 2 ij = 2606 Example i = 1 j = 1 ◮ Figure differences between predicted and observed values for each replication: y ij − ˆ e ij = y i = y ij − q 0 − q A x Ai − q B x Bi − q AB x Ai x Bi ◮ Now calculate SSE: 2 2 r � � e 2 SSE = ij = 2606 i = 1 j = 1 9 / 44

2 k r Designs Analysis of Variance Allocating Variation CS147 Allocating Variation 2015-06-15 2 k r Designs ◮ We can determine percentage of variation due to each factor’s impact Analysis of Variance ◮ Just like 2 k designs without replication ◮ But we can also isolate variation due to experimental errors ◮ Methods are similar to other regression techniques for Allocating Variation allocating variation ◮ We can determine percentage of variation due to each factor’s impact ◮ Just like 2 k designs without replication ◮ But we can also isolate variation due to experimental errors ◮ Methods are similar to other regression techniques for allocating variation 10 / 44

2 k r Designs Analysis of Variance Variation Allocation in Example CS147 Variation Allocation in Example 2015-06-15 2 k r Designs ◮ We’ve already figured SSE ◮ We also need SST, SSA, SSB, and SSAB Analysis of Variance � ( y ij − y ·· ) 2 SST = i , j ◮ Also, SST = SSA + SSB + SSAB + SSE Variation Allocation in Example ◮ Use same formulae as before for SSA, SSB, and SSAB ◮ We’ve already figured SSE ◮ We also need SST, SSA, SSB, and SSAB � ( y ij − y ·· ) 2 SST = i , j ◮ Also, SST = SSA + SSB + SSAB + SSE ◮ Use same formulae as before for SSA, SSB, and SSAB 11 / 44

2 k r Designs Analysis of Variance Sums of Squares for Example CS147 Sums of Squares for Example 2015-06-15 2 k r Designs ◮ SST = SSY − SS0 = 1,377,009.75 ◮ SSA = 1,368,900 ◮ SSB = 5041 Analysis of Variance ◮ SSAB = 462.25 ◮ Percentage of variation for A is 99.4% ◮ Percentage of variation for B is 0.4% Sums of Squares for Example ◮ Percentage of variation for A/B interaction is 0.03% ◮ And 0.2% (approx.) is due to experimental errors ◮ SST = SSY − SS0 = 1,377,009.75 ◮ SSA = 1,368,900 ◮ SSB = 5041 ◮ SSAB = 462.25 ◮ Percentage of variation for A is 99.4% ◮ Percentage of variation for B is 0.4% ◮ Percentage of variation for A/B interaction is 0.03% ◮ And 0.2% (approx.) is due to experimental errors 12 / 44

2 k r Designs Confidence Intervals Confidence Intervals for Effects CS147 Confidence Intervals for Effects 2015-06-15 2 k r Designs ◮ Computed effects are random variables ◮ Thus would like to specify how confident we are that they are correct Confidence Intervals ◮ Usual confidence-interval methods ◮ First, must figure Mean Square of Errors SSE Confidence Intervals for Effects s 2 e = 2 2 ( r − 1 ) ◮ r − 1 is because errors add up to zero ⇒ Only r − 1 can be chosen independently ◮ Computed effects are random variables ◮ Thus would like to specify how confident we are that they are correct ◮ Usual confidence-interval methods ◮ First, must figure Mean Square of Errors SSE s 2 e = 2 2 ( r − 1 ) ◮ r − 1 is because errors add up to zero ⇒ Only r − 1 can be chosen independently 13 / 44

2 k r Designs Confidence Intervals Calculating Variances of Effects CS147 Calculating Variances of Effects 2015-06-15 2 k r Designs ◮ Variance (due to errors) of all effects is the same: q A B = s 2 Confidence Intervals s 2 q 0 = s 2 q A = s 2 q B = s 2 e 2 2 r ◮ So standard deviation is also the same Calculating Variances of Effects ◮ In calculations, use t - or z -value for 2 2 ( r − 1 ) degrees of freedom ◮ Variance (due to errors) of all effects is the same: q A B = s 2 s 2 q 0 = s 2 q A = s 2 q B = s 2 e 2 2 r ◮ So standard deviation is also the same ◮ In calculations, use t - or z -value for 2 2 ( r − 1 ) degrees of freedom 14 / 44

CS 147: Computer Systems Performance Analysis Replicated Binary - PowerPoint PPT Presentation

CS147 2015-06-15 CS 147: Computer Systems Performance Analysis Replicated Binary Designs CS 147: Computer Systems Performance Analysis Replicated Binary Designs 1 / 44 Overview CS147 Overview 2015-06-15 2 k r Designs 2 2 r Designs

CS 147: Computer Systems Performance Analysis Approaching Performance Projects 1 / 35 Overview

CS 147: Computer Systems Performance Analysis Comparing Systems and Analyzing Alternatives 1 /

CS 147: Computer Systems Performance Analysis Networks of Queues 1 / 18 Overview CS147

CS 147: Computer Systems Performance Analysis Selecting Techniques 1 / 37 Overview CS147

CS 147: Computer Systems Performance Analysis Advanced Regression Techniques 1 / 31 Overview

CS 147: Computer Systems Performance Analysis Introduction to Queueing Theory 1 / 27 Overview

CS 147: Computer Systems Performance Analysis Higher Designs and Other Considerations 1 / 25

CS 147: Computer Systems Performance Analysis Fractional Factorial Designs 1 / 26 Overview

CS 147: Computer Systems Performance Analysis Multiple and Categorical Regression 1 / 36

CS 147: Computer Systems Performance Analysis Examples Using a Distributed File System 1 / 37

CS 147: Computer Systems Performance Analysis Course Introduction 1 / 35 Overview CS147

CS 147: Computer Systems Performance Analysis Mistakes in Graphical Presentation 1 / 45

CS 147: Computer Systems Performance Analysis Workload Characterization 1 / 31 Overview CS147

CS 147: Computer Systems Performance Analysis Specifics of Graphical Presentation 1 / 35

CS 147: Computer Systems Performance Analysis Workload Selection 1 / 39 Overview CS147

CS 147: Computer Systems Performance Analysis Summarizing Data 1 / 30 Overview CS147 Overview

Outline The Simple Linear Regression Model (12.1) Fitting the Regression Line (12.2)

Example 1.11 I Fig 1.12 September 13, 2020 1 / 12 Example 1.11 II s a b q 1 a r 1 b a a

State Prof. Hakim Weatherspoon CS 3410 Computer Science Cornell University [Weatherspoon,

Set-difference () Tuples in reln.1,butnotin reln.2. Union

Active Appearance Models Edwards, Taylor, and Cootes Presented by Bryan Russell Overview

EDUCATION AND LEARNING ONLINE RESOURCES Continued Professional Development for Archivists

WHEN WHEN WE WE RES RESTAR TART NYC NYC Driven by Science, Data and Public Health Prioritize

Noble County COVID-19 Responsible Restart Ohio Conversation Noble County Updates by: Noble