DM841 Discrete Optimization
Methods for Experimental Analysis
Marco Chiarandini
Department of Mathematics & Computer Science University of Southern Denmark
Methods for Experimental Analysis Marco Chiarandini Department of - - PowerPoint PPT Presentation
DM841 Discrete Optimization Methods for Experimental Analysis Marco Chiarandini Department of Mathematics & Computer Science University of Southern Denmark Experimental Methods Course Overview Sequential Testing Combinatorial
Department of Mathematics & Computer Science University of Southern Denmark
Experimental Methods Sequential Testing
◮ Very Large Scale Neighborhoods
2
Experimental Methods Sequential Testing
3
Experimental Methods Sequential Testing
4
Experimental Methods Sequential Testing
5
Experimental Methods Sequential Testing
◮ We work with samples (instances, solution quality) ◮ But we want sound conclusions: generalization over a given population
◮ Thus we need statistical inference
11
Experimental Methods Sequential Testing
◮ There is a competition and two stochastic algorithms A1 and A2 are
◮ We run both algorithms once on n instances.
12
Experimental Methods Sequential Testing
◮ p: probability that A1 wins on each instance (+) ◮ n: number of runs without ties ◮ Y : number of wins of algorithm A1
10 15 20 0.00 0.04 0.08 0.12
Binomial Distribution: Trials = 30, Probability of success = 0.5
Number of Successes Probability Mass
Experimental Methods Sequential Testing
2 4 6 8 10 0.00 0.05 0.10 0.15 0.20 0.25
Binomial distribution: Trials = 30 Probability of success 0.5
number of successes y Pr[Y=y]
14
Experimental Methods Sequential Testing
15
Experimental Methods Sequential Testing
◮ Assume that data are consistent with a null hypothesis H0 (e.g., sample
◮ Use a statistical test to compute how likely this is to be true, given the
◮ Do not reject H0 if the p-value is larger than an user defined threshold
◮ Alternatively, (p-value < α), H0 is rejected in favor of an alternative
16
Experimental Methods Sequential Testing
17
Experimental Methods Sequential Testing
◮ Consequences:
◮ allows inference from a sample ◮ allows to model errors in measurements: X = µ + ǫ
◮ Issues:
◮ n should be enough large ◮ µ and σ must be known 18
Experimental Methods Sequential Testing
10 20 30 40 0.0 0.2 0.4 0.6
Weibull distribution
x dweibull(x, shape = 1.4)
¯ X−µ σ/√n
n=1 x Density −1 1 2 3 4 5 0.0 0.1 0.2 0.3 0.4 0.5 0.6 n=5 x Density −2 −1 1 2 3 4 5 0.0 0.1 0.2 0.3 0.4 n=15 x Density −2 −1 1 2 0.0 0.1 0.2 0.3 0.4 n=50 x Density −2 −1 1 2 3 0.0 0.1 0.2 0.3 0.4
19
Experimental Methods Sequential Testing
¯ X1 ¯ X2 ¯ X3
¯ X1 ¯ X2 ¯ X3
20
Experimental Methods Sequential Testing
21
Experimental Methods Sequential Testing
22
Experimental Methods Sequential Testing
T =
( ¯ X1− ¯ X2)−
r
T Student’s t Distribution
1 − ¯
2
22
Experimental Methods Sequential Testing
25 30 35 40 45 0.0 0.2 0.4 0.6 0.8 1.0 F(x) x
1 2
23
Experimental Methods Sequential Testing
◮ independence ◮ homoschedasticity ◮ normality
◮ independence ◮ homoschedasticity
◮ Rank based tests ◮ Permutation tests
◮ Exact ◮ Conditional Monte Carlo 24
Experimental Methods Sequential Testing
25
Experimental Methods Sequential Testing
◮ Blocking on instances ◮ Same pseudo random seed
◮ If the sample size is large enough (infinity) any difference in the means
◮ Real vs Statistical significance
◮ Desired statistical power + practical precision ⇒ sample size
26
Experimental Methods Sequential Testing
◮ Statement of the objectives of the experiment
◮ Comparison of different algorithms ◮ Impact of algorithm components ◮ How instance features affect the algorithms
◮ Identification of the sources of variance
◮ Treatment factors (qualitative and quantitative) ◮ Controllable nuisance factors ⇐ blocking ◮ Uncontrollable nuisance factors ⇐ measuring
◮ Definition of factor combinations to test
◮ Running a pilot experiment and refine the design
◮ Bugs and no external biases ◮ Ceiling or floor effects ◮ Rescaling levels of quantitative factors ◮ Detect the number of experiments needed to obtained the desired power. 27
Experimental Methods Sequential Testing
Algorithm 1 Algorithm 2 . . . Algorithm k Instance 1 X11 X12 X1k . . . . . . . . . . . . Instance b Xb1 Xb2 Xbk
Algorithm 1 Algorithm 2 . . . Algorithm k Instance 1 X111, . . . , X11r X121, . . . , X12r X1k1, . . . , X1kr Instance 2 X211, . . . , X21r X221, . . . , X22r X2k1, . . . , X2kr . . . . . . . . . . . . Instance b Xb11, . . . , Xb1r Xb21, . . . , Xb2r Xbk1, . . . , Xbkr
28
Experimental Methods Sequential Testing
◮ Protected versions: global test + no adjustments ◮ Bonferroni α = αEX/c (conservative) ◮ Tukey Honest Significance Method (for parametric analysis) ◮ Holm (step-wise) ◮ Other step procedures
29
Experimental Methods Sequential Testing
30
Experimental Methods Sequential Testing
31
Experimental Methods Sequential Testing
◮ Matched pairs versions: when, when not ◮ t-test with different variances
32
Experimental Methods Sequential Testing
33
Experimental Methods Sequential Testing
◮ Matched pairs versions: when, when not ◮ t-test Welch variant: no assumption of equal variances
34
Experimental Methods Sequential Testing
Instance HEA TSN1 ILS MinConf XRLF Instance Succ. k Succ. k Succ. k Succ. k Succ. k flat300_20_0 10 20 10 20 10 20 10 20 6 20 flat300_26_0 10 26 10 26 10 26 10 26 1 33 flat300_28_0 6 31 4 31 2 31 1 31 1 34 flat1000_50_0 4 50 2 85 6 88 4 87 1 84 flat1000_60_0 4 87 3 88 1 89 4 89 6 87 flat1000_76_0 1 88 1 88 1 89 8 90 6 87 GLS SAN2 Novelty TSN3 Instance Succ. k Succ. k Succ. k Succ. k flat300_20_0 10 20 10 20 1 22 1 33 flat300_26_0 10 33 1 32 4 29 6 35 flat300_28_0 8 33 8 33 10 35 4 35 flat1000_50_0 10 50 1 86 6 54 1 95 flat1000_60_0 4 90 1 88 4 64 1 96 flat1000_76_0 8 92 4 89 8 98 1 96
35
Experimental Methods Sequential Testing
col
Novelty HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 50 60 70 80 90
70 80 90
88 90 92 94 96 98
Novelty HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 20 25 30 35
flat300_20_0
26 28 30 32 34 36
31 32 33 34 35 36 37
36
Experimental Methods Sequential Testing
> load("gcp-all-classes.dataR") > G <− F[F$class=="Flat",] > bwplot(alg ~ col inst,data=G,scales=list(x=list(relation="free")),pch="") > boxplot(err3~alg,data=G,horizontal=TRUE,main=expression(paste("Invariant error: ",frac(x
> boxplot(rank~alg,data=G,horizontal=TRUE,main="Ranks",notch=TRUE,col="pink")
37
Experimental Methods Sequential Testing
HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 0.3 0.4 0.5 0.6 0.7
Invariant error: x − x( (opt) ) x( (worst) ) − x( (opt) )
HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 20 40 60 80
Ranks
38
Experimental Methods Sequential Testing
> pairwise.wilcox.test(G$err3,G$alg,paired=TRUE) Pairwise comparisons using Wilcoxon rank sum test data: G$err3 and G$alg
39
Experimental Methods Sequential Testing
> par(las=1,mar=c(3,8,3,1)) > plot(TukeyHSD(aov(err3~alg∗inst,data=G),which="alg"),las=1,mar=c(3,7,3,1))
0.00 0.05 0.10 0.15 0.20 TSinN3−SAKempeFI TSinN3−XRLF SAKempeFI−XRLF TSinN3−GLS2 SAKempeFI−GLS2 XRLF−GLS2 TSinN3−MinConf SAKempeFI−MinConf XRLF−MinConf GLS2−MinConf TSinN3−ILS SAKempeFI−ILS XRLF−ILS GLS2−ILS MinConf−ILS TSinN3−TSinN1 SAKempeFI−TSinN1 XRLF−TSinN1 GLS2−TSinN1 MinConf−TSinN1 ILS−TSinN1 TSinN3−HEA SAKempeFI−HEA XRLF−HEA GLS2−HEA MinConf−HEA ILS−HEA TSinN1−HEA TSinN3−Novelty SAKempeFI−Novelty XRLF−Novelty GLS2−Novelty MinConf−Novelty ILS−Novelty TSinN1−Novelty HEA−Novelty
95% family−wise confidence level
40
Experimental Methods Sequential Testing
41
Experimental Methods Sequential Testing
Novelty HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 0.50 0.55 0.60 0.65 0.70 Average Inveriant Error (Tukey's Honset Significance Difference) Novelty HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 0.50 0.55 0.60 0.65 0.70 Average Inveriant Error (Permutation Test) Novelty HEA TSinN1 ILS MinConf GLS2 XRLF SAKempeFI TSinN3 20 40 60 80 Average Rank (Friedman Test) 42
Experimental Methods Sequential Testing
43
Experimental Methods Sequential Testing
◮ F-Race use Friedman test ◮ Holm adjustment method is typically the most powerful
race(wrapper.file, maxExp=0, stat.test=c("friedman","t.bonferroni","t.holm","t.none"), conf.level=0.95, first.test=5, interactive=TRUE, log.file="", no.slaves=0,...)
44
Experimental Methods Sequential Testing
S_D_s_Y S_D_g_Y O_CCRB O_CCRA O_DCRB S_D_g_N O_CRRA O_DCRA O_CRRB S_D_s_N O_DRRA O_DRRB S_RLF_N O_CCFA S_RLF_Y O_CCFB O_DCFB O_DCFA S_Seq_SL_Y ... 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
class−GEOMb (11 Instances)
Stage
47