SLIDE 22 Jian Pei: CMPT 741/459 Classification (2) 22
Procedure of Comparison
- Using a set of data sets
- Procedure
– Compute the effectiveness measure for every data set – Compute a test statistic based on a comparison of the effectiveness measures for each data set
- E.g., the t-test, the Wilcoxon signed-rank test, and the sign test
– Compute a P-value: the probability that a test statistic value at least that extreme could be observed if the null hypothesis were true – The null hypothesis is rejected if the P-value ≤ α, where α is the significance level which is used to minimize the type I errors
- One-sided (one-tailed) tests: whether B is better than A (the
baseline method)
– Two-sided tests: whether A and B are different – the P-value is doubled