SLIDE 4 4
Experimental Software Engineering / Fernando Brito e Abreu 12-May-08
Why care about correlation then?
Correlation analysis can be useful for:
to reduce the size of that set of explanatory variables
Highly correlated ones may be measuring the same attribute
Performing a preliminary assessment of the feasibility of an
hypothesis
A very low correlation (association) between a dependent and an
independent variable may lead us to discard considering the hypothesis
Most statistical tools allow us to produce cross-
correlation tables (symmetrical matrices with one by one correlation values among considered variables)
The main diagonal is obviously filled with 1’s (100% correlation)
Experimental Software Engineering / Fernando Brito e Abreu 12-May-08
Association between variables: properties
Magnitude or size
This property pertains to the strength of the association Several correlation coefficients (e.g. Pearson, Spearman) allow to quantify
this magnitude
Signal of the association
Positive – when a variable increases, the other increases as well Negative – when a variable increases, the other decreases
Significance, reliability or truthfulness
This property pertains to the representativeness of the result found in our
specific model for the entire population
It says how probable it is that a similar relation would be found if the experiment
was replicated with other samples from the same population