$r$) between the outcome measures on the same participants. Software engineering theory emphasizes the importance of individual skill differences, so we would expect the values of $r$ to be relatively high. However, few researchers have reported the values of $r$. Goal: To investigate the values of $r$ found in software engineering experiments. Method: We undertook simulation studies to investigate the theoretical and empirical properties of $r$. Then we investigated the values of $r$ observed in 35 software engineering crossover experiments. Results: The level of $r$ obtained by analysing our 35 crossover experiments was small. Estimates based on means, medians, and random effect analysis disagreed but were all between 0.2 and 0.3. As expected, our analyses found large variability among the individual $r$ estimates for small sample sizes, but no indication that $r$ estimates were larger for the experiments with larger sample sizes that exhibited smaller variability. Conclusions: Low observed $r$ values cast doubts on the validity of crossover designs for software engineering experiments. However, if the cause of low $r$ values relates to training limitations or toy tasks, this affects all Software Engineering (SE) experiments involving human participants. For all human-intensive SE experiments, we recommend more intensive training and then tracking the improvement of participants as they practice using specific techniques, before formally testing the effectiveness of the techniques.">

The Importance of the Correlation in Crossover Experiments
