DocumentCode
866055
Title
Shotgun correlations in software measures
Author
Courtney, Richard E. ; Gustafson, David A.
Author_Institution
Dept. of Comput. Sci., Tulane Univ., New Orleans, LA, USA
Volume
8
Issue
1
fYear
1993
fDate
1/1/1993 12:00:00 AM
Firstpage
5
Lastpage
13
Abstract
Many software measures have been forwarded on the simple basis of a high linear correlation coefficient with some measurable quantities. The linear correlation coefficient is an unreliable statistic for deciding whether an observed correlation indicates significant association. Several published software measure experiments collected more than 20 different measurements, or have 14 or fewer observations. With considerable data from small samples, the probabilit of `discovering´ a `significant´ correlation is high. The authors present a computer simulation experiment where the correlation between sets of randomly generated numbers is calculated. They also look at randomly generated numbers in the ranges that would be expected in Halstead´s software science measures. The results show that the average maximum linear correlation for randomly generated numbers is 0.70 or higher if the sample size is low compared to the number of variables. Alternative statistical approaches to obtaining meaningful significant results are presented
Keywords
software metrics; statistical analysis; Halstead software science metrics; average maximum linear correlation; computer simulation experiment; linear correlation coefficient; observed correlation; randomly generated numbers; shotgun correlations; software measures;
fLanguage
English
Journal_Title
Software Engineering Journal
Publisher
iet
ISSN
0268-6961
Type
jour
Filename
199631
Link To Document