• DocumentCode
    866055
  • Title

    Shotgun correlations in software measures

  • Author

    Courtney, Richard E. ; Gustafson, David A.

  • Author_Institution
    Dept. of Comput. Sci., Tulane Univ., New Orleans, LA, USA
  • Volume
    8
  • Issue
    1
  • fYear
    1993
  • fDate
    1/1/1993 12:00:00 AM
  • Firstpage
    5
  • Lastpage
    13
  • Abstract
    Many software measures have been forwarded on the simple basis of a high linear correlation coefficient with some measurable quantities. The linear correlation coefficient is an unreliable statistic for deciding whether an observed correlation indicates significant association. Several published software measure experiments collected more than 20 different measurements, or have 14 or fewer observations. With considerable data from small samples, the probabilit of `discovering´ a `significant´ correlation is high. The authors present a computer simulation experiment where the correlation between sets of randomly generated numbers is calculated. They also look at randomly generated numbers in the ranges that would be expected in Halstead´s software science measures. The results show that the average maximum linear correlation for randomly generated numbers is 0.70 or higher if the sample size is low compared to the number of variables. Alternative statistical approaches to obtaining meaningful significant results are presented
  • Keywords
    software metrics; statistical analysis; Halstead software science metrics; average maximum linear correlation; computer simulation experiment; linear correlation coefficient; observed correlation; randomly generated numbers; shotgun correlations; software measures;
  • fLanguage
    English
  • Journal_Title
    Software Engineering Journal
  • Publisher
    iet
  • ISSN
    0268-6961
  • Type

    jour

  • Filename
    199631