• DocumentCode
    2657392
  • Title

    Improving tree-based models of software quality with principal components analysis

  • Author

    Khoshgoftaar, Taghi M. ; Shan, Ruqun M. ; Allen, Edward B.

  • Author_Institution
    Florida Atlantic Univ., Boca Raton, FL, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    198
  • Lastpage
    209
  • Abstract
    Software quality classification models can predict which modules are to be considered fault-prone, and which are not, based on software product metrics, process metrics and execution metrics. Such predictions can be used to target improvement efforts to those modules that need them the most. Classification-tree modeling is a robust technique for building such software quality models. However, the model structure may be unstable, and accuracy may suffer when the predictors are highly correlated. This paper presents an empirical case study of four releases of a very large telecommunications system, which shows that the tree-based models can be improved by transforming the predictors with principal components analysis, so that the transformed predictors are not correlated. The case study used the regression-tree algorithm in the S-Plus package and then applied a general decision rule to classify the modules
  • Keywords
    pattern classification; principal component analysis; software metrics; software packages; software quality; subroutines; telecommunication computing; trees (mathematics); S-Plus package; accuracy; case study; classification-tree modeling; correlated predictors; decision rule; fault-prone modules; improvement efforts; principal components analysis; program module classification; regression-tree algorithm; software execution metrics; software process metrics; software product metrics; software quality classification models; telecommunications system; transformed predictors; tree-based models; unstable model structure; Accuracy; Buildings; Classification tree analysis; Logistics; Packaging; Predictive models; Principal component analysis; Robustness; Software metrics; Software quality;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Reliability Engineering, 2000. ISSRE 2000. Proceedings. 11th International Symposium on
  • Conference_Location
    San Jose, CA
  • ISSN
    1071-9458
  • Print_ISBN
    0-7695-0807-3
  • Type

    conf

  • DOI
    10.1109/ISSRE.2000.885872
  • Filename
    885872