• DocumentCode
    1284374
  • Title

    Software Fault Prediction Using Quad Tree-Based K-Means Clustering Algorithm

  • Author

    Bishnu, Partha Sarathi ; Bhattacherjee, Vandana

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Birla Inst. of Technol., Ranchi, India
  • Volume
    24
  • Issue
    6
  • fYear
    2012
  • fDate
    6/1/2012 12:00:00 AM
  • Firstpage
    1146
  • Lastpage
    1150
  • Abstract
    Unsupervised techniques like clustering may be used for fault prediction in software modules, more so in those cases where fault labels are not available. In this paper a Quad Tree-based K-Means algorithm has been applied for predicting faults in program modules. The aims of this paper are twofold. First, Quad Trees are applied for finding the initial cluster centers to be input to the A´-Means Algorithm. An input threshold parameter δ governs the number of initial cluster centers and by varying δ the user can generate desired initial cluster centers. The concept of clustering gain has been used to determine the quality of clusters for evaluation of the Quad Tree-based initialization algorithm as compared to other initialization techniques. The clusters obtained by Quad Tree-based algorithm were found to have maximum gain values. Second, the Quad Tree- based algorithm is applied for predicting faults in program modules. The overall error rates of this prediction approach are compared to other existing algorithms and are found to be better in most of the cases.
  • Keywords
    pattern clustering; quadtrees; software fault tolerance; clustering gain; fault prediction; quad tree-based initialization algorithm; quad tree-based k-means clustering algorithm; software fault prediction; software modules; unsupervised techniques; Algorithm design and analysis; Clustering algorithms; Error analysis; Measurement; Prediction algorithms; Software; Software algorithms; K-Means clustering; Quad Tree; software fault prediction.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2011.163
  • Filename
    5963674