• DocumentCode
    2824627
  • Title

    Improved Similarity Measures for Software Clustering

  • Author

    Naseem, Rashid ; Maqbool, Onaiza ; Muhammad, Siraj

  • Author_Institution
    Dept. of Comput. Sci., Quaid-I-Azam Univ., Islamabad, Pakistan
  • fYear
    2011
  • fDate
    1-4 March 2011
  • Firstpage
    45
  • Lastpage
    54
  • Abstract
    Software clustering is a useful technique to recover architecture of a software system. The results of clustering depend upon choice of entities, features, similarity measures and clustering algorithms. Different similarity measures have been used for determining similarity between entities during the clustering process. In software architecture recovery domain the Jaccard and the Unbiased Ellenberg measures have shown better results than other measures for binary and non-binary features respectively. In this paper we analyze the Russell and Rao measure for binary features to show the conditions under which its performance is expected to be better than that of Jaccard. We also show how our proposed Jaccard-NM measure is suitable for software clustering and propose its counterpart for non-binary features. Experimental results indicate that our proposed Jaccard-NM measure and Russell & Rao measure perform better than Jaccard measure for binary features, while for non-binary features, the proposed Unbiased Ellenberg-NM measure produces results which are closer to the decomposition prepared by experts.
  • Keywords
    pattern clustering; software architecture; software maintenance; Jaccard-NM measure; Rao measure; Russell measure; Unbiased Ellenberg-NM measure; improved similarity measures; nonbinary features; software architecture recovery system; software clustering algorithm; Clustering algorithms; Couplings; Loss measurement; Software algorithms; Software measurement; Software systems; Jaccard Measure; Jaccard-NM Measure; Russell & Rao Measure; Software Clustering; Unbiased Ellenberg-NM Measure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on
  • Conference_Location
    Oldenburg
  • ISSN
    1534-5351
  • Print_ISBN
    978-1-61284-259-2
  • Type

    conf

  • DOI
    10.1109/CSMR.2011.9
  • Filename
    5741245