DocumentCode
2824627
Title
Improved Similarity Measures for Software Clustering
Author
Naseem, Rashid ; Maqbool, Onaiza ; Muhammad, Siraj
Author_Institution
Dept. of Comput. Sci., Quaid-I-Azam Univ., Islamabad, Pakistan
fYear
2011
fDate
1-4 March 2011
Firstpage
45
Lastpage
54
Abstract
Software clustering is a useful technique to recover architecture of a software system. The results of clustering depend upon choice of entities, features, similarity measures and clustering algorithms. Different similarity measures have been used for determining similarity between entities during the clustering process. In software architecture recovery domain the Jaccard and the Unbiased Ellenberg measures have shown better results than other measures for binary and non-binary features respectively. In this paper we analyze the Russell and Rao measure for binary features to show the conditions under which its performance is expected to be better than that of Jaccard. We also show how our proposed Jaccard-NM measure is suitable for software clustering and propose its counterpart for non-binary features. Experimental results indicate that our proposed Jaccard-NM measure and Russell & Rao measure perform better than Jaccard measure for binary features, while for non-binary features, the proposed Unbiased Ellenberg-NM measure produces results which are closer to the decomposition prepared by experts.
Keywords
pattern clustering; software architecture; software maintenance; Jaccard-NM measure; Rao measure; Russell measure; Unbiased Ellenberg-NM measure; improved similarity measures; nonbinary features; software architecture recovery system; software clustering algorithm; Clustering algorithms; Couplings; Loss measurement; Software algorithms; Software measurement; Software systems; Jaccard Measure; Jaccard-NM Measure; Russell & Rao Measure; Software Clustering; Unbiased Ellenberg-NM Measure;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on
Conference_Location
Oldenburg
ISSN
1534-5351
Print_ISBN
978-1-61284-259-2
Type
conf
DOI
10.1109/CSMR.2011.9
Filename
5741245
Link To Document