DocumentCode :
2824627
Title :
Improved Similarity Measures for Software Clustering
Author :
Naseem, Rashid ; Maqbool, Onaiza ; Muhammad, Siraj
Author_Institution :
Dept. of Comput. Sci., Quaid-I-Azam Univ., Islamabad, Pakistan
fYear :
2011
fDate :
1-4 March 2011
Firstpage :
45
Lastpage :
54
Abstract :
Software clustering is a useful technique to recover architecture of a software system. The results of clustering depend upon choice of entities, features, similarity measures and clustering algorithms. Different similarity measures have been used for determining similarity between entities during the clustering process. In software architecture recovery domain the Jaccard and the Unbiased Ellenberg measures have shown better results than other measures for binary and non-binary features respectively. In this paper we analyze the Russell and Rao measure for binary features to show the conditions under which its performance is expected to be better than that of Jaccard. We also show how our proposed Jaccard-NM measure is suitable for software clustering and propose its counterpart for non-binary features. Experimental results indicate that our proposed Jaccard-NM measure and Russell & Rao measure perform better than Jaccard measure for binary features, while for non-binary features, the proposed Unbiased Ellenberg-NM measure produces results which are closer to the decomposition prepared by experts.
Keywords :
pattern clustering; software architecture; software maintenance; Jaccard-NM measure; Rao measure; Russell measure; Unbiased Ellenberg-NM measure; improved similarity measures; nonbinary features; software architecture recovery system; software clustering algorithm; Clustering algorithms; Couplings; Loss measurement; Software algorithms; Software measurement; Software systems; Jaccard Measure; Jaccard-NM Measure; Russell & Rao Measure; Software Clustering; Unbiased Ellenberg-NM Measure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on
Conference_Location :
Oldenburg
ISSN :
1534-5351
Print_ISBN :
978-1-61284-259-2
Type :
conf
DOI :
10.1109/CSMR.2011.9
Filename :
5741245
Link To Document :
بازگشت