DocumentCode :
2236005
Title :
SDR: An algorithm for clustering categorical data using rough set theory
Author :
Tripathy, B.K. ; Ghosh, Adhir
Author_Institution :
SCSE, VIT Univ., Vellore, India
fYear :
2011
fDate :
22-24 Sept. 2011
Firstpage :
867
Lastpage :
872
Abstract :
Many clustering algorithms are available for grouping objects with similar characteristics. However, most of them are difficult to apply in practice because most datasets involve categorical values. Moreover, the algorithms that can handle categorical data mostly cannot handle uncertainty, and some of them suffer from stability issues. This necessitated the development of algorithms that cluster categorical data while handling uncertainty. To address these problems, an algorithm termed MMR [1] was proposed in 2007; it uses basic rough set theory concepts to cluster categorical data. Later, in 2009, another algorithm, termed MMeR [2], was proposed; it is more efficient than MMR and can also handle heterogeneous data. In this paper, we further improve MMeR and propose an algorithm that we call SDR (Standard Deviation Roughness). It handles heterogeneous data in addition to uncertainty. We establish its efficiency over many other algorithms using well-known standard data sets for testing, with the purity ratio as the measure of efficiency.
Keywords :
data handling; pattern clustering; rough set theory; uncertainty handling; MMR; MMeR; SDR algorithm; categorical data clustering algorithm; heterogeneous data handling; standard deviation roughness; Algorithm design and analysis; Approximation methods; Classification algorithms; Clustering algorithms; Computer numerical control; Set theory; Uncertainty; SDR; clustering
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Recent Advances in Intelligent Computational Systems (RAICS), 2011 IEEE
Conference_Location :
Trivandrum
Print_ISBN :
978-1-4244-9478-1
Type :
conf
DOI :
10.1109/RAICS.2011.6069433
Filename :
6069433