Title :
Cluster validity measures for data with tolerance
Author :
Hamasuna, Yukihiro ; Endo, Yasunori ; Miyamoto, Sadaaki
Author_Institution :
Dept. of Risk Eng., Univ. of Tsukuba, Tsukuba, Japan
Abstract :
Cluster validity measures are used in order to determine an appropriate number of clusters and evaluate cluster partitions obtained by clustering algorithms. When we handle a set of data, data contains inherent uncertainty e.g., errors, ranges or some missing value of attributes. The concept of tolerance has been proposed from the viewpoint of handling such uncertain data. In this paper, we introduce clustering algorithms for data with tolerance. Moreover, we propose new five measures for data with tolerance, that is, the determinants and the traces of fuzzy covariance matrices, the Xie-Beni´s index, the Fukuyama-Sugeno´s index, and the Davies-Bouldin´s index. We compare the performance of conventional ones with their tolerance versions. We found that our proposed measures takes smaller value than conventional ones. These results indicate tolerance based clustering method is suitable for handling uncertain data.
Keywords :
covariance matrices; data handling; indexing; pattern clustering; Davies-Bouldin index; Fukuyama-Sugeno index; Xie-Beni index; cluster partition; cluster validity measures; clustering algorithm; data-with-tolerance; fuzzy covariance matrix; tolerance based clustering; uncertain data handling; Clustering algorithms; Clustering methods; Covariance matrix; Diseases; Heart; Indexes; Measurement uncertainty;
Conference_Titel :
Fuzzy Systems (FUZZ), 2010 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6919-2
DOI :
10.1109/FUZZY.2010.5584371