DocumentCode
3455455
Title
A Combination Approach to Cluster Validation Based on Statistical Quantiles
Author
Albalate, Amparo ; Suendermann, David
Author_Institution
Inst. of Inf. Technol., Univ. of Ulm, Ulm, Germany
fYear
2009
fDate
3-5 Aug. 2009
Firstpage
549
Lastpage
555
Abstract
In this paper, we analyse different techniques to detect the number of clusters in a dataset, also know as cluster validation techniques. We also propose a new algorithm based on the combination of several validation indexes to simultaneously validate several partitions of a dataset generated by different clustering techniques and object distances. The existing validation techniques as well as the combination algorithm have been tested on three data sets: a synthesized mixture of Gaussians data set, the NCI60 microarray data set, and the Iris data set. Evaluation results have shown the adequate performance of the proposed approach, even if the input validity scores fail to discover the true number of clusters.
Keywords
pattern clustering; statistical analysis; Gaussians data set; NCI60 microarray data set; combination algorithm; dataset cluster validation technique; object distance; statistical quantile; validation index; Bioinformatics; Biology computing; Clustering algorithms; Information analysis; Information technology; Intelligent systems; Partitioning algorithms; Speech analysis; Systems biology; Testing; Cluster Validation; Quantile;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3739-9
Type
conf
DOI
10.1109/IJCBS.2009.116
Filename
5260453
Link To Document