Title :
An improved k-medoids clustering algorithm
Author :
Cao, Danyang ; Yang, Bingru
Author_Institution :
Coll. of Inf. Eng., North China Univ. of Technol., NCUT, Beijing, China
Abstract :
In this paper, we present an improved k-medoids clustering algorithm based on CF-Tree. The algorithm based on the clustering features of BIRCH algorithm, the concept of k-medoids algorithm has been improved. We preserve all the training sample data in an CF-Tree, then use k-medoids method to cluster the CF in leaf nodes of CF-Tree. Eventually, we can get k clusters from the root of the CF-Tree. This algorithm improves obviously the drawbacks of the k-medoids algorithm, such as the time complexity, scalability on large dataset, and can´t find the clusters of sizes different very much and the convex shapes. Experiments show that this algorithm enhances the quality and scalability of clustering.
Keywords :
computational complexity; learning (artificial intelligence); pattern clustering; trees (mathematics); BIRCH algorithm; CF-tree; convex shapes; k-medoids clustering algorithm; time complexity; training sample data; Clustering algorithms; Clustering methods; Data analysis; Data mining; Educational institutions; Noise robustness; Partitioning algorithms; Scalability; Shape; Unsupervised learning; CF-Tree; clustering; k-medoids;
Conference_Titel :
Computer and Automation Engineering (ICCAE), 2010 The 2nd International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-5585-0
Electronic_ISBN :
978-1-4244-5586-7
DOI :
10.1109/ICCAE.2010.5452085