Title :
Clustering around medoids based on ultrametric properties
Author :
Fouchal, S. ; Bui, M. ; Lavallée, I.
Author_Institution :
Lab. LaISC, Univ. de Strasbourg, Strasbourg, France
Abstract :
The FFUCA (Fast and Flexible Unsupervised Clustering Algorithm) is a fast clustering method based on ultrametric properties. It aggregates data in the same way as partitional methods. But, it elects representatives differently. Indeed, in FFUCA the cluster representatives are deduced from an ultrametric structure built from a sample data. This ultrametric structure gives the data behavior according to used distance. Thus the results are independent from the cluster representatives. We propose in this paper an extension named FFUCAAM to change for better the quality of clusters. Indeed, we improve the election of these representatives. We substitute them by mediods after every new aggregation. This extension increases the complexity in the average case to O(Σi=1k mi2) where k is the number of the resulting clusters and mi is the size of the cluster Ci. In fact, its computational cost is increased but it still less than O(n2), thus it remains applicable to large databases.
Keywords :
data mining; pattern clustering; FFUCAAM; cluster representatives; computational cost; data aggregation; databases; fast and flexible unsupervised clustering algorithm; medoid clustering; partitional methods; ultrametric properties; Clustering algorithms; Complexity theory; Computational efficiency; Context; Extraterrestrial measurements; Shape;
Conference_Titel :
Industrial Informatics (INDIN), 2012 10th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-0312-5
DOI :
10.1109/INDIN.2012.6301136