DocumentCode :
1373836
Title :
CA-Tree: A Hierarchical Structure for Efficient and Scalable Coassociation-Based Cluster Ensembles
Author :
Wang, Tsaipei
Author_Institution :
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume :
41
Issue :
3
fYear :
2011
fDate :
6/1/2011 12:00:00 AM
Firstpage :
686
Lastpage :
698
Abstract :
Cluster ensembles have attracted a lot of research interests in recent years, and their applications continue to expand. Among the various algorithms for cluster ensembles, those based on coassociation matrices are probably the ones studied and used the most because coassociation matrices are easy to understand and implement. However, the main limitation of coassociation matrices as the data structure for combining multiple clusterings is the complexity that is at least quadratic to the number of patterns N. In this paper, we propose CA-tree, which is a dendogram-like hierarchical data structure, to facilitate efficient and scalable cluster ensembles for coassociation-matrix-based algorithms. All the properties of the CA-tree are derived from base cluster labels and do not require the access to the original data features. We then apply a threshold to the CA-tree to obtain a set of nodes, which are then used in place of the original patterns for ensemble-clustering algorithms. The experiments demonstrate that the complexity for coassociation-based cluster ensembles can be reduced to close to linear to N with minimal loss on clustering accuracy.
Keywords :
matrix algebra; pattern clustering; CA-Tree; base cluster labels; coassociation-based cluster ensembles; coassociation-matrix-based algorithms; dendogram-like hierarchical data structure; Buildings; Clustering algorithms; Complexity theory; Diversity reception; Partitioning algorithms; Prototypes; Upper bound; Cluster ensemble; coassociation matrix; multiple clusterings; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2010.2086059
Filename :
5625918
Link To Document :
بازگشت