DocumentCode :
1514676
Title :
A Link-Based Approach to the Cluster Ensemble Problem
Author :
Iam-On, Natthakan ; Boongoen, Tossapon ; Garrett, Simon ; Price, Chris
Author_Institution :
Sch. of Inf. Technol., Mae Fah Luang Univ., Chiang Rai, Thailand
Volume :
33
Issue :
12
fYear :
2011
Firstpage :
2396
Lastpage :
2409
Abstract :
Cluster ensembles have recently emerged as a powerful alternative to standard cluster analysis, aggregating several input data clusterings to generate a single output clustering, with improved robustness and stability. From the early work, these techniques held great promise; however, most of them generate the final solution based on incomplete information of a cluster ensemble. The underlying ensemble-information matrix reflects only cluster-data point relations, while those among clusters are generally overlooked. This paper presents a new link-based approach to improve the conventional matrix. It achieves this using the similarity between clusters that are estimated from a link network model of the ensemble. In particular, three new link-based algorithms are proposed for the underlying similarity assessment. The final clustering result is generated from the refined matrix using two different consensus functions of feature-based and graph-based partitioning. This approach is the first to address and explicitly employ the relationship between input partitions, which has not been emphasized by recent studies of matrix refinement. The effectiveness of the link-based approach is empirically demonstrated over 10 data sets (synthetic and real) and three benchmark evaluation measures. The results suggest the new approach is able to efficiently extract information embedded in the input clusterings, and regularly illustrate higher clustering quality in comparison to several state-of-the-art techniques.
Keywords :
graph theory; matrix algebra; pattern clustering; cluster ensemble problem; conventional matrix; data clusterings; feature based partitioning; graph based partitioning; link based approach; output clustering; Clustering algorithms; Data mining; Partitioning algorithms; Clustering; cluster ensembles; cluster relations; data mining.; link-based similarity;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2011.84
Filename :
5765991
Link To Document :
بازگشت