Title :
Combining multiple clusterings by soft correspondence
Author :
Long, Bo ; Zhang, Zhongfei Mark ; Yu, Philip S.
Author_Institution :
State Univ. of New York, Binghamton, NY, USA
Abstract :
Combining multiple clusterings arises in various important data mining scenarios. However, finding a consensus clustering from multiple clusterings is a challenging task because there is no explicit correspondence between the classes from different clusterings. We present a new framework based on soft correspondence to directly address the correspondence problem in combining multiple clusterings. Under this framework, we propose a novel algorithm that iteratively computes the consensus clustering and correspondence matrices using multiplicative updating rules. This algorithm provides a final consensus clustering as well as correspondence matrices that gives intuitive interpretation of the relations between the consensus clustering and each clustering from clustering ensembles. Extensive experimental evaluations also demonstrate the effectiveness and potential of this framework as well as the algorithm for discovering a consensus clustering from multiple clusterings.
Keywords :
data mining; pattern clustering; consensus clustering; correspondence matrices; data mining; multiple clusterings; multiplicative updating rule; soft correspondence; Clustering algorithms; Data analysis; Data mining; Information analysis; Iterative algorithms; Partitioning algorithms; Robust stability; Shape; Uniform resource locators; Unsupervised learning;
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
Print_ISBN :
0-7695-2278-5
DOI :
10.1109/ICDM.2005.45