Title :
A min-max cut algorithm for graph partitioning and data clustering
Author :
Ding, Chris H Q ; He, Xiaofeng ; Zha, Hongyuan ; Gu, Ming ; Simon, Horst D.
Author_Institution :
NERSC Div., Lawrence Berkeley Lab., CA, USA
Abstract :
An important application of graph partitioning is data clustering using a graph model - the pairwise similarities between all data objects form a weighted graph adjacency matrix that contains all necessary information for clustering. In this paper, we propose a new algorithm for graph partitioning with an objective function that follows the min-max clustering principle. The relaxed version of the optimization of the min-max cut objective function leads to the Fiedler vector in spectral graph partitioning. Theoretical analyses of min-max cut indicate that it leads to balanced partitions, and lower bounds are derived. The min-max cut algorithm is tested on newsgroup data sets and is found to out-perform other current popular partitioning/clustering methods. The linkage-based refinements to the algorithm further improve the quality of clustering substantially. We also demonstrate that a linearized search order based on linkage differential is better than that based on the Fiedler vector, providing another effective partitioning method
Keywords :
data mining; graph theory; minimax techniques; pattern clustering; vectors; Fiedler vector; algorithm performance; balanced partitions; clustering quality; data clustering; data object pairwise similarities; graph model; graph partitioning; linearized search order; linkage differential; linkage-based refinements; lower bounds; min-max cut algorithm; newsgroup data sets; objective function; relaxed optimization; spectral graph partition; weighted graph adjacency matrix; Application software; Clustering algorithms; Clustering methods; Computer science; Helium; Laboratories; Mathematical model; Mathematics; Partitioning algorithms; Testing;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989507