Title :
Revealing Density-Based Clustering Structure from the Core-Connected Tree of a Network
Author :
Jianbin Huang ; Heli Sun ; Qinbao Song ; Hongbo Deng ; Jiawei Han
Author_Institution :
Sch. of Software, Xidian Univ., Xi´an, China
Abstract :
Clustering is an important technique for mining the intrinsic community structures in networks. The density-based network clustering method is able to not only detect communities of arbitrary size and shape, but also identify hubs and outliers. However, it requires manual parameter specification to define clusters, and is sensitive to the parameter of density threshold which is difficult to determine. Furthermore, many real-world networks exhibit a hierarchical structure with communities embedded within other communities. Therefore, the clustering result of a global parameter setting cannot always describe the intrinsic clustering structure accurately. In this paper, we introduce a novel density-based network clustering method, called graph-skeleton-based clustering (gSkeletonClu). By projecting an undirected network to its core-connected maximal spanning tree, the clustering problem can be converted to detect core connectivity components on the tree. The density-based clustering of a specific parameter setting and the hierarchical clustering structure both can be efficiently extracted from the tree. Moreover, it provides a convenient way to automatically select the parameter and to achieve the meaningful cluster tree in a network. Extensive experiments on both real-world and synthetic networks demonstrate the superior performance of gSkeletonClu for effective and efficient density-based clustering.
Keywords :
data mining; network theory (graphs); pattern clustering; trees (mathematics); core connectivity components; core-connected maximal spanning tree; core-connected tree; density threshold; density-based clustering structure; density-based network clustering method; gSkeletonClu; graph-skeleton-based clustering; intrinsic clustering structure; intrinsic community structures mining; manual parameter specification; Clustering algorithms; Clustering methods; Communities; Educational institutions; Image edge detection; Joining processes; Periodic structures; Network clustering; density-based method; hierarchical clustering; parameter selection; spanning tree;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2012.100