Title :
Progressive clustering of networks using Structure-Connected Order of Traversal
Author :
Bortner, Dustin ; Han, Jiawei
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
Network clustering enables us to view a complex network at the macro level, by grouping its nodes into units whose characteristics and interrelationships are easier to analyze and understand. State-of-the-art network partitioning methods are unable to identify hubs and outliers. A recently proposed algorithm, SCAN, overcomes this difficulty. However, it requires a minimum similarity parameter ε but provides no automated way to find it. Thus, it must be rerun for each ε value and does not capture the variety or hierarchy of clusters. We propose a new algorithm, SCOT (or Structure-Connected Order of Traversal), that produces a length-n sequence containing all possible ε-clusterings. We also propose a second new algorithm, HintClus (or Hierarchy-Induced Network Clustering), which hierarchically clusters the network by finding only the best cluster boundaries (it is not agglomerative). Results on model-based synthetic network data and real data show that SCOT's execution time is comparable to that of SCAN, that HintClus runs in negligible time, and that HintClus produces sensible clusters in the presence of noise.
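To illustrate why the choice of ε matters, the following is a minimal sketch of the structural similarity commonly used with SCAN, assuming the usual definition: the number of shared members of two closed neighborhoods divided by the geometric mean of their sizes. The function names `structural_similarity` and `eps_neighborhood` and the toy graph are hypothetical, chosen for illustration only. This is not the SCOT or HintClus algorithm itself.

```python
import math

def structural_similarity(adj, u, v):
    """Similarity of u and v over closed neighborhoods (each node counts itself).

    Assumed definition: |N[u] & N[v]| / sqrt(|N[u]| * |N[v]|).
    """
    nu = adj[u] | {u}
    nv = adj[v] | {v}
    return len(nu & nv) / math.sqrt(len(nu) * len(nv))

def eps_neighborhood(adj, u, eps):
    """Nodes in u's closed neighborhood whose similarity to u is at least eps."""
    return {v for v in adj[u] | {u} if structural_similarity(adj, u, v) >= eps}

# Toy graph (hypothetical): two triangles joined by the bridge edge 2-3.
adj = {
    0: {1, 2},
    1: {0, 2},
    2: {0, 1, 3},
    3: {2, 4, 5},
    4: {3, 5},
    5: {3, 4},
}

# A strict threshold keeps node 2 inside its own triangle ...
tight = eps_neighborhood(adj, 2, 0.7)
# ... while a looser one also pulls in node 3 across the bridge.
loose = eps_neighborhood(adj, 2, 0.5)
```

In this toy graph the bridge edge has a lower similarity than the edges inside each triangle, so different ε values give different neighborhoods. That sensitivity is the reason a single SCAN run with a fixed ε cannot expose the full range of clusterings.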
Keywords :
artificial intelligence; database management systems; pattern clustering; HintClus; SCOT; hierarchy induced network clustering; macro level complex network; network progressive clustering; state-of-the-art network partitioning methods; structure connected order of traversal; traversal structure connected order; Biological system modeling; Clustering algorithms; Complex networks; Computer science; Iterative algorithms; Partitioning algorithms; Simulated annealing; Size measurement; Social network services;
Conference_Title :
Data Engineering (ICDE), 2010 IEEE 26th International Conference on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4244-5445-7
Electronic_ISBN :
978-1-4244-5444-0
DOI :
10.1109/ICDE.2010.5447895