DocumentCode :
2891819
Title :
Efficient Calculation of Structural Similarity Threshold for the SCAN Network Clustering Algorithm
Author :
Yip, Vincent ; Kockara, Sinan ; Hu, Chenyi
Author_Institution :
Comput. Inf. Syst., Umpqua Community Coll., Roseburg, OR, USA
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
600
Lastpage :
603
Abstract :
Community detection algorithms play an important role in discovering knowledge in networks. The Structural Clustering Algorithm for Network (SCAN) is a community detection algorithm which is capable of detecting hubs and outliers, in addition to cluster members. The term hub means node with the ability of collecting and delivering information among clusters while outlier is considered as a noise in the data. Currently, researchers use exhaustive search to determine the structural similarity threshold value (ε) in the SCAN. This paper reports a new approach of using interval ε value to narrow the searching domain for proper ε value for the SCAN. The approach first adopts computational results produced by the Fast Modularity and the Walktrap algorithms to bind the number of clusters of a network and then determine the interval for ε value. For each of our test datasets, the interval prediction reliably finds the true number of clusters. More importantly, the proposed prediction method helps users to eliminate an average of 67.7% of inappropriate ε values used to generate clusters.
Keywords :
data mining; network theory (graphs); pattern clustering; search problems; SCAN network clustering algorithm; community detection algorithms; exhaustive search; fast modularity; hubs detection; interval prediction; knowledge discovery; outlier detection; searching domain; structural clustering algorithm for network; structural similarity threshold value; walktrap algorithms; Algorithm design and analysis; Clustering algorithms; Communities; Detection algorithms; Educational institutions; Partitioning algorithms; Prediction algorithms; SCAN; community detection; network clustering; structural clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
Type :
conf
DOI :
10.1109/BIBM.2011.49
Filename :
6120510
Link To Document :
بازگشت