Title :
Data clustering based on complex network community detection
Author :
De Oliveira, Tatyana B S ; Zhao, Liang ; Faceli, Katti ; De Carvalho, André C P L F
Author_Institution :
Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Carlos
Abstract :
Data clustering is an important technique to extract and understand relevant information in large data sets. In this paper, a clustering algorithm based on graph theoretic models and community detection in complex networks is proposed. Two steps are involved in this processing: The first step is to represent input data as a network and the second one is to partition the network into subnetworks producing data clusters. In the network partition stage, each node has a randomly assigned initial angle and it is gradually updated according to its neighbors angle agreement. Finally, a stable state is reached and nodes belonging to the same cluster have similar angles. This process is repeated, each time a cluster is chosen and results in an hierarchical divisive clustering. Simulation results show two main advantages of the algorithm: the ability to detect clusters in different shapes, densities and sizes and the ability to generate clusters with different refinement degrees. Besides of these, the proposed algorithm presents high robustness and efficiency in clustering.
Keywords :
graph theory; pattern clustering; complex network community detection; data clustering; graph theoretic models; hierarchical divisive clustering; network partition; Bioinformatics; Clustering algorithms; Complex networks; Computer science; Data mining; Face detection; Partitioning algorithms; Proteins; Robustness; Shape;
Conference_Titel :
Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1822-0
Electronic_ISBN :
978-1-4244-1823-7
DOI :
10.1109/CEC.2008.4631080