• DocumentCode
    2914335
  • Title

    Data clustering based on complex network community detection

  • Author

    De Oliveira, Tatyana B S ; Zhao, Liang ; Faceli, Katti ; De Carvalho, André C P L F

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Carlos
  • fYear
    2008
  • fDate
    1-6 June 2008
  • Firstpage
    2121
  • Lastpage
    2126
  • Abstract
    Data clustering is an important technique to extract and understand relevant information in large data sets. In this paper, a clustering algorithm based on graph theoretic models and community detection in complex networks is proposed. Two steps are involved in this processing: The first step is to represent input data as a network and the second one is to partition the network into subnetworks producing data clusters. In the network partition stage, each node has a randomly assigned initial angle and it is gradually updated according to its neighbors angle agreement. Finally, a stable state is reached and nodes belonging to the same cluster have similar angles. This process is repeated, each time a cluster is chosen and results in an hierarchical divisive clustering. Simulation results show two main advantages of the algorithm: the ability to detect clusters in different shapes, densities and sizes and the ability to generate clusters with different refinement degrees. Besides of these, the proposed algorithm presents high robustness and efficiency in clustering.
  • Keywords
    graph theory; pattern clustering; complex network community detection; data clustering; graph theoretic models; hierarchical divisive clustering; network partition; Bioinformatics; Clustering algorithms; Complex networks; Computer science; Data mining; Face detection; Partitioning algorithms; Proteins; Robustness; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-1822-0
  • Electronic_ISBN
    978-1-4244-1823-7
  • Type

    conf

  • DOI
    10.1109/CEC.2008.4631080
  • Filename
    4631080