• DocumentCode
    2138031
  • Title

    An improved affinity propagation clustering algorithm for large-scale data sets

  • Author

    Xiaonan Liu ; Meijuan Yin ; Junyong Luo ; Wuping Chen

  • Author_Institution
    State Key Lab. or Math. Eng. & Adv. Comput., Zhengzhou, China
  • fYear
    2013
  • fDate
    23-25 July 2013
  • Firstpage
    894
  • Lastpage
    899
  • Abstract
    Affinity Propagation (AP) clustering does not need to set the number of clusters, and has advantages on efficiency and accuracy, but is not suitable for large-scale data clustering. To ensure both a low time complexity and a good accuracy for the clustering method of affinity propagation on large-scale data clustering, an improved AP clustering algorithm named hierarchical affinity propagation (HAP) is proposed, which clusters data points by using AP algorithm several times on different level data. The data set to be clustered is firstly divided into several subsets, each of which can be efficiently clustered by AP algorithm. Then, the AP algorithm is performed on each subset to respectively select cluster centers of each subset. Further, AP clustering was again implemented on all the local cluster centers to select well-suited global exemplars of whole data set. Finally, to efficiently and accurately cluster data points in a large-scale, all the data points are clustered by the similarities between each data point and the global exemplars. The experimental results on real and simulated data sets show that, compared with the traditional AP and adaptive AP algorithm, the HAP algorithm can greatly reduce the clustering time consumption with a relatively better clustering results.
  • Keywords
    computational complexity; pattern clustering; AP; HAP; affinity propagation clustering algorithm; cluster centers; clustering time consumption; hierarchical affinity propagation; large-scale data clustering; large-scale data sets; local cluster centers; time complexity; well-suited global exemplars; Algorithm design and analysis; Availability; Clustering algorithms; Computers; Indexes; Memory management; Partitioning algorithms; Data clustering; affinity propagation; clustering center; hierarchical selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2013 Ninth International Conference on
  • Conference_Location
    Shenyang
  • Type

    conf

  • DOI
    10.1109/ICNC.2013.6818103
  • Filename
    6818103