• DocumentCode
    5283
  • Title

    Data Clustering Using Variants of Rapid Centroid Estimation

  • Author

    Yuwono, Mitchell ; Su, Steven W. ; Moulton, Brace D. ; Nguyen, Hung T.

  • Author_Institution
    Fac. of Eng. & Inf. Technol., Univ. of Technol., Sydney, NSW, Australia
  • Volume
    18
  • Issue
    3
  • fYear
    2014
  • fDate
    Jun-14
  • Firstpage
    366
  • Lastpage
    377
  • Abstract
    Prior work suggests that particle swarm clustering (PSC) can be a powerful tool for solving clustering problems. This paper reviews parts of the PSC algorithm, and shows how and why a new class of algorithms is proposed in an attempt to improve the efficiency and repeatability of PSC. This new implementation is referred to as rapid centroid estimation (RCE). RCE simplifies the update rules of PSC, and greatly reduces computational complexity by enhancing the efficiency of the particle trajectories. On benchmark evaluations with an artificial dataset that has 80 dimensions and a volume of 5000, the RCE variants have iteration times of less than 0.1 s, which compares to iteration times of 2 s for PSC and modified PSC (mPSC). On UC Irvine (UCI) machine learning benchmark datasets, the RCE variants are much faster than PSC and mPSC, and produce clusters with higher purity and greatly improved optimization speeds. For example, the RCE variants are more than 100 times faster than PSC and mPSC on the UCI breast cancer dataset. It can be concluded that the RCE variants are leaner and faster than PSC and mPSC, and that the new optimization strategies also improve clustering quality and repeatability.
  • Keywords
    computational complexity; particle swarm optimisation; pattern clustering; PSC algorithm; RCE variants; UC Irvine machine learning benchmark datasets; UCI breast cancer dataset; artificial dataset; computational complexity; data clustering; mPSC; modified PSC; optimization speeds; particle swarm clustering; particle trajectories; rapid centroid estimation; Clustering algorithms; Computational complexity; Optimization; Particle swarm optimization; Standards; Vectors; White noise; Algorithm design and analysis; algorithm design and analysis; clustering algorithms; computational complexity; particle swarm optimization;
  • fLanguage
    English
  • Journal_Title
    Evolutionary Computation, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-778X
  • Type

    jour

  • DOI
    10.1109/TEVC.2013.2281545
  • Filename
    6595572