• DocumentCode
    587341
  • Title

    Parallel particle swarm optimization clustering algorithm based on MapReduce methodology

  • Author

    Aljarah, Ibrahim ; Ludwig, Simone

  • Author_Institution
    Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
  • fYear
    2012
  • fDate
    5-9 Nov. 2012
  • Firstpage
    104
  • Lastpage
    111
  • Abstract
    Large scale data sets are difficult to manage. Difficulties include capture, storage, search, analysis, and visualization of large data. In particular, clustering of large scale data has received considerable attention in the last few years and many application areas such as bioinformatics and social networking are in urgent need of scalable approaches. The new techniques need to make use of parallel computing concepts in order to be able to scale with increasing data set sizes. In this paper, we propose a parallel particle swarm optimization clustering (MR-CPSO) algorithm that is based on MapReduce. The experimental results reveal that MR-CPSO scales very well with increasing data set sizes and achieves a very close to the linear speedup while maintaining the clustering quality. The results also demonstrate that the proposed MR-CPSO algorithm can efficiently process large data sets on commodity hardware.
  • Keywords
    data analysis; data visualisation; parallel algorithms; particle swarm optimisation; pattern clustering; storage allocation; MR-CPSO algorithm; MapReduce methodology; commodity hardware; data analysis; data capture; data search; data storage; data visualization; large-scale data set clustering quality maintenance; parallel computing concepts; parallel particle swarm optimization clustering algorithm; Clustering algorithms; Computational modeling; Data mining; Equations; Mathematical model; Particle swarm optimization; Vectors; Data Clustering; Hadoop; MapReduce; Parallel Processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Nature and Biologically Inspired Computing (NaBIC), 2012 Fourth World Congress on
  • Conference_Location
    Mexico City
  • Print_ISBN
    978-1-4673-4767-9
  • Type

    conf

  • DOI
    10.1109/NaBIC.2012.6402247
  • Filename
    6402247