• DocumentCode
    226643
  • Title

    Parallel glowworm swarm optimization clustering algorithm based on MapReduce

  • Author

    Al-Madi, Nailah ; Aljarah, Ibrahim ; Ludwig, Simone

  • Author_Institution
    Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Clustering large data is one of the recently challenging tasks that is used in many application areas such as social networking, bioinformatics and many others. Traditional clustering algorithms need to be modified to handle the increasing data sizes. In this paper, a scalable design and implementation of glowworm swarm optimization clustering (MRCGSO) using MapReduce is introduced to handle big data. The proposed algorithm uses glowworm swarm optimization to formulate the clustering algorithm. Glowworm swarm optimization is used to take advantage of its ability in solving multimodal problems, which in terms of clustering means finding multiple centroids. MRCGSO uses the MapReduce methodology for the parallelization since it provides fault tolerance, load balancing and data locality. The experimental results reveal that MRCGSO scales very well with increasing data set sizes and achieves a very close to linear speedup while maintaining the clustering quality.
  • Keywords
    data handling; parallel processing; particle swarm optimisation; pattern clustering; MRCGSO; MapReduce; large data clustering; parallel glowworm swarm optimization clustering algorithm; Algorithm design and analysis; Big data; Clustering algorithms; Equations; Mathematical model; Particle swarm optimization; Partitioning algorithms; Big data clustering; Hadoop; Parallel Processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Swarm Intelligence (SIS), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/SIS.2014.7011794
  • Filename
    7011794