Title :
Parallel glowworm swarm optimization clustering algorithm based on MapReduce
Author :
Al-Madi, Nailah ; Aljarah, Ibrahim ; Ludwig, Simone
Author_Institution :
Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
Abstract :
Clustering large data is one of the recently challenging tasks that is used in many application areas such as social networking, bioinformatics and many others. Traditional clustering algorithms need to be modified to handle the increasing data sizes. In this paper, a scalable design and implementation of glowworm swarm optimization clustering (MRCGSO) using MapReduce is introduced to handle big data. The proposed algorithm uses glowworm swarm optimization to formulate the clustering algorithm. Glowworm swarm optimization is used to take advantage of its ability in solving multimodal problems, which in terms of clustering means finding multiple centroids. MRCGSO uses the MapReduce methodology for the parallelization since it provides fault tolerance, load balancing and data locality. The experimental results reveal that MRCGSO scales very well with increasing data set sizes and achieves a very close to linear speedup while maintaining the clustering quality.
Keywords :
data handling; parallel processing; particle swarm optimisation; pattern clustering; MRCGSO; MapReduce; large data clustering; parallel glowworm swarm optimization clustering algorithm; Algorithm design and analysis; Big data; Clustering algorithms; Equations; Mathematical model; Particle swarm optimization; Partitioning algorithms; Big data clustering; Hadoop; Parallel Processing;
Conference_Titel :
Swarm Intelligence (SIS), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/SIS.2014.7011794