DocumentCode
226643
Title
Parallel glowworm swarm optimization clustering algorithm based on MapReduce
Author
Al-Madi, Nailah ; Aljarah, Ibrahim ; Ludwig, Simone
Author_Institution
Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
1
Lastpage
8
Abstract
Clustering large data is one of the recently challenging tasks that is used in many application areas such as social networking, bioinformatics and many others. Traditional clustering algorithms need to be modified to handle the increasing data sizes. In this paper, a scalable design and implementation of glowworm swarm optimization clustering (MRCGSO) using MapReduce is introduced to handle big data. The proposed algorithm uses glowworm swarm optimization to formulate the clustering algorithm. Glowworm swarm optimization is used to take advantage of its ability in solving multimodal problems, which in terms of clustering means finding multiple centroids. MRCGSO uses the MapReduce methodology for the parallelization since it provides fault tolerance, load balancing and data locality. The experimental results reveal that MRCGSO scales very well with increasing data set sizes and achieves a very close to linear speedup while maintaining the clustering quality.
Keywords
data handling; parallel processing; particle swarm optimisation; pattern clustering; MRCGSO; MapReduce; large data clustering; parallel glowworm swarm optimization clustering algorithm; Algorithm design and analysis; Big data; Clustering algorithms; Equations; Mathematical model; Particle swarm optimization; Partitioning algorithms; Big data clustering; Hadoop; Parallel Processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Swarm Intelligence (SIS), 2014 IEEE Symposium on
Conference_Location
Orlando, FL
Type
conf
DOI
10.1109/SIS.2014.7011794
Filename
7011794
Link To Document