DocumentCode
587341
Title
Parallel particle swarm optimization clustering algorithm based on MapReduce methodology
Author
Aljarah, Ibrahim ; Ludwig, Simone
Author_Institution
Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
fYear
2012
fDate
5-9 Nov. 2012
Firstpage
104
Lastpage
111
Abstract
Large scale data sets are difficult to manage. Difficulties include capture, storage, search, analysis, and visualization of large data. In particular, clustering of large scale data has received considerable attention in the last few years and many application areas such as bioinformatics and social networking are in urgent need of scalable approaches. The new techniques need to make use of parallel computing concepts in order to be able to scale with increasing data set sizes. In this paper, we propose a parallel particle swarm optimization clustering (MR-CPSO) algorithm that is based on MapReduce. The experimental results reveal that MR-CPSO scales very well with increasing data set sizes and achieves a very close to the linear speedup while maintaining the clustering quality. The results also demonstrate that the proposed MR-CPSO algorithm can efficiently process large data sets on commodity hardware.
Keywords
data analysis; data visualisation; parallel algorithms; particle swarm optimisation; pattern clustering; storage allocation; MR-CPSO algorithm; MapReduce methodology; commodity hardware; data analysis; data capture; data search; data storage; data visualization; large-scale data set clustering quality maintenance; parallel computing concepts; parallel particle swarm optimization clustering algorithm; Clustering algorithms; Computational modeling; Data mining; Equations; Mathematical model; Particle swarm optimization; Vectors; Data Clustering; Hadoop; MapReduce; Parallel Processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Nature and Biologically Inspired Computing (NaBIC), 2012 Fourth World Congress on
Conference_Location
Mexico City
Print_ISBN
978-1-4673-4767-9
Type
conf
DOI
10.1109/NaBIC.2012.6402247
Filename
6402247
Link To Document