DocumentCode :
123712
Title :
Parallelizing K-Means Algorithm for 1-D Data Using MPI
Author :
Savvas, Ilias K. ; Sofianidou, Georgia N.
Author_Institution :
Dept. of Comput. Sci. & Eng., T.E.I. of Thessaly, Larissa, Greece
fYear :
2014
fDate :
23-25 June 2014
Firstpage :
179
Lastpage :
184
Abstract :
Nowadays, colossal amount of information is produced by computational systems and electronic instruments such as telescopes, medical devices and so on. To explore these petabytes of data, new fast algorithms must be discovered or old ones may be redesigned. One of the most popular and useful techniques in order to discover and extract information from data pools is clustering, and k-means is an algorithm which clusters data according its characteristics. Its main disadvantage is its computational complexity which makes the technique very difficult to apply on big data sets. Although k-means is a very well studied technique, a fully parallel version of it has not been explored yet. In this work, a parallel version of the k-means is presented for 1-d objects. The experimental results obtained are inline with the theoretical outcome and prove both the correctness and the effectiveness of the technique.
Keywords :
Big Data; application program interfaces; computational complexity; information retrieval; message passing; parallel algorithms; pattern clustering; 1D data; MPI; big data sets; computational complexity; computational systems; data pools; electronic instruments; information discovery; information extraction; medical devices; message passing interface; parallelizing k-means algorithm; telescopes; Algorithm design and analysis; Clustering algorithms; Computational complexity; Data mining; Equations; Peer-to-peer computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
WETICE Conference (WETICE), 2014 IEEE 23rd International
Conference_Location :
Parma
Type :
conf
DOI :
10.1109/WETICE.2014.13
Filename :
6927046
Link To Document :
بازگشت