Title :
Accelerating outlier detection with intra- and inter-node parallelism
Author :
Angiulli, Fabrizio ; Basta, Stefano ; Lodi, Stefano ; Sartori, Claudio
Author_Institution :
DIMES, UNICAL, Rende, Italy
Abstract :
Outlier detection is a data mining task consisting in the discovery of observations which deviate substantially from the rest of the data, and has many important practical applications. Outlier detection in very large data sets is however computationally very demanding and the size limit of the data that can be elaborated is considerably pushed forward by mixing three ingredients: efficient algorithms, intra-cpu parallelism of high-performance architectures, network level parallelism. In this paper we propose an outlier detection algorithm able to exploit the internal parallelism of a GPU and the external parallelism of a cluster of GPU. The algorithm is the evolution of our previous solutions which considered either GPU or network level parallelism. We discuss a set of large scale experiments executed in a supercomputing facility and show the speedup obtained with varying number of nodes.
Keywords :
data mining; parallel programming; data mining; high-performance architecture; inter-node parallelism; intra-CPU parallelism; intra-node parallelism; network level parallelism; outlier detection; supercomputing facility; Algorithm design and analysis; Arrays; Data mining; Graphics processing units; Instruction sets; Kernel; Parallel processing; Distance-based outliers; GPU; high performance computing; parallel algorithms;
Conference_Titel :
High Performance Computing & Simulation (HPCS), 2014 International Conference on
Conference_Location :
Bologna
Print_ISBN :
978-1-4799-5312-7
DOI :
10.1109/HPCSim.2014.6903723