DocumentCode :
3600884
Title :
GPUSCAN: GPU-Based Parallel Structural Clustering Algorithm for Networks
Author :
Stovall, Thomas Ryan ; Kockara, Sinan ; Avci, Recep
Author_Institution :
Dept. of Comput. Sci., Univ. of Central Arkansas, Conway, AR, USA
Volume :
26
Issue :
12
fYear :
2015
Firstpage :
3381
Lastpage :
3393
Abstract :
This paper presents a massively parallel implementation of a prominent network clustering algorithm, the structural clustering algorithm for networks (SCAN), on a graphical processing unit (GPU). SCAN is a fast and efficient clustering technique for finding hidden communities and isolating hubs/outliers within a network. However, for very large networks, it still takes considerable amount of time. With the introduction of massively parallel Compute Unified Device Architecture (CUDA) by Nvidia, applications properly employing GPUs are demonstrating high speed up. In current study, GPUSCAN, a CUDA based parallel implementation of SCAN, is presented. SCAN´s computation steps have been carefully redesigned to run very efficiently on the GPU by transforming SCAN into a series of highly regular and independent concurrent operations. All intermediate data structures are created in the GPU to efficiently benefit from GPU´s memory hierarchy. How these structures reformed and represented in the GPU memory hierarchy are illustrated. Now, through GPUSCAN, a large network or a batch of disjoint networks can be offloaded to the GPU for very fast and equivalent structural clustering. The performance of the GPU accelerated structural clustering has been shown to be much faster than the sequential CPU implementation. Both GPUSCAN and SCAN are tested on different size artificial and real-world networks. Results indicate that network becomes larger GPUSCAN significantly over performs SCAN. In tested datasets, speed-up of over 500-fold is achieved. For instance, calculating structural similarity and clustering of 5.5 million edges of the California road network in GPUSCAN is 513-fold faster than the serial version of SCAN.
Keywords :
concurrency (computers); data structures; graphics processing units; parallel architectures; pattern clustering; storage management; CUDA based parallel implementation; California road network; GPU accelerated structural clustering; GPU memory hierarchy; GPU-based parallel structural clustering algorithm for networks; GPUSCAN; Nvidia; clustering technique; concurrent operations; data structures; disjoint networks; hidden community finding; massively parallel compute unified device architecture; massively parallel implementation; network clustering algorithm; network hub isolation; network outliers isolation; structural similarity; Algorithm design and analysis; Clustering algorithms; Graphics processing units; Instruction sets; GPGPU; Structural clustering for networks; high-performance computing; social networks; structural clustering for networks; very large networks;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2014.2374607
Filename :
6967853
Link To Document :
بازگشت