DocumentCode
3437499
Title
Scalable Flow-Based Community Detection for Large-Scale Network Analysis
Author
Seung-Hee Bae ; Halperin, Dan ; West, Jevin ; Rosvall, Martin ; Howe, Brandon
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Washington, Seattle, WA, USA
fYear
2013
fDate
7-10 Dec. 2013
Firstpage
303
Lastpage
310
Abstract
Community-detection is a powerful approach to uncover important structures in large networks. Since networks often describe flow of some entity, flow-based community-detection methods are particularly interesting. One such algorithm is called Info map, which optimizes the objective function known as the map equation. While Info map is known to be an effective algorithm, its serial implementation cannot take advantage of multicore processing in modern computers. In this paper, we propose a novel parallel generalization of Info map called Relax Map. This algorithm relaxes concurrency assumptions to avoid lock overhead, achieving 70% parallel efficiency in shared-memory multicore experiments while exhibiting similar convergence properties and finding similar community structures as the serial algorithm. We evaluate our approach on a variety of real graph datasets as well as synthetic graphs produced by a popular graph generator used for benchmarking community detection algorithms. We describe the algorithm, the experiments, and some emerging research directions in high-performance community detection on massive graphs.
Keywords
concurrency control; mathematics computing; network theory (graphs); parallel algorithms; shared memory systems; Info map; Relax Map; community detection algorithms; concurrency assumptions; convergence properties; graph datasets; graph generator; high-performance community detection; large-scale network analysis; lock overhead; map equation; multicore processing; objective function; parallel efficiency; parallel generalization; scalable flow-based community detection; serial algorithm; shared-memory multicore experiments; synthetic graphs; Benchmark testing; Clustering algorithms; Communities; Equations; Heuristic algorithms; Mathematical model; Parallel algorithms; Community Detection; Infomap; The map equation;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location
Dallas, TX
Print_ISBN
978-1-4799-3143-9
Type
conf
DOI
10.1109/ICDMW.2013.138
Filename
6753935
Link To Document