DocumentCode
3600046
Title
Efficient Parallel Community Detection in Large Edge-Intensive Networks
Author
Guangliang Gao ; Zhan Bu ; Zhiang Wu ; Yuan Li ; Jie Cao
Author_Institution
Coll. of Comput. Sci. & Eng., Nanjing Univ. of Sci. & Technol., Nanjing, China
fYear
2014
Firstpage
253
Lastpage
260
Abstract
Community detection is a classic and very difficult task in social network analysis. A large number of methods have been developed for both efficient and effective community detection. However, much of the existing methods are heavily dependent on the number of links in the network, and thus they often suffer from the computational inefficiency when meeting large edge-intensive networks. In this paper, we present a novel SIMPLifying and Ensembling (SIMPLE) framework for parallel community detection. It employs the random link sampling to simplify the network and obtain basic partitionings on every sampled graphs. Then, the K-means-based Consensus Clustering is used to ensemble a number of basic partitionings to get high-quality community structures. Meanwhile, steps of random sampling and sampled graph partitioning are encapsulated into MapReduce to further improve the efficiency. Experiments on four real-world social networks analyze key parameters and factors inside SIMPLE, and demonstrate the effectiveness of the SIMPLE.
Keywords
complex networks; graph theory; information analysis; network theory (graphs); parallel programming; pattern clustering; sampling methods; social networking (online); MapReduce; SIMPLE framework; k-means-based consensus clustering; large edge-intensive networks; parallel community detection; random sampling; sampled graph partitioning; simplifying and ensembling framework; social network analysis; Approximation algorithms; Clustering algorithms; Communities; Image edge detection; Partitioning algorithms; Twitter; Social Network; Community Detection; Random;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Cloud and Big Data (CBD), 2014 Second International Conference on
Print_ISBN
978-1-4799-8086-4
Type
conf
DOI
10.1109/CBD.2014.52
Filename
7176102
Link To Document