DocumentCode
2204847
Title
A time-efficient connected densest subgraph discovery algorithm for big data
Author
Wu, Bo ; Shen, Haiying
Author_Institution
Department of Electrical and Computer Engineering, Clemson University, South Carolina 29634, USA
fYear
2015
fDate
6-7 Aug. 2015
Firstpage
305
Lastpage
314
Abstract
In this paper, we propose a time-efficient and exact algorithm for the problem of discovering the densest subgraph in big data. Current algorithms for solving this problem have three problems: i) they cannot handle the dilemma between the efficiency of handing big data and the precision of the discovered densest subgraph; ii) they cannot take advantage of both the parallel computing on MapReduce and in-memory computing on one computer; iii) their applicability to different kinds of graphs has not been discussed. Our proposed algorithm combines the MapReduce parallel computing with in-memory computing on one computer together to improve the efficiency and precision of discovering the densest subgraphs. The algorithm consists of two computational phases: i) the graph reduction in the MapReduce framework; ii) the densest subgraph discovery in memory. Further, we theoretically analyze the correctness of this algorithm and its applicability in different natural graphs. We conduct extensive experimental evaluations in a MapReduce framework on both massive real-world graphs and simulated graphs to test our algorithm in comparison with other algorithms. Experimental results show that our algorithm is more time-efficient and precise than other algorithms.
Keywords
Algorithm design and analysis; Approximation algorithms; Big data; Complex networks; Computers; Parallel processing; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on
Conference_Location
Boston, MA, USA
Type
conf
DOI
10.1109/NAS.2015.7255197
Filename
7255197
Link To Document