DocumentCode :
48572
Title :
Graft: An Efficient Graphlet Counting Method for Large Graph Analysis
Author :
Rahman, Mosaddequr ; Bhuiyan, Mansurul A. ; Al Hasan, Mohammad
Author_Institution :
Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. (IUPUI), Indianapolis, IN, USA
Volume :
26
Issue :
10
fYear :
2014
fDate :
Oct. 2014
Firstpage :
2466
Lastpage :
2478
Abstract :
Majority of the existing works on network analysis study properties that are related to the global topology of a network. Examples of such properties include diameter, power-law exponent, and spectra of graph Laplacian. Such works enhance our understanding of real-life networks, or enable us to generate synthetic graphs with real-life graph properties. However, many of the existing problems on networks require the study of local topological structures of a network, which did not get the deserved attention in the existing works. In this work, we use graphlet frequency distribution (GFD) as an analysis tool for understanding the variance of local topological structure in a network; we also show that it can help in comparing, and characterizing real-life networks. The main bottleneck to obtain GFD is the excessive computation cost for obtaining the frequency of each of the graphlets in a large network. To overcome this, we propose a simple, yet powerful algorithm, called GRAFT, that obtains the approximate graphlet frequency for all graphlets that have up-to five vertices. Comparing to an exact counting algorithm, our algorithm achieves a speedup factor between 10 and 100 for a negligible counting error, which is, on average, less than 5 percent.
Keywords :
graph theory; network theory (graphs); GFD; GRAFT algorithm; diameter; exact counting algorithm; excessive computation cost; global network topology; graph Laplacian spectra; graphlet counting method; graphlet frequency distribution; large graph analysis; local topological structures; network analysis study; power-law exponent; real-life graph properties; synthetic graphs; Accuracy; Algorithm design and analysis; Approximation algorithms; Approximation methods; Biology; Software algorithms; Topology; GFD; Graph mining; graphlet counting;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2013.2297929
Filename :
6702435
Link To Document :
بازگشت