DocumentCode
48572
Title
Graft: An Efficient Graphlet Counting Method for Large Graph Analysis
Author
Rahman, Mosaddequr ; Bhuiyan, Mansurul A. ; Al Hasan, Mohammad
Author_Institution
Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. (IUPUI), Indianapolis, IN, USA
Volume
26
Issue
10
fYear
2014
fDate
Oct. 2014
Firstpage
2466
Lastpage
2478
Abstract
Majority of the existing works on network analysis study properties that are related to the global topology of a network. Examples of such properties include diameter, power-law exponent, and spectra of graph Laplacian. Such works enhance our understanding of real-life networks, or enable us to generate synthetic graphs with real-life graph properties. However, many of the existing problems on networks require the study of local topological structures of a network, which did not get the deserved attention in the existing works. In this work, we use graphlet frequency distribution (GFD) as an analysis tool for understanding the variance of local topological structure in a network; we also show that it can help in comparing, and characterizing real-life networks. The main bottleneck to obtain GFD is the excessive computation cost for obtaining the frequency of each of the graphlets in a large network. To overcome this, we propose a simple, yet powerful algorithm, called GRAFT, that obtains the approximate graphlet frequency for all graphlets that have up-to five vertices. Comparing to an exact counting algorithm, our algorithm achieves a speedup factor between 10 and 100 for a negligible counting error, which is, on average, less than 5 percent.
Keywords
graph theory; network theory (graphs); GFD; GRAFT algorithm; diameter; exact counting algorithm; excessive computation cost; global network topology; graph Laplacian spectra; graphlet counting method; graphlet frequency distribution; large graph analysis; local topological structures; network analysis study; power-law exponent; real-life graph properties; synthetic graphs; Accuracy; Algorithm design and analysis; Approximation algorithms; Approximation methods; Biology; Software algorithms; Topology; GFD; Graph mining; graphlet counting;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2013.2297929
Filename
6702435
Link To Document