• DocumentCode
    48572
  • Title

    Graft: An Efficient Graphlet Counting Method for Large Graph Analysis

  • Author

    Rahman, Mosaddequr ; Bhuiyan, Mansurul A. ; Al Hasan, Mohammad

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. (IUPUI), Indianapolis, IN, USA
  • Volume
    26
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    2466
  • Lastpage
    2478
  • Abstract
    Majority of the existing works on network analysis study properties that are related to the global topology of a network. Examples of such properties include diameter, power-law exponent, and spectra of graph Laplacian. Such works enhance our understanding of real-life networks, or enable us to generate synthetic graphs with real-life graph properties. However, many of the existing problems on networks require the study of local topological structures of a network, which did not get the deserved attention in the existing works. In this work, we use graphlet frequency distribution (GFD) as an analysis tool for understanding the variance of local topological structure in a network; we also show that it can help in comparing, and characterizing real-life networks. The main bottleneck to obtain GFD is the excessive computation cost for obtaining the frequency of each of the graphlets in a large network. To overcome this, we propose a simple, yet powerful algorithm, called GRAFT, that obtains the approximate graphlet frequency for all graphlets that have up-to five vertices. Comparing to an exact counting algorithm, our algorithm achieves a speedup factor between 10 and 100 for a negligible counting error, which is, on average, less than 5 percent.
  • Keywords
    graph theory; network theory (graphs); GFD; GRAFT algorithm; diameter; exact counting algorithm; excessive computation cost; global network topology; graph Laplacian spectra; graphlet counting method; graphlet frequency distribution; large graph analysis; local topological structures; network analysis study; power-law exponent; real-life graph properties; synthetic graphs; Accuracy; Algorithm design and analysis; Approximation algorithms; Approximation methods; Biology; Software algorithms; Topology; GFD; Graph mining; graphlet counting;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2013.2297929
  • Filename
    6702435