• DocumentCode
    2985564
  • Title

    GUISE: Uniform Sampling of Graphlets for Large Graph Analysis

  • Author

    Bhuiyan, Mansurul A. ; Rahman, Mosaddequr ; Rahman, Mosaddequr ; Al Hasan, Mohammad

  • Author_Institution
    Dept. of Comput. Sci., Indiana Univ., Indianapolis, IN, USA
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    91
  • Lastpage
    100
  • Abstract
    Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, the computation of GFD for a network requires the exact count of embedded graphlets in that network, which is a computationally expensive task. As a result, it is practically infeasible to compute the GFD for even a moderately large network. In this paper, we propose GUISE, which uses a Markov Chain Monte Carlo (MCMC) sampling method for constructing the approximate GFD of a large network. Our experiments on networks with millions of nodes show that GUISE obtains the GFD within few minutes, whereas the exhaustive counting based approach takes several days.
  • Keywords
    Markov processes; Monte Carlo methods; graph theory; sampling methods; GFD; GUISE; MCMC; Markov chain Monte Carlo sampling method; computationally expensive task; embedded graphlets; graphlet frequency distribution; large graph analysis; uniform graphlet sampling; Biological information theory; Context; Markov processes; Monte Carlo methods; Probability distribution; Radiation detectors; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.87
  • Filename
    6413912