DocumentCode
2985564
Title
GUISE: Uniform Sampling of Graphlets for Large Graph Analysis
Author
Bhuiyan, Mansurul A. ; Rahman, Mosaddequr ; Rahman, Mosaddequr ; Al Hasan, Mohammad
Author_Institution
Dept. of Comput. Sci., Indiana Univ., Indianapolis, IN, USA
fYear
2012
fDate
10-13 Dec. 2012
Firstpage
91
Lastpage
100
Abstract
Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, the computation of GFD for a network requires the exact count of embedded graphlets in that network, which is a computationally expensive task. As a result, it is practically infeasible to compute the GFD for even a moderately large network. In this paper, we propose GUISE, which uses a Markov Chain Monte Carlo (MCMC) sampling method for constructing the approximate GFD of a large network. Our experiments on networks with millions of nodes show that GUISE obtains the GFD within few minutes, whereas the exhaustive counting based approach takes several days.
Keywords
Markov processes; Monte Carlo methods; graph theory; sampling methods; GFD; GUISE; MCMC; Markov chain Monte Carlo sampling method; computationally expensive task; embedded graphlets; graphlet frequency distribution; large graph analysis; uniform graphlet sampling; Biological information theory; Context; Markov processes; Monte Carlo methods; Probability distribution; Radiation detectors; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location
Brussels
ISSN
1550-4786
Print_ISBN
978-1-4673-4649-8
Type
conf
DOI
10.1109/ICDM.2012.87
Filename
6413912
Link To Document