DocumentCode :
2985564
Title :
GUISE: Uniform Sampling of Graphlets for Large Graph Analysis
Author :
Bhuiyan, Mansurul A. ; Rahman, Mosaddequr ; Rahman, Mosaddequr ; Al Hasan, Mohammad
Author_Institution :
Dept. of Comput. Sci., Indiana Univ., Indianapolis, IN, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
91
Lastpage :
100
Abstract :
Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, the computation of GFD for a network requires the exact count of embedded graphlets in that network, which is a computationally expensive task. As a result, it is practically infeasible to compute the GFD for even a moderately large network. In this paper, we propose GUISE, which uses a Markov Chain Monte Carlo (MCMC) sampling method for constructing the approximate GFD of a large network. Our experiments on networks with millions of nodes show that GUISE obtains the GFD within few minutes, whereas the exhaustive counting based approach takes several days.
Keywords :
Markov processes; Monte Carlo methods; graph theory; sampling methods; GFD; GUISE; MCMC; Markov chain Monte Carlo sampling method; computationally expensive task; embedded graphlets; graphlet frequency distribution; large graph analysis; uniform graphlet sampling; Biological information theory; Context; Markov processes; Monte Carlo methods; Probability distribution; Radiation detectors; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.87
Filename :
6413912
Link To Document :
بازگشت