Title :
Graph sampling: Estimation of degree distributions
Author :
Deri, Joya A. ; Moura, Jose M. F.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Online social networks and the World Wide Web lead to large underlying graphs that might not be completely known because of their size. To compute reliable statistics, we have to resort to sampling the network. In this paper, we investigate four network sampling methods to estimate the network degree distribution and the so-called biased degree distribution of a 3.7 million wireless subscriber network. We measure the quality of our estimates of the degree distributions by using the Kolmogorov-Smirnov statistic. Among all four sampling methods, node sampling yields Pareto optimal sample sizes in terms of the Kolomogorov-Smirnov statistic for the degree distribution, while node-by-edge sampling yields optimal sample sizes for the biased distribution. We also find that random walk sampling performs better than the Metropolis-Hastings random walk.
Keywords :
Internet; Pareto optimisation; signal sampling; social networking (online); subscriber loops; Kolmogorov-Smirnov statistic; Pareto optimal sample; World Wide Web; biased degree distribution; degree distribution estimation; graph sampling; network sampling; online social networks; random walk sampling; wireless subscriber network; Educational institutions; Knee; Maximum likelihood estimation; Measurement; Pareto optimization; Sampling methods; Graph sampling; Markov Chain Monte Carlo (MCMC) sampling; Pareto optimality; large-scale networks;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6638918