Title :
On the estimation accuracy of degree distributions from graph sampling
Author :
Ribeiro, Bernardete ; Towsley, Don
Author_Institution :
Comput. Sci. Dept., Univ. of Massachusetts, Amherst, MA, USA
Abstract :
Estimating characteristics of large graphs via sampling is vital in the study of complex networks. In this work, we study the Mean Squared Error (MSE) associated with different sampling methods for the degree distribution. These sampling methods include independent random vertex (RV) and random edge (RE) sampling, and crawling methods such as random walks (RWs) and the widely used Metropolis-Hastings algorithm for uniformly sampling vertices (MHRWu). We see that the RW MSE is upper bounded by a quantity that is proportional to the RE MSE and inversely proportional to the spectral gap of the RW transition probability matrix. We also determine conditions under which RW is preferable to RV. Finally, we present an approximation of the MHRWu MSE. We evaluate the accuracy of our approximations and bounds through simulations on large real world graphs.
Keywords :
complex networks; graph theory; matrix algebra; mean square error methods; probability; sampling methods; MHRWu; MSE; Metropolis-Hastings algorithm; RE; RV; RW transition probability matrix; complex networks; crawling methods; degree distributions; estimation accuracy; graph sampling; independent random vertex sampling; large graph characteristics estimation; large real world graphs; mean squared error; random edge sampling; random walks; Approximation methods; Eigenvalues and eigenfunctions; Estimation error; Random variables; Reactive power; Standards; Upper bound;
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
DOI :
10.1109/CDC.2012.6425857