DocumentCode
592186
Title
On the estimation accuracy of degree distributions from graph sampling
Author
Ribeiro, Bernardete ; Towsley, Don
Author_Institution
Comput. Sci. Dept., Univ. of Massachusetts, Amherst, MA, USA
fYear
2012
fDate
10-13 Dec. 2012
Firstpage
5240
Lastpage
5247
Abstract
Estimating characteristics of large graphs via sampling is vital in the study of complex networks. In this work, we study the Mean Squared Error (MSE) associated with different sampling methods for the degree distribution. These sampling methods include independent random vertex (RV) and random edge (RE) sampling, and crawling methods such as random walks (RWs) and the widely used Metropolis-Hastings algorithm for uniformly sampling vertices (MHRWu). We see that the RW MSE is upper bounded by a quantity that is proportional to the RE MSE and inversely proportional to the spectral gap of the RW transition probability matrix. We also determine conditions under which RW is preferable to RV. Finally, we present an approximation of the MHRWu MSE. We evaluate the accuracy of our approximations and bounds through simulations on large real world graphs.
Keywords
complex networks; graph theory; matrix algebra; mean square error methods; probability; sampling methods; MHRWu; MSE; Metropolis-Hastings algorithm; RE; RV; RW transition probability matrix; complex networks; crawling methods; degree distributions; estimation accuracy; graph sampling; independent random vertex sampling; large graph characteristics estimation; large real world graphs; mean squared error; random edge sampling; random walks; Approximation methods; Eigenvalues and eigenfunctions; Estimation error; Random variables; Reactive power; Standards; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location
Maui, HI
ISSN
0743-1546
Print_ISBN
978-1-4673-2065-8
Electronic_ISBN
0743-1546
Type
conf
DOI
10.1109/CDC.2012.6425857
Filename
6425857
Link To Document