DocumentCode
2209529
Title
Patterns on the Connected Components of Terabyte-Scale Graphs
Author
Kang, U. ; McGlohon, Mary ; Akoglu, Leman ; Faloutsos, Christos
fYear
2010
fDate
13-17 Dec. 2010
Firstpage
875
Lastpage
880
Abstract
How do connected components evolve? What are the regularities that govern the dynamic growth process and the static snapshot of the connected components? In this work, we study patterns in connected components of large, real-world graphs. First, we study one of the largest static Web graphs with billions of nodes and edges and analyze the regularities among the connected components using GFD(Graph Fractal Dimension) as our main tool. Second, we study several time evolving graphs and find dynamic patterns and rules that govern the dynamics of connected components. We analyze the growth rates of top connected components and study their relation over time. We also study the probability that a newcomer absorbs to disconnected components as a function of the current portion of the disconnected components and the degree of the newcomer. Finally, we propose a generative model that explains both the dynamic growth process and the static regularities of connected components.
Keywords
Internet; data mining; fractals; graph theory; probability; connected components; dynamic growth process; generative model; graph fractal dimension; real-world graphs; static Web graphs; static regularities; terabyte scale graph; CommunityConnection Model; Evolution of Connected Components; Graph Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-4786
Print_ISBN
978-1-4244-9131-5
Electronic_ISBN
1550-4786
Type
conf
DOI
10.1109/ICDM.2010.121
Filename
5694054
Link To Document