• DocumentCode
    2209529
  • Title
    Patterns on the Connected Components of Terabyte-Scale Graphs
  • Author
    Kang, U. ; McGlohon, Mary ; Akoglu, Leman ; Faloutsos, Christos
  • fYear
    2010
  • fDate
    13-17 Dec. 2010
  • Firstpage
    875
  • Lastpage
    880
  • Abstract
    How do connected components evolve? What are the regularities that govern the dynamic growth process and the static snapshot of the connected components? In this work, we study patterns in connected components of large, real-world graphs. First, we study one of the largest static Web graphs, with billions of nodes and edges, and analyze the regularities among the connected components using GFD (Graph Fractal Dimension) as our main tool. Second, we study several time-evolving graphs and find dynamic patterns and rules that govern the dynamics of connected components. We analyze the growth rates of top connected components and study their relation over time. We also study the probability that a newcomer is absorbed into disconnected components as a function of the current portion of the disconnected components and the degree of the newcomer. Finally, we propose a generative model that explains both the dynamic growth process and the static regularities of connected components.
  • Keywords
    Internet; data mining; fractals; graph theory; probability; connected components; dynamic growth process; generative model; graph fractal dimension; real-world graphs; static Web graphs; static regularities; terabyte scale graph; CommunityConnection Model; Evolution of Connected Components; Graph Mining
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Mining (ICDM), 2010 IEEE 10th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-9131-5
  • Electronic_ISBN
    1550-4786
  • Type
    conf
  • DOI
    10.1109/ICDM.2010.121
  • Filename
    5694054