• DocumentCode
    3124442
  • Title

    Beyond ´Caveman Communities´: Hubs and Spokes for Graph Compression and Mining

  • Author

    Kang, U. ; Faloutsos, Christos

  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    300
  • Lastpage
    309
  • Abstract
    Given a real world graph, how should we lay-out its edges? How can we compress it? These questions are closely related, and the typical approach so far is to find clique-like communities, like the `cavemen graph´, and compress them. We show that the block-diagonal mental image of the `cavemen graph´ is the wrong paradigm, in full agreement with earlier results that real world graphs have no good cuts. Instead, we propose to envision graphs as a collection of hubs connecting spokes, with super-hubs connecting the hubs, and so on, recursively. Based on the idea, we propose the Slash Burn method (burn the hubs, and slash the remaining graph into smaller connected components). Our view point has several advantages: (a) it avoids the `no good cuts´ problem, (b) it gives better compression, and (c) it leads to faster execution times for matrix-vector operations, which are the back-bone of most graph processing tools. Experimental results show that our Slash Burn method consistently outperforms other methods on all datasets, giving good compression and faster running time.
  • Keywords
    data compression; data mining; graph theory; block-diagonal mental image; caveman community; cavemen graph; graph compression; graph mining; slashburn method; Algorithm design and analysis; Communities; Complexity theory; Cost function; Equations; Matrix decomposition; Vectors; Graph Compression; Graph Mining; Hubs and Spokes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.26
  • Filename
    6137234