• DocumentCode
    2080366
  • Title

    Discovery-driven graph summarization

  • Author

    Zhang, Ning ; Tian, Yuanyuan ; Patel, Jignesh M.

  • Author_Institution
    Comput. Sci. Dept., Univ. of Wisconsin-Madison, Madison, WI, USA
  • fYear
    2010
  • fDate
    1-6 March 2010
  • Firstpage
    880
  • Lastpage
    891
  • Abstract
    Large graph datasets are ubiquitous in many domains, including social networking and biology. Graph summarization techniques are crucial in such domains as they can assist in uncovering useful insights about the patterns hidden in the underlying data. One important type of graph summarization is to produce small and informative summaries based on user-selected node attributes and relationships, and allowing users to interactively drill-down or roll-up to navigate through summaries with different resolutions. However, two key components are missing from the previous work in this area that limit the use of this method in practice. First, the previous work only deals with categorical node attributes. Consequently, users have to manually bucketize numerical attributes based on domain knowledge, which is not always possible. Moreover, users often have to manually iterate through many resolutions of summaries to identify the most interesting ones. This paper addresses both these key issues to make the interactive graph summarization approach more useful in practice. We first present a method to automatically categorize numerical attributes values by exploiting the domain knowledge hidden inside the node attributes values and graph link structures. Furthermore, we propose an interestingness measure for graph summaries to point users to the potentially most insightful summaries. Using two real datasets, we demonstrate the effectiveness and efficiency of our techniques.
  • Keywords
    data handling; data mining; graph theory; pattern classification; categorical node attribute; graph link structures; graph summarization; Biology computing; Computer networks; Data mining; Databases; Facebook; Humans; Navigation; Pervasive computing; Social network services; Usability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2010 IEEE 26th International Conference on
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    978-1-4244-5445-7
  • Electronic_ISBN
    978-1-4244-5444-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2010.5447830
  • Filename
    5447830