• DocumentCode
    2210111
  • Title

    Bonsai: Growing Interesting Small Trees

  • Author

    Seufert, Stephan ; Bedathur, Srikanta ; Mestre, Julian ; Weikum, Gerhard

  • Author_Institution
    Max-Planck-Inst. for Inf., Saarbrücken, Germany
  • fYear
    2010
  • fDate
    13-17 Dec. 2010
  • Firstpage
    1013
  • Lastpage
    1018
  • Abstract
    Graphs are increasingly used to model a variety of loosely structured data such as biological or social networks and entity-relationships. Given this profusion of large-scale graph data, efficiently discovering interesting substructures buried within is essential. These substructures are typically used in determining subsequent actions, such as conducting visual analytics by humans or designing expensive biomedical experiments. In such settings, it is often desirable to constrain the size of the discovered results in order to directly control the associated costs. In this paper, we address the problem of finding cardinality-constrained connected sub trees in large node-weighted graphs that maximize the sum of weights of selected nodes. We provide an efficient constant-factor approximation algorithm for this strongly NP-hard problem. Our techniques can be applied in a wide variety of application settings, for example in differential analysis of graphs, a problem that frequently arises in bioinformatics but also has applications on the web.
  • Keywords
    approximation theory; bioinformatics; optimisation; tree data structures; NP-hard problem; approximation algorithm; bioinformatics; cardinality-constrained connected subtrees; data structure; differential analysis; grpahs; Cardinalty Constrained Subgraphs; Graph Mining; Subgraph Discovery; Visual Analytics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2010 IEEE 10th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-9131-5
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2010.86
  • Filename
    5694077