DocumentCode
2210111
Title
Bonsai: Growing Interesting Small Trees
Author
Seufert, Stephan ; Bedathur, Srikanta ; Mestre, Julian ; Weikum, Gerhard
Author_Institution
Max-Planck-Inst. for Inf., Saarbrücken, Germany
fYear
2010
fDate
13-17 Dec. 2010
Firstpage
1013
Lastpage
1018
Abstract
Graphs are increasingly used to model a variety of loosely structured data such as biological or social networks and entity-relationships. Given this profusion of large-scale graph data, efficiently discovering interesting substructures buried within is essential. These substructures are typically used in determining subsequent actions, such as conducting visual analytics by humans or designing expensive biomedical experiments. In such settings, it is often desirable to constrain the size of the discovered results in order to directly control the associated costs. In this paper, we address the problem of finding cardinality-constrained connected sub trees in large node-weighted graphs that maximize the sum of weights of selected nodes. We provide an efficient constant-factor approximation algorithm for this strongly NP-hard problem. Our techniques can be applied in a wide variety of application settings, for example in differential analysis of graphs, a problem that frequently arises in bioinformatics but also has applications on the web.
Keywords
approximation theory; bioinformatics; optimisation; tree data structures; NP-hard problem; approximation algorithm; bioinformatics; cardinality-constrained connected subtrees; data structure; differential analysis; grpahs; Cardinalty Constrained Subgraphs; Graph Mining; Subgraph Discovery; Visual Analytics;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-4786
Print_ISBN
978-1-4244-9131-5
Electronic_ISBN
1550-4786
Type
conf
DOI
10.1109/ICDM.2010.86
Filename
5694077
Link To Document