• DocumentCode
    1241387
  • Title

    Robustness of Topological Supertree Methods for Reconciling Dense Incompatible Data

  • Author

    Willson, Stephen J.

  • Author_Institution
    Dept. of Math., Iowa State Univ., Ames, IA
  • Volume
    6
  • Issue
    1
  • fYear
    2009
  • Firstpage
    62
  • Lastpage
    75
  • Abstract
    Given a collection of rooted phylogenetic trees with overlapping sets of leaves, a compatible supertree S is a single tree whose set of leaves is the union of the input sets of leaves and such that S agrees with each input tree when restricted to the leaves of the input tree. Typically with trees from real data, no compatible supertree exists, and various methods may be utilized to reconcile the incompatibilities in the input trees. This paper focuses on a measure of robustness of a supertree method called its "radius" R. The larger the value of R is, the further the data set can be from a natural correct tree T and yet the method will still output T. It is shown that the maximal possible radius for a method is R = 1/2. Many familiar methods, both for supertrees and consensus trees, are shown to have R = 0, indicating that they need not output a tree T that would seem to be the natural correct answer. A polynomial-time method Normalized Triplet Supertree (NTS) with the maximal possible R = 1/2 is defined. A geometric interpretion is given, and NTS is shown to solve an optimization problem. Additional properties of NTS are described.
  • Keywords
    bioinformatics; genetics; topology; trees (mathematics); dense incompatible data; graph algorithm; input tree; natural correct tree; polynomial-time method normalized triplet supertree; rooted phylogenetic trees; rooted triple; topological supertree methods; Graph algorithms; Trees; phylogenetic tree; supertree; Algorithms; Computational Biology; Data Interpretation, Statistical; Databases, Genetic; Humans; Mathematical Concepts; Phylogeny; Sequence Analysis, DNA;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.51
  • Filename
    4538208