• DocumentCode
    821986
  • Title

    Maximum Parsimony for Tree Mixtures

  • Author

    Grünewald, Stefan ; Moulton, Vincent

  • Author_Institution
    Dept. of Combinatorics & Geometry, Chinese Acad. of Sci., Shanghai
  • Volume
    6
  • Issue
    1
  • fYear
    2009
  • Firstpage
    97
  • Lastpage
    102
  • Abstract
    With the number of sequenced genomes growing ever larger, it is now common practice to concatenate sequence alignments from several genomic loci as a first step to phylogenetic tree inference. However, as different loci may support different trees due to processes such as gene duplication and lineage sorting, it is important to better understand how commonly used phylogenetic inference methods behave on such "phylogenetic mixtures". Here we shall focus on how parsimony, one of the most popular methods for reconstructing phylogenetic trees, behaves for mixtures of two trees. In particular, we show that (i) the parsimony problem is NP-complete for mixtures of two trees, (ii) there are mixtures of two trees that have exponentially many (in the number of leaves) most parsimonious trees, and (iii) give an explicit description of the most parsimonious tree(s) and scores corresponding to the mixture of a pair of trees related by a single TBR operation.
  • Keywords
    bioinformatics; computational complexity; genetics; genomics; trees (mathematics); NP-complete problems; TBR operation; concatenate sequence alignments; gene duplication; genomic loci; lineage sorting; linear Steiner tree; maximum parsimony; median network; phylogenetic tree inference; phylogenetic tree mixture; sequenced genomes; Binary sequences; Bioinformatics; Genomics; Labeling; Phylogeny; Solid modeling; Sorting; Applications; Biology and genetics; Computational Biology; Databases, Genetic; Genomics; Models, Genetic; Phylogeny; Sequence Alignment; Sequence Analysis, DNA;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.75
  • Filename
    4585356