DocumentCode :
1042732
Title :
Quartets MaxCut: A Divide and Conquer Quartets Algorithm
Author :
Snir, Sagi ; Rao, Satish
Author_Institution :
Inst. of Evolution, Univ. of Haifa, Haifa, Israel
Volume :
7
Issue :
4
fYear :
2010
Firstpage :
704
Lastpage :
718
Abstract :
Accurate phylogenetic reconstruction methods are currently limited to a maximum of few dozens of taxa. Supertree methods construct a large tree over a large set of taxa, from a set of small trees over overlapping subsets of the complete taxa set. Hence, in order to construct the tree of life over a million and a half different species, the use of a supertree method over the product of accurate methods, is inevitable. Perhaps the simplest version of this task that is still widely applicable, yet quite challenging, is quartet-based reconstruction. This problem lies at the root of many tree reconstruction methods and theoretical as well as experimental results have been reported. Nevertheless, dealing with false, conflicting quartet trees remains problematic. In this paper, we describe an algorithm for constructing a tree from a set of input quartet trees even with a significant fraction of errors. We show empirically that conflicts in the inputs are handled satisfactorily and that it significantly outperforms and outraces the Matrix Representation with Parsimony (MRP) methods that have previously been most successful in dealing with supertrees. Our algorithm is based on a divide and conquer algorithm where our divide step uses a semidefinite programming (SDP) formulation of MaxCut. We remark that this builds on previous work of ours [29] for piecing together trees from rooted triplet trees. The recursion for unrooted quartets, however, is more complicated in that even with completely consistent set of quartet trees the problem is NP-hard, as opposed to the problem for triples where there is a linear time algorithm. This complexity leads to several issues and some solutions of possible independent interest.
Keywords :
biology computing; divide and conquer methods; evolution (biological); genetics; mathematical programming; set theory; trees (mathematics); NP-hard problem; divide and conquer quartet algorithm; linear time algorithm; matrix representation with Parsimony method; phylogenetic reconstruction method; quartet maxcut; quartet tree; semidefinite programming formulation; subset; supertree method; taxa set; tree reconstruction method; unrooted quartet; Bioinformatics; Computational biology; Computer science; Evolution (biology); History; Materials requirements planning; Phylogeny; Reconstruction algorithms; Sequences; Topology; MaxCut; Phylogenetic reconstruction; quartets; supertree.; Algorithms; Base Sequence; Phylogeny;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2008.133
Filename :
4721363
Link To Document :
بازگشت