Title :
Analysis on the complexity of the problem to construct the most parsimonious phylogenetic trees
Author_Institution :
Media Lab., Massachusetts Inst. of Technol., Cambridge, MA, USA
Abstract :
With the availability of ever-increasing gene sequence data across a large number of species, reconstruction of phylogenetic trees to reveal the evolution relationship among those species becomes more and more important. In this paper, we present a novel proof of the NP-completeness of the large parsimony problem by reduction from a newly-proved NP-complete problem to gain additional insight of this fundamental problem in computational biology. We then conduct experiments based upon our recent work of a random tree optimization algorithm based on cross-entropy method for the construction of the most parsimonious phylogenetic trees across 12 Drosophila genomes.
Keywords :
biology; computational complexity; entropy; genomics; optimisation; probability; trees (mathematics); Drosophila genomes; NP-complete problem; NP-completeness proof; complexity analysis; computational biology; cross-entropy method; large parsimony problem; phylogenetic trees reconstruction; random tree optimization algorithm; Binary trees; Bioinformatics; Computational biology; Cost function; Genomics; Iterative algorithms; NP-complete problem; Optimization methods; Phylogeny; Sequences; Complexity; Cross Entropy; NP Completeness; Random Tree Optimization;
Conference_Titel :
Information Sciences and Systems (CISS), 2010 44th Annual Conference on
Conference_Location :
Princeton, NJ
Print_ISBN :
978-1-4244-7416-5
Electronic_ISBN :
978-1-4244-7417-2
DOI :
10.1109/CISS.2010.5464727