DocumentCode
1664283
Title
Phylogenetic tree inference on PC architectures with AxML/PAxML
Author
Stamatakis, Alexandros P. ; Ludwig, Thomas
Author_Institution
Dept. of Comput. Sci., Tech. Univ. Munich, Germany
fYear
2003
Abstract
Inference of phylogenetic trees comprising hundreds or even thousands of organisms based on the maximum likelihood method is computationally extremely expensive. In previous work, we introduced subtree equality vectors (SEV) to significantly reduce the number of floating point operations required during topology evaluation and implemented this method in (P)AxML, a derivative of (parallel) fastDNAml. Experimental results show that (P)AxML scales particularly well on inexpensive PC-processor architectures, obtaining global run time accelerations between 51% and 65% over (parallel) fastDNAml for large data sets while rendering exactly the same output. In this paper, we present an additional SEV-based algorithmic optimization which scales well on PC processors and leads to a further improvement of global execution times of 14% to 19% compared to the initial version of AxML. Furthermore, we present novel distance-based heuristics for reducing the number of analyzed tree topologies, which further accelerate the program by 4% to 8%. Finally, we discuss a novel experimental tree-building algorithm and potential heuristic solutions for inferring large high-quality trees, which in some initial tests rendered better trees and at the same time accelerated program execution by a factor greater than 6.
Keywords
DNA; biology computing; genetics; optimisation; parallel algorithms; topology; tree searching; very large databases; AxML/PAxML; PC architectures; SEV-based algorithmic optimization; distance-based heuristics; heuristic solutions; large data sets; large high quality trees; parallel fastDNAml; phylogenetic tree inference; run time accelerations; subtree equality vectors; tree topologies; tree-building algorithm; Acceleration; Algorithm design and analysis; Computer architecture; Computer science; Heuristic algorithms; Organisms; Phylogeny; Supercomputers; Testing; Topology;
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
ISSN
1530-2075
Print_ISBN
0-7695-1926-1
Type
conf
DOI
10.1109/IPDPS.2003.1213296
Filename
1213296