DocumentCode :
2048314
Title :
Phylogenetic models of rate heterogeneity: a high performance computing perspective
Author :
Stamatakis, Alexandros
Author_Institution :
Inst. of Comput. Sci., Found. for Res. & Technol.-Hellas, Heraklion
fYear :
2006
fDate :
25-29 April 2006
Abstract :
Inference of phylogenetic trees using the maximum likelihood (ML) method is NP-hard. Furthermore, the computation of the likelihood function for huge trees of more than 1,000 organisms is computationally intensive due to a large amount of floating point operations and high memory consumption. Within this context, the present paper compares two competing mathematical models that account for evolutionary rate heterogeneity: the Gamma and CAT models. The intention of this paper is to show that - from a purely empirical point of view - CAT can be used instead of Gamma. The main advantage of CAT over Gamma consists in significantly lower memory consumption and faster inference times. An experimental study using RAxML has been performed on 19 real-world datasets comprising 73 up to 1,663 DNA sequences. Results show that CAT is on average 5.5 times faster than Gamma and - surprisingly enough - also yields trees with slightly superior Gamma likelihood values. The usage of the CAT model decreases the amount of average L2 and L3 cache misses by factor 8.55
Keywords :
biocomputing; biology computing; computational complexity; genetics; maximum likelihood estimation; trees (mathematics); CAT models; DNA sequences; NP-hard problem; RAxML; floating point operations; high memory consumption; high performance computing perspective; maximum likelihood; phylogenetic trees; rate heterogeneity; Biology computing; Context modeling; DNA; High performance computing; Inference algorithms; Mathematical model; Organisms; Phylogeny; Sequences; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Conference_Location :
Rhodes Island
Print_ISBN :
1-4244-0054-6
Type :
conf
DOI :
10.1109/IPDPS.2006.1639535
Filename :
1639535
Link To Document :
بازگشت