Title :
Load Balance in the Phylogenetic Likelihood Kernel
Author :
Stamatakis, Alexandros ; Ott, Michael
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. Munchen, Garching, Germany
Abstract :
Recent advances in DNA sequencing techniques have led to an unprecedented accumulation and availability of molecular sequence data that needs to be analyzed. This data explosion in combination with the multi-core revolution also affects the computational kernels for phylogenetic inference (reconstruction of evolutionary trees from molecular sequence data) under the widely-used Maximum Likelihood (ML) model. At present, analyses of so called multi-gene or phylogenomic alignments, i.e., input data sets that comprise concatenated sequence data of several genes, are becoming increasingly popular. Usually such multi-gene analyses are partitioned, i.e., a separate set of likelihood model parameters is estimated for each gene/partition. While the phylogenetic likelihood function exhibits intrinsic fine-grained parallelism, the parallel computation of the likelihood function in such partitioned multigene analyses can lead to significant load-balance problems. Here, we describe these problems for the first time, discuss the implications on the design of "classic" ML-based as well as Bayesian search algorithms, and provide an initial solution that yields up to eight-fold improvements in speedup values on AMD Barcelona and Sun x4600 16-core systems for realistic application scenarios.
Keywords :
Bayes methods; DNA; biology computing; data analysis; evolutionary computation; genetics; maximum likelihood estimation; parallel processing; search problems; trees (mathematics); Bayesian search algorithm; DNA sequencing; computational kernels; data explosion; evolutionary tree reconstruction; likelihood function; load balancing; maximum likelihood model; molecular sequence data; multicore revolution; multigene analysis; parallel computation; phylogenetic inference; phylogenetic likelihood kernel; phylogenomic alignment analysis; Availability; Concatenated codes; DNA; Explosions; Kernel; Maximum likelihood estimation; Parallel processing; Parameter estimation; Phylogeny; Sequences; Load Imbalance; PKL; Parallelization; Phylogenetic Likelihood Function;
Conference_Titel :
Parallel Processing, 2009. ICPP '09. International Conference on
Conference_Location :
Vienna
Print_ISBN :
978-1-4244-4961-3
Electronic_ISBN :
0190-3918
DOI :
10.1109/ICPP.2009.29