DocumentCode :
2800927
Title :
Load Balance in the Phylogenetic Likelihood Kernel
Author :
Stamatakis, Alexandros ; Ott, Michael
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. Munchen, Garching, Germany
fYear :
2009
fDate :
22-25 Sept. 2009
Firstpage :
348
Lastpage :
355
Abstract :
Recent advances in DNA sequencing techniques have led to an unprecedented accumulation and availability of molecular sequence data that needs to be analyzed. This data explosion in combination with the multi-core revolution also affects the computational kernels for phylogenetic inference (reconstruction of evolutionary trees from molecular sequence data) under the widely-used Maximum Likelihood (ML) model. At present, analyses of so called multi-gene or phylogenomic alignments, i.e., input data sets that comprise concatenated sequence data of several genes, are becoming increasingly popular. Usually such multi-gene analyses are partitioned, i.e., a separate set of likelihood model parameters is estimated for each gene/partition. While the phylogenetic likelihood function exhibits intrinsic fine-grained parallelism, the parallel computation of the likelihood function in such partitioned multigene analyses can lead to significant load-balance problems. Here, we describe these problems for the first time, discuss the implications on the design of "classic" ML-based as well as Bayesian search algorithms, and provide an initial solution that yields up to eight-fold improvements in speedup values on AMD Barcelona and Sun x4600 16-core systems for realistic application scenarios.
Keywords :
Bayes methods; DNA; biology computing; data analysis; evolutionary computation; genetics; maximum likelihood estimation; parallel processing; search problems; trees (mathematics); Bayesian search algorithm; DNA sequencing; computational kernels; data explosion; evolutionary tree reconstruction; likelihood function; load balancing; maximum likelihood model; molecular sequence data; multicore revolution; multigene analysis; parallel computation; phylogenetic inference; phylogenetic likelihood kernel; phylogenomic alignment analysis; Availability; Concatenated codes; DNA; Explosions; Kernel; Maximum likelihood estimation; Parallel processing; Parameter estimation; Phylogeny; Sequences; Load Imbalance; PKL; Parallelization; Phylogenetic Likelihood Function;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing, 2009. ICPP '09. International Conference on
Conference_Location :
Vienna
ISSN :
0190-3918
Print_ISBN :
978-1-4244-4961-3
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2009.29
Filename :
5362392
Link To Document :
بازگشت