DocumentCode
167339
Title
Efficient Computation of the Phylogenetic Likelihood Function on the Intel MIC Architecture
Author
Kozlov, A.M. ; Goll, C. ; Stamatakis, A.
Author_Institution
Sci. Comput. Group, Heidelberg Inst. for Theor. Studies, Heidelberg, Germany
fYear
2014
fDate
19-23 May 2014
Firstpage
518
Lastpage
527
Abstract
Phylogenetic inference is the process of reconstructing the evolutionary history of species based on their traits, nowadays mostly using molecular sequence data. Current state-of-the-art inference methods, like Bayesian and Maximum Likelihood (ML) inference, rely on the Phylogenetic Likelihood Function (PLF) as their computational core. Due to the large number of floating-point operations involved, the PLF evaluation is the major bottleneck for large-scale phylogenetic analyses comprising thousands of genes or even whole genomes. Here, we describe an optimized implementation of the PLF kernel for the novel Intel Many Integrated Core (MIC) architecture. Using a MIC-based accelerator (Xeon Phi 5110P), we were able to achieve speedups ranging from 1.9× to 2.8× for different PLF kernels, compared to a highly optimized AVX implementation running on dual-socket Xeon E5-2680 system. By integrating the optimized PLF into the phylogenetic inference program RAxML-Light, we reduced the overall execution times by up to factor of two. To assess the scalability on multiple Xeon Phi cards, we also developed a hybrid MPIOpenMP version of the ExaML code. When ExaML is executed on two coprocessors on the same node, we obtain speedups of up to a factor of 3.7 (vs. a CPU baseline) and 1.8 (vs. a single MIC). As expected, speedups increase with growing dataset size and become stable for alignments that require processing 1-2 million sites per MIC card.
Keywords
biocomputing; coprocessors; floating point arithmetic; message passing; multiprocessing systems; parallel processing; ExaML code; Intel MIC architecture; Intel many integrated core architecture; MIC-based accelerator; MPIOpenMP version; PLF kernel; RAxML-Light phylogenetic inference program; Xeon Phi 5110P; coprocessors; floating-point operations; large-scale phylogenetic analyses; molecular sequence data; phylogenetic likelihood function; Coprocessors; Hardware; Kernel; Microwave integrated circuits; Phylogeny; Prefetching; Vectors; bioinformatics; phylogenetics; maximum likelihood; Intel MIC; parallel processing; MPI;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location
Phoenix, AZ
Print_ISBN
978-1-4799-4117-9
Type
conf
DOI
10.1109/IPDPSW.2014.198
Filename
6969431
Link To Document