• DocumentCode
    167339
  • Title

    Efficient Computation of the Phylogenetic Likelihood Function  on the Intel MIC Architecture

  • Author

    Kozlov, A.M. ; Goll, C. ; Stamatakis, A.

  • Author_Institution
    Sci. Comput. Group, Heidelberg Inst. for Theor. Studies, Heidelberg, Germany
  • fYear
    2014
  • fDate
    19-23 May 2014
  • Firstpage
    518
  • Lastpage
    527
  • Abstract
    Phylogenetic inference is the process of reconstructing the evolutionary history of species based on their traits, nowadays mostly using molecular sequence data. Current state-of-the-art inference methods, like Bayesian and Maximum Likelihood (ML) inference, rely on the Phylogenetic Likelihood Function (PLF) as their computational core. Due to the large number of floating-point operations involved, the PLF evaluation is the major bottleneck for large-scale phylogenetic analyses comprising thousands of genes or even whole genomes. Here, we describe an optimized implementation of the PLF kernel for the novel Intel Many Integrated Core (MIC) architecture. Using a MIC-based accelerator (Xeon Phi 5110P), we were able to achieve speedups ranging from 1.9× to 2.8× for different PLF kernels, compared to a highly optimized AVX implementation running on dual-socket Xeon E5-2680 system. By integrating the optimized PLF into the phylogenetic inference program RAxML-Light, we reduced the overall execution times by up to factor of two. To assess the scalability on multiple Xeon Phi cards, we also developed a hybrid MPIOpenMP version of the ExaML code. When ExaML is executed on two coprocessors on the same node, we obtain speedups of up to a factor of 3.7 (vs. a CPU baseline) and 1.8 (vs. a single MIC). As expected, speedups increase with growing dataset size and become stable for alignments that require processing 1-2 million sites per MIC card.
  • Keywords
    biocomputing; coprocessors; floating point arithmetic; message passing; multiprocessing systems; parallel processing; ExaML code; Intel MIC architecture; Intel many integrated core architecture; MIC-based accelerator; MPIOpenMP version; PLF kernel; RAxML-Light phylogenetic inference program; Xeon Phi 5110P; coprocessors; floating-point operations; large-scale phylogenetic analyses; molecular sequence data; phylogenetic likelihood function; Coprocessors; Hardware; Kernel; Microwave integrated circuits; Phylogeny; Prefetching; Vectors; bioinformatics; phylogenetics; maximum likelihood; Intel MIC; parallel processing; MPI;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
  • Conference_Location
    Phoenix, AZ
  • Print_ISBN
    978-1-4799-4117-9
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2014.198
  • Filename
    6969431