Title :
On-Chip Network-Enabled Multicore Platforms Targeting Maximum Likelihood Phylogeny Reconstruction
Author :
Majumder, Turbo ; Borgens, Michael Edward ; Pande, Partha Pratim ; Kalyanaraman, Ananth
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Washington State Univ., Pullman, WA, USA
fDate :
7/1/2012 12:00:00 AM
Abstract :
In phylogenetic inference, which aims at finding a phylogenetic tree that best explains the evolutionary relationship among a given set of species, statistical estimation approaches such as maximum likelihood (ML) and Bayesian inference provide more accurate estimates than other nonstatistical approaches. However, the improved quality comes at a higher computational cost, as these approaches, even though heuristic driven, involve optimization over multidimensional real continuous space. The number of possible search trees in ML is at least exponential, thereby making runtimes on even modest-sized datasets to clock up to several million CPU hours. Evaluation of these trees, involving node-level likelihood vector computation and branch-length optimization, can be partitioned into tasks (or kernels), providing the application with the potential to benefit from hardware acceleration. The range of hardware acceleration architectures tried so far offer limited degree of fine-grain parallelism. Network-on-chip (NoC) is an emerging paradigm that can efficiently support integration of massive number of cores on a chip. In this paper, we explore the design and performance evaluation of 2-D and 3-D NoC architectures for RAxML, which is one of the most widely used ML software suites. Specifically, we implement the computation kernels of the top three functions consuming more than 85% of the total software runtime. Simulations show that through appropriate choice of NoC architecture, and novel core design, allocation and placement strategies, our NoC-based implementation can achieve individual function-level speedups of 390x to 847x, speed up the targeted kernels in excess of 6500x, and provide end-to-end runtime reductions up to 5x over state-of-the-art multithreaded software.
Keywords :
bioinformatics; inference mechanisms; multi-threading; multiprocessing systems; network-on-chip; optimisation; search problems; Bayesian inference; CPU hours; ML software suites; NoC architectures; RAxML; branch-length optimization; end-to-end runtime reductions; evolutionary relationship; maximum likelihood phylogeny reconstruction; multidimensional real continuous space; network-on-chip; node-level likelihood vector computation; on-chip network-enabled multicore platforms; phylogenetic inference; phylogenetic tree; search trees; state-of-the-art multithreaded software; statistical estimation approaches; Hardware; Kernel; Multicore processing; Phylogeny; Resource management; Switches; Hardware accelerator; multicore; network-on-chip (NoC); phylogeny reconstruction;
Journal_Title :
Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on
DOI :
10.1109/TCAD.2012.2188401