DocumentCode
611063
Title
Optimizing Burrows-Wheeler Transform-Based Sequence Alignment on Multicore Architectures
Author
Jing Zhang ; Heshan Lin ; Balaji, Pavan ; Wu-Chun Feng
fYear
2013
fDate
13-16 May 2013
Firstpage
377
Lastpage
384
Abstract
Computational biology sequence alignment tools using the Burrows-Wheeler Transform (BWT) are widely used in next-generation sequencing (NGS) analysis. However, despite extensive optimization efforts, the performance of these tools still cannot keep up with the explosive growth of sequencing data. Through an in-depth performance analysis of BWA, a popular BWT-based aligner on multicore architectures, we demonstrate that such tools are limited by memory bandwidth due to their irregular memory access patterns. We then propose a locality-aware implementation of BWA that aims at optimizing its performance by better exploiting the caching mechanisms of modern multicore processors. Experimental results show that our improved BWA implementation can reduce last-level cache (LLC) misses by 30% and translation look aside buffer (TLB) misses by 20%, resulting in up to 2.6-fold speedup over the original BWA implementation.
Keywords
biology computing; cache storage; microprocessor chips; multiprocessing systems; optimisation; transforms; BWT-based aligner; Burrows-Wheeler transform-based sequence alignment optimization; LLC; NGS; TLB; caching mechanisms; computational biology sequence alignment tools; irregular memory access patterns; last-level cache misses; multicore architectures; multicore processors; next-generation sequencing analysis; translation lookaside buffer misses; Algorithm design and analysis; Bioinformatics; Data structures; Genomics; Multicore processing; Program processors; Sequential analysis; Burrows-Wheeler transform; FM-index; bioinformatics; data locality; multicore; short-read mapping;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on
Conference_Location
Delft
Print_ISBN
978-1-4673-6465-2
Type
conf
DOI
10.1109/CCGrid.2013.67
Filename
6546115
Link To Document