Fast CPU-based DNA exact sequence aligner

Author

Arbabi, Aryan ; Gholami, Milad ; Varmazyar, Mojtaba ; Daneshpajouh, Shervin

Author_Institution

Sch. of Comput. Sci., Inst. for Res. in Fundamental Sci. (IPM), Tehran, Iran

fYear

2012

fDate

16-17 July 2012

Firstpage

95

Lastpage

98

Abstract

Fast alignment of small sequences to a very large sequence has recently been under attention of many researchers, due to the applications in processing the biological sequences and specially mapping the short reads of Next Generation Sequencing to an already assembled reference genome. The MEMOCODE 2012 contest was aimed to design a very efficient exact DNA sequence matching method to map a huge number of short subsequences of the human genome and to optimize the performance of design: the running time and the cost of the required hardware platform. We designed and implemented two separate solutions, one based on Burrows Wheeler Transform (BWT) with some optimizations to the algorithm that improve its speed significantly, the other one which won the normalized section of the contest uses a hash indexing method.

Keywords

DNA; bioinformatics; data compression; file organisation; genomics; indexing; sequences; BWT; Burrows Wheeler transform; CPU-based DNA exact sequence aligner; DNA sequence matching method; MEMOCODE 2012 contest; biological sequences; hash indexing method; human genome; next generation sequencing; reference genome; small sequence alignment; very large sequence; Bioinformatics; DNA; Genomics; Humans; Memory management; Optimization; Random access memory;

fLanguage

English

Publisher

ieee

Conference_Titel

Formal Methods and Models for Codesign (MEMOCODE), 2012 10th IEEE/ACM International Conference on

Conference_Location

Arlington, VA

Print_ISBN

978-1-4673-1314-8

Type

conf

DOI

10.1109/MEMCOD.2012.6292305

Filename

6292305