Title :
Fast CPU-based DNA exact sequence aligner
Author :
Arbabi, Aryan ; Gholami, Milad ; Varmazyar, Mojtaba ; Daneshpajouh, Shervin
Author_Institution :
Sch. of Comput. Sci., Inst. for Res. in Fundamental Sci. (IPM), Tehran, Iran
Abstract :
Fast alignment of small sequences to a very large sequence has recently been under attention of many researchers, due to the applications in processing the biological sequences and specially mapping the short reads of Next Generation Sequencing to an already assembled reference genome. The MEMOCODE 2012 contest was aimed to design a very efficient exact DNA sequence matching method to map a huge number of short subsequences of the human genome and to optimize the performance of design: the running time and the cost of the required hardware platform. We designed and implemented two separate solutions, one based on Burrows Wheeler Transform (BWT) with some optimizations to the algorithm that improve its speed significantly, the other one which won the normalized section of the contest uses a hash indexing method.
Keywords :
DNA; bioinformatics; data compression; file organisation; genomics; indexing; sequences; BWT; Burrows Wheeler transform; CPU-based DNA exact sequence aligner; DNA sequence matching method; MEMOCODE 2012 contest; biological sequences; hash indexing method; human genome; next generation sequencing; reference genome; small sequence alignment; very large sequence; Bioinformatics; DNA; Genomics; Humans; Memory management; Optimization; Random access memory;
Conference_Titel :
Formal Methods and Models for Codesign (MEMOCODE), 2012 10th IEEE/ACM International Conference on
Conference_Location :
Arlington, VA
Print_ISBN :
978-1-4673-1314-8
DOI :
10.1109/MEMCOD.2012.6292305