DocumentCode :
167575
Title :
Extracting Maximal Exact Matches on GPU
Author :
Abu-Doleh, Anas ; Kaya, Kamer ; Abouelhoda, Mohamed ; Catalyurek, Umit V.
Author_Institution :
Dept. of Biomed. Inf., Ohio State Univ., Columbus, OH, USA
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
1417
Lastpage :
1426
Abstract :
The revolution in high-throughput sequencing technologies has accelerated the discovery and extraction of various genomic sequences. However, the massive size of the generated datasets raises several computational problems. For example, aligning the sequences or finding similar regions in them, which is one of the crucial steps in many bioinformatics pipelines, is a time-consuming task. Maximal exact matches have been considered important for detecting and evaluating similarity. Most existing tools for finding maximal matches are based on advanced index structures such as suffix trees or suffix arrays. Although these structures triggered the development of efficient search algorithms, they need large indexing tables, which yield a large memory footprint for the software using them and bring significant overhead. In this article, we introduce a novel tool, GPUMEM, which effectively utilizes massively parallel GPU threads to find maximal exact matches between two genome sequences using a lightweight indexing structure. The index construction, which is also handled on the GPU, is so fast that, even when the index generation time is included, GPUMEM can be faster in practice than a state-of-the-art tool that uses a pre-built index.
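To make the notion of a maximal exact match concrete, the following is a minimal sequential sketch in Python. It is not GPUMEM's algorithm or its index layout, which this record does not describe. It assumes a plain hash table of fixed-length seeds as a stand-in for a "lightweight index", and the names `find_mems`, `ref`, `query`, and `min_len` are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def find_mems(ref, query, min_len):
    """Return sorted (ref_pos, query_pos, length) triples for every maximal
    exact match of length >= min_len between ref and query.

    Illustrative only: a simple seed table built on the CPU, not the
    GPU index used by the tool described in the abstract.
    """
    if min_len < 1:
        raise ValueError("min_len must be at least 1")

    # Index every substring of length min_len in the reference.
    index = defaultdict(list)
    for i in range(len(ref) - min_len + 1):
        index[ref[i:i + min_len]].append(i)

    mems = []
    for j in range(len(query) - min_len + 1):
        for i in index.get(query[j:j + min_len], ()):
            # Skip seeds that can be extended to the left: the same match
            # is reported from its leftmost seed instead.
            if i > 0 and j > 0 and ref[i - 1] == query[j - 1]:
                continue
            # Extend to the right until a mismatch or the end of a sequence.
            length = min_len
            while (i + length < len(ref) and j + length < len(query)
                   and ref[i + length] == query[j + length]):
                length += 1
            mems.append((i, j, length))
    return sorted(mems)
```

For example, `find_mems("ACGTACGT", "TACGTT", 3)` returns `[(0, 1, 4), (3, 0, 5)]`: the match `ACGT` at reference position 0 and query position 1, and the match `TACGT` at reference position 3 and query position 0. Neither can be extended in either direction without hitting a mismatch or the end of a sequence.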
Keywords :
biology computing; database indexing; genomics; graphics processing units; multi-threading; storage management; GPUMEM; generated datasets; genome sequences; genomic sequences; high-throughput sequencing technologies; index construction; index generation time; index structures; lightweight indexing structure; massively parallel GPU threads; maximal exact matches; memory footprint; Arrays; Bioinformatics; Genomics; Graphics processing units; Indexing; Instruction sets; GPUs; indexing; maximal exact matches; parallel programming;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
Type :
conf
DOI :
10.1109/IPDPSW.2014.159
Filename :
6969544