DocumentCode :
166086
Title :
GPU accelerated inexact matching for multiple patterns in DNA sequences
Author :
Rastogi, Priyank ; Guddeti, Ram
Author_Institution :
Dept. of Inf. Technol., Nat. Inst. of Technol., Mangalore, India
fYear :
2014
fDate :
24-27 Sept. 2014
Firstpage :
163
Lastpage :
167
Abstract :
DNA sequencing technology generates millions of patterns on every run of the machine, which poses the challenge of matching these patterns to the reference genome effectively and at high execution speed. The main idea here is inexact matching of patterns with mismatches and gaps (insertions and deletions). In inexact matching, a pattern DNA sequence is matched against the reference with some allowed number of errors; here we consider 2 errors. Errors can be mismatches or gaps. An existing algorithm, SOAP3, performs inexact matching on the GPU with mismatches only and does not consider gaps (insertions and deletions). The General Purpose Graphical Processing Unit (GPGPU) is an effective solution in terms of cost and speed, providing a high degree of parallelism. This paper presents a parallel implementation of multiple-pattern inexact matching against a reference genome using CUDA, based on the BWT. The algorithm incorporates a DFS (Depth First Search) strategy. For matching multiple patterns, each GPGPU thread is given a different pattern, so millions of patterns can be matched using only one CUDA kernel. Since GPU memory is limited, memory management must be handled carefully. Synchronization of multiple threads is provided to prevent illegal access to the shared memory. GPU results are compared with those of CPU execution. Experimental results show that the proposed methodology achieves an average speedup factor of seven over CPU execution.
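The matching scheme described in the abstract can be illustrated with a minimal CPU-side sketch in Python. It is not the authors' CUDA implementation: the function names `build_fm_index` and `inexact_search`, the naive suffix-array construction, and the explicit stack standing in for a per-thread DFS are all assumptions made for illustration. The sketch performs BWT backward search and branches on mismatches, insertions, and deletions until the error budget is spent.

```python
def build_fm_index(text):
    """Build a toy FM-index (suffix array, C table, Occ table) for a short reference."""
    text += "$"  # sentinel, smaller than A, C, G, T
    sa = sorted(range(len(text)), key=lambda i: text[i:])  # naive construction
    bwt = [text[i - 1] for i in sa]
    alphabet = sorted(set(text))
    # C[c]: number of characters in the text that are strictly smaller than c
    C, total = {}, 0
    for c in alphabet:
        C[c] = total
        total += text.count(c)
    # occ[c][i]: number of occurrences of c in bwt[:i]
    occ = {c: [0] * (len(bwt) + 1) for c in alphabet}
    for i, ch in enumerate(bwt):
        for c in alphabet:
            occ[c][i + 1] = occ[c][i] + (1 if c == ch else 0)
    return sa, C, occ


def inexact_search(pattern, sa, C, occ, max_errors=2):
    """Return sorted reference start positions matching pattern within max_errors.

    Errors are mismatches or gaps. The DFS uses an explicit stack, which is
    how a single GPU thread could walk its own search tree (an assumption).
    """
    hits = set()
    # Each state: (pattern index, errors left, half-open suffix-array interval)
    stack = [(len(pattern) - 1, max_errors, 0, len(sa))]
    while stack:
        i, z, lo, hi = stack.pop()
        if i < 0:  # whole pattern consumed: every row in the interval is a hit
            hits.update(sa[lo:hi])
            continue
        if z > 0:
            # Insertion: pattern[i] has no counterpart in the reference
            stack.append((i - 1, z - 1, lo, hi))
        for c in "ACGT":
            if c not in C:
                continue
            nlo = C[c] + occ[c][lo]
            nhi = C[c] + occ[c][hi]
            if nlo >= nhi:
                continue  # c does not extend any current match
            if z > 0:
                # Deletion: the reference has an extra character c
                stack.append((i, z - 1, nlo, nhi))
            if c == pattern[i]:
                stack.append((i - 1, z, nlo, nhi))      # exact step
            elif z > 0:
                stack.append((i - 1, z - 1, nlo, nhi))  # mismatch
    return sorted(hits)


sa, C, occ = build_fm_index("ACGTACGT")
patterns = ["CGT", "CCT", "ACGGT"]
# On a GPU, each pattern would be handled by its own thread; here a loop stands in.
results = {p: inexact_search(p, sa, C, occ, max_errors=1) for p in patterns}
```

In this sketch each pattern is searched independently, which mirrors the one-pattern-per-thread mapping mentioned in the abstract; the index tables are read-only and could be shared by all threads.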
Keywords :
DNA; bioinformatics; graphics processing units; parallel architectures; pattern matching; synchronisation; tree searching; BWT; CUDA kernel; DFS; DNA sequencing technology; GPGPU; GPU accelerated inexact matching; SOAP3; depth first search strategy; general purpose graphical processing unit; genome reference; memory management; multiple pattern inexact matching; multiple thread synchronization; parallel computing; Bioinformatics; DNA; Genomics; Graphics processing units; Instruction sets; Memory management; Pattern matching; GPGPU; Parallel Computing; Pattern Matching;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-3078-4
Type :
conf
DOI :
10.1109/ICACCI.2014.6968404
Filename :
6968404