DocumentCode :
2721986
Title :
CUDA acceleration of P7Viterbi algorithm in HMMER 3.0
Author :
Quirem, Saddam ; Ahmed, Fahian ; Lee, Byeong Kil
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at San Antonio, San Antonio, TX, USA
fYear :
2011
fDate :
17-19 Nov. 2011
Firstpage :
1
Lastpage :
2
Abstract :
Dynamic programming matrices and the P7Viterbi algorithm of HMMER 3.0 show high parallelism in its code. Within the code, every query can have its score calculated in parallel with one thread per query. In this paper, these parallel features were exploited through the use of CUDA and a GPGPU. The CUDA implementation of this algorithm being performed on the Tesla C1060 enabled a 10-15x speedup depending on the number of queries. Without concurrent kernel execution and memory transfers a speedup of over 4x was achieved in terms of the total execution time. With a wide range of data sizes where the CPU has greater performance, it would be important that CUDA enabled programs properly select when to and not utilize the GPU for acceleration.
Keywords :
dynamic programming; graphics processing units; parallel architectures; query processing; CUDA acceleration; CUDA implementation; GPGPU; HMMER 3.0; P7Viterbi algorithm; Tesla C1060; dynamic programming matrices; kernel execution; parallel features; Acceleration; Graphics processing unit; Hidden Markov models; Instruction sets; Kernel; Random access memory; CUDA; Database; GPGPU; HMM; Viterbi Algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Computing and Communications Conference (IPCCC), 2011 IEEE 30th International
Conference_Location :
Orlando, FL
ISSN :
1097-2641
Print_ISBN :
978-1-4673-0010-0
Type :
conf
DOI :
10.1109/PCCC.2011.6108104
Filename :
6108104
Link To Document :
بازگشت