Title :
CUDA acceleration of P7Viterbi algorithm in HMMER 3.0
Author :
Quirem, Saddam ; Ahmed, Fahian ; Lee, Byeong Kil
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at San Antonio, San Antonio, TX, USA
Abstract :
Dynamic programming matrices and the P7Viterbi algorithm of HMMER 3.0 show high parallelism in its code. Within the code, every query can have its score calculated in parallel with one thread per query. In this paper, these parallel features were exploited through the use of CUDA and a GPGPU. The CUDA implementation of this algorithm being performed on the Tesla C1060 enabled a 10-15x speedup depending on the number of queries. Without concurrent kernel execution and memory transfers a speedup of over 4x was achieved in terms of the total execution time. With a wide range of data sizes where the CPU has greater performance, it would be important that CUDA enabled programs properly select when to and not utilize the GPU for acceleration.
Keywords :
dynamic programming; graphics processing units; parallel architectures; query processing; CUDA acceleration; CUDA implementation; GPGPU; HMMER 3.0; P7Viterbi algorithm; Tesla C1060; dynamic programming matrices; kernel execution; parallel features; Acceleration; Graphics processing unit; Hidden Markov models; Instruction sets; Kernel; Random access memory; CUDA; Database; GPGPU; HMM; Viterbi Algorithm;
Conference_Titel :
Performance Computing and Communications Conference (IPCCC), 2011 IEEE 30th International
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4673-0010-0
DOI :
10.1109/PCCC.2011.6108104