Title :
Achieving TeraCUPS on Longest Common Subsequence Problem Using GPGPUs
Author :
Ozsoy, Adnan ; Chauhan, Anamika ; Swany, Martin
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
Abstract :
In this paper, we describe a novel technique to optimize longest common subsequence (LCS) algorithm for one-to-many matching problem on GPUs by transforming the computation into bit-wise operations and a post-processing step. The former can be highly optimized and achieves more than a trillion operations (cell updates) per second (CUPS)-a first for LCS algorithms. The latter is more efficiently done on CPUs, in a fraction of the bit-wise computation time. The bit-wise step promises to be a foundational step and a fundamentally new approach to developing algorithms for increasingly popular heterogeneous environments that could dramatically increase the applicability of hybrid CPU-GPU environments.
Keywords :
graphics processing units; parallel programming; string matching; GPGPU; LCS algorithm optimization; TeraCUPS; bit-wise computation time; bit-wise operations; cell updates; heterogeneous environments; hybrid CPU-GPU environments; longest common subsequence algorithm optimization; one-to-many matching problem; parallel programming; postprocessing step; Computer architecture; Dynamic programming; Equations; Graphics processing units; Instruction sets; Kernel; Parallel processing; CUDA; GPU; Longest Common Subsequence; semi-regular algorithms;
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2013 International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICPADS.2013.22