Title :
A Parallel Rectangle Intersection Algorithm on GPU+CPU
Author :
Lo, Shih-Hsiang ; Lee, Che-Rung ; Chung, Yeh-Ching ; Chung, I-Hsin
Author_Institution :
Comput. Sci. Dept., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Abstract :
In this paper, we investigate efficient algorithms and implementations using GPU plus CPU to solve the rectangle intersection problem on a plane. The problem is to report all intersecting pairs of iso-oriented rectangles, whose parallelization on GPUs poses two major computational challenges: data partition and the massive output. The algorithm we presented is called PRI-GC, Parallel Rectangle Intersection algorithm on GPU+CPU, which consists of two phases: mapping and intersection-checking. In the mapping phase, rectangles are hashed into different subspaces (called cells) to reduce the unnecessary intersection checking for far-apart rectangles. In the intersection-checking phase, pairs of rectangles within the same cell are examined in parallel, and the intersecting pairs of rectangles are reported. Several optimization techniques, including rectangles re-ordering, output data compressing/encoding, and the execution overlapping of GPU and CPU, are applied to enhance the performance. We had evaluated the performance of PRI-GC and the result shows over 30x speedup against two well-implemented sequential algorithms on single CPU. The effectiveness of each optimization technique for this problem was evaluated as well. Several parameters, including different degrees of rectangle coverage, different block sizes, and different cell sizes, were also experimented to explore their influences on the performance of PRI-GC.
Keywords :
computer graphic equipment; coprocessors; optimisation; GPU plus CPU; PRI-GC; data compressing-encoding; data partition; execution overlapping; graphics processing units; intersection-checking phase; iso-oriented rectangles; mapping phase; massive output; optimization techniques; parallel rectangle intersection algorithm; rectangles reordering; sequential algorithms; Algorithm design and analysis; Arrays; Graphics processing unit; Heuristic algorithms; Instruction sets; Kernel; Partitioning algorithms; CUDA; Parallel Algorithms; Rectangle Intersection;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2011 11th IEEE/ACM International Symposium on
Conference_Location :
Newport Beach, CA
Print_ISBN :
978-1-4577-0129-0
Electronic_ISBN :
978-0-7695-4395-6
DOI :
10.1109/CCGrid.2011.13