DocumentCode :
1826945
Title :
Accelerating Viola-Jones Facce Detection Algorithm on GPUs
Author :
Jia, Haipeng ; Zhang, Yunquan ; Wang, Weiyan ; Xu, Jianliang
Author_Institution :
Lab. of Parallel Software & Comput. Sci., Inst. of Software, Beijing, China
fYear :
2012
fDate :
25-27 June 2012
Firstpage :
396
Lastpage :
403
Abstract :
The Viola-Jones face detection algorithm represents a class of parallel algorithms that both memory accesses and work distributions are irregular, thereby hard to obtain high performance on GPUs. Furthermore, conventional GPU programming wisdom usually guides us on how to optimize data parallel workloads with regular inputs and outputs. While how to efficiently write task-level parallelism programs with irregular workloads have not much material to reference. In this paper, we present an OpenCL-implementation of Viola-Jones face detection algorithm with high performance on both NVIDIA and AMD GPUs through five main techniques: warp size work granularity, persistent threads, Uberkernel, local and global queues. We also demonstrate the high performance of our implementation by comparing it with a well-optimized CPU version from OpenCV library. Experiment results show that the speedup reaches up to 5.193 ~35.08 times (16.91 on average) and 5.85 ~32.641 times (17.535 on average) on AMD and NVIDIA GPU respectively.
Keywords :
face recognition; graphics processing units; parallel algorithms; parallel programming; software libraries; storage management; AMD GPU; GPU programming; NVIDIA GPU; OpenCL-implementation; OpenCV library; Uberkernel; Viola-Jones face detection algorithm; data parallel workloads; global queues; local queues; memory accesses; parallel algorithms; persistent threads; task-level parallelism programs; warp size work granularity; well-optimized CPU version; work distributions; Algorithm design and analysis; Face; Face detection; Graphics processing unit; Instruction sets; Kernel; Parallel processing; Global Queues; Imbalanced Computation; Local Queues; Persistent Threads; Viola-Jones;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
Type :
conf
DOI :
10.1109/HPCC.2012.60
Filename :
6332199
Link To Document :
بازگشت