DocumentCode :
2601524
Title :
SURF cascade face detection acceleration on Sandy Bridge processor
Author :
Li, Eric ; Yang, Liu ; Wang, Bin ; Li, Jianguo ; Peng, Ya-ti
Author_Institution :
Intel Labs. China, Intel Corp, China
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
41
Lastpage :
47
Abstract :
Along with the inclusion of GPU cores within the same CPU die, the performance of Intel´s processor-graphics has been significantly improved over earlier generation of integrated graphics. This paper presents a highly optimized SURF cascade based face detector which efficiently exploits both CPU and GPU computing power on the latest Sandy Bridge processor. The SURF cascade classifier procedure is partitioned into two phases in order to leverage both thread level and data level parallelism in the GPU. The integral image function running in the CPU core can work with the GPU in parallel. We measure the performance and power of the GPU implementation on the latest Sandy Bridge platform. The experimental results show that our proposed GPU implementation achieves a 2.98 speedup and a 1.42 speedup compared to the single thread and multi-thread CPU implementation. At the same time, the power usage can be reduced as much as 50% compared to the CPU implementation. In addition, our proposed method presents a general approach for task partitioning between CPU and GPU, thus being beneficial not only for face detection but also for other computer vision applications.
Keywords :
computer vision; face recognition; graphics processing units; image classification; multiprocessing systems; parallel processing; performance evaluation; power aware computing; CPU die; GPU cores; Intel processor-graphics performance improvement; SURF cascade classifier; Sandy bridge processor; computer vision applications; data level parallelism; highly optimized SURF cascade based face detector; integral image function; power usage reduction; task partitioning; thread level parallelism; Bridges; Face detection; Feature extraction; Graphics processing unit; Instruction sets; Kernel; Parallel processing; CPU and GPU cooperative computation; GPU processing; SURF cascade face detection; Sandy Bridge architecture;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on
Conference_Location :
Providence, RI
ISSN :
2160-7508
Print_ISBN :
978-1-4673-1611-8
Electronic_ISBN :
2160-7508
Type :
conf
DOI :
10.1109/CVPRW.2012.6238893
Filename :
6238893
Link To Document :
بازگشت